[Benchmarking] Add disable_shuffle option for dataset loading #26258

ymoslem · 2025-10-05T16:46:02Z

Added disable_shuffle argument to control the dataset shuffling behaviour. The option keeps the dataset in the original order in the result to be able to evaluate the responses against the ground truth.

Currently, the dataset is shuffled, which make the requests in a different order than the original dataset. To make evaluation easier, the shuffling should be optional. This disable_shuffle argument disables data shuffling.

The change was tested with a custom dataset where the input is a *.jsonl file.

The main change is modifying:

random.seed(self.random_seed)
random.shuffle(self.data)

to be:

if not getattr(self, 'disable_shuffle', False):
    random.seed(self.random_seed)
    random.shuffle(self.data)

If the change is merged, the new argument should be added to this documentation page.

Added 'disable_shuffle' argument to control the dataset shuffling behaviour. The option keeps the dataset in the original order in the result to be able to evaluate the responses against the ground truth. Signed-off-by: Yasmin Moslem <[email protected]>

github-actions · 2025-10-05T16:46:12Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors.

You ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

gemini-code-assist

Code Review

This pull request introduces a disable_shuffle option to control dataset shuffling, which is a valuable addition for ensuring deterministic evaluation. The implementation correctly adds the necessary command-line argument and propagates it to the dataset classes. However, I've identified a critical issue regarding reproducibility. In several dataset loading methods, the random.seed() call has been moved inside the conditional shuffling block. This causes the random number generator to be unseeded when shuffling is disabled, leading to non-reproducible behavior for other random operations within the benchmark. I've provided comments with suggestions to fix this.

vllm/benchmarks/datasets.py

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

vllm/benchmarks/datasets.py

random.seed(self.random_seed) if not getattr(self, 'disable_shuffle', False): random.shuffle(self.data) Signed-off-by: Yasmin Moslem <[email protected]>

Signed-off-by: Yasmin Moslem <[email protected]>

ywang96

Looks fine to me!

…roject#26258) Signed-off-by: Yasmin Moslem <[email protected]> Signed-off-by: Karan Goel <[email protected]>

…roject#26258) Signed-off-by: Yasmin Moslem <[email protected]>

…roject#26258) Signed-off-by: Yasmin Moslem <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

Merge branch 'main' into patch-1

b91294a

mergify bot added the performance Performance-related issues label Oct 5, 2025

gemini-code-assist bot reviewed Oct 5, 2025

View reviewed changes

vllm/benchmarks/datasets.py Outdated Show resolved Hide resolved

vllm/benchmarks/datasets.py Outdated Show resolved Hide resolved

vllm/benchmarks/datasets.py Outdated Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Oct 5, 2025

View reviewed changes

vllm/benchmarks/datasets.py Outdated Show resolved Hide resolved

ymoslem added 3 commits October 5, 2025 17:57

Moving random.seed() out of the disable_shuffle condition

11f2d07

random.seed(self.random_seed) if not getattr(self, 'disable_shuffle', False): random.shuffle(self.data) Signed-off-by: Yasmin Moslem <[email protected]>

Merge branch 'main' into patch-1

9648dd9

Ruff formatting

6792570

Signed-off-by: Yasmin Moslem <[email protected]>

ywang96 approved these changes Oct 6, 2025

View reviewed changes

ywang96 added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 6, 2025

Merge branch 'main' into patch-1

2b909d8

ywang96 enabled auto-merge (squash) October 6, 2025 05:16

ywang96 merged commit 7c2ec0f into vllm-project:main Oct 6, 2025
47 checks passed

karan pushed a commit to karan/vllm that referenced this pull request Oct 6, 2025

[Benchmarking] Add disable_shuffle option for dataset loading (vllm-p…

6ebb39d

…roject#26258) Signed-off-by: Yasmin Moslem <[email protected]> Signed-off-by: Karan Goel <[email protected]>

southfreebird pushed a commit to southfreebird/vllm that referenced this pull request Oct 7, 2025

[Benchmarking] Add disable_shuffle option for dataset loading (vllm-p…

36b982f

…roject#26258) Signed-off-by: Yasmin Moslem <[email protected]>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[Benchmarking] Add disable_shuffle option for dataset loading (vllm-p…

0a0d816

…roject#26258) Signed-off-by: Yasmin Moslem <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Benchmarking] Add disable_shuffle option for dataset loading #26258

[Benchmarking] Add disable_shuffle option for dataset loading #26258

Uh oh!

ymoslem commented Oct 5, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Oct 5, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

ywang96 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[Benchmarking] Add disable_shuffle option for dataset loading #26258

[Benchmarking] Add disable_shuffle option for dataset loading #26258

Uh oh!

Conversation

ymoslem commented Oct 5, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 5, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

ywang96 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ymoslem commented Oct 5, 2025 •

edited by github-actions bot

Loading