Fix some issues with benchmark data output #13641
Conversation
Signed-off-by: Huy Do <[email protected]>
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.
benchmarks/benchmark_serving.py
Outdated
parser.add_argument(
    "--tensor-parallel-size",
    type=int,
    default=0,
    help=
    "The tensor parallel used by the server to display on the dashboard")
If you just need a way to pass `tp` as an argument to this script, we already have `--metadata`.
Ah got it, `--metadata` is indeed more appropriate for this, as I just need to get the `tp` parameter to show on the dashboard. It's probably a good idea to save other server parameters too, just in case.
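To illustrate the suggestion: `--metadata` lets the caller attach arbitrary key=value pairs (such as `tp`) to the benchmark results without adding a dedicated flag per field. The sketch below shows one way such an argument could be parsed; the argument name comes from the discussion, but the exact parsing logic and example keys are assumptions, not copied from `benchmark_serving.py`:

```python
import argparse

# Hypothetical sketch: collect free-form key=value metadata for the results JSON.
parser = argparse.ArgumentParser()
parser.add_argument(
    "--metadata",
    nargs="*",
    default=[],
    help="Extra key=value pairs (e.g. tp=4) saved alongside the benchmark results.")

# Example invocation: benchmark_serving.py --metadata tp=4 gpu_type=H100
args = parser.parse_args(["--metadata", "tp=4", "gpu_type=H100"])

# Split each pair on the first '=' so values may themselves contain '='.
metadata = dict(kv.split("=", 1) for kv in args.metadata)
print(metadata)  # {'tp': '4', 'gpu_type': 'H100'}
```

This keeps the CLI stable as new dashboard fields are needed: callers add pairs instead of the script growing new flags.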
LGTM! Thanks for the fix and addressing my comment!
Signed-off-by: Huy Do <[email protected]> Signed-off-by: Louis Ulmer <[email protected]>
This is a follow-up of #13068 to fix:

- The `inf` qps config. The JSON parser used by the database doesn't like this value, so I add a custom `InfEncoder` to convert it to an `"inf"` string.
- ~~Passing the `--tensor-parallel-size` parameter to the `benchmark_serving.py` script so that it can be stored in JSON and picked up by the dashboard.~~ I'll use `--metadata` to get this instead per @ywang96's comment.
- `commands`, because the new `.pytorch.json` is also a JSON file.

cc @ywang96 @simon-mo

Just FYI, here is a preview of the new dashboard.
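For context on the first fix: `json.dumps(float("inf"))` emits the bare token `Infinity`, which is not valid JSON and which strict parsers reject. One way to build an `InfEncoder` with the behavior described above is to recursively replace infinite floats with the string `"inf"` before encoding; the class name matches the PR, but this body is a sketch under that assumption, not the PR's exact code:

```python
import json
import math
from typing import Any


class InfEncoder(json.JSONEncoder):
    """Encode infinite floats as the string "inf" so strict JSON parsers accept the output."""

    def clear_inf(self, o: Any) -> Any:
        # Walk dicts and lists, replacing any +/-inf float with the string "inf".
        if isinstance(o, dict):
            return {k: self.clear_inf(v) for k, v in o.items()}
        if isinstance(o, list):
            return [self.clear_inf(v) for v in o]
        if isinstance(o, float) and math.isinf(o):
            return "inf"
        return o

    def iterencode(self, o: Any, *args, **kwargs) -> Any:
        # encode() delegates to iterencode(), so sanitizing here covers json.dumps(..., cls=InfEncoder).
        return super().iterencode(self.clear_inf(o), *args, **kwargs)


print(json.dumps({"request_rate": float("inf")}, cls=InfEncoder))
# {"request_rate": "inf"}
```

Sanitizing before encoding (rather than overriding `default`, which is never called for floats) is what makes this work: the encoder only ever sees JSON-safe values.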