Create vLLM v1 benchmark dashboard #6306

huydhn · 2025-02-20T05:09:18Z

This is the initial version of vLLM v1 benchmark dashboard. The benchmark is run periodically on vLLM main commits. The script running the benchmark is at https://github.com/pytorch/pytorch-integration-testing/tree/master/vllm-benchmarks.

Besides all the custom logic for vllm-project/vllm, I also add a new extra map in the query to store arbitrary information about how the benchmark is setup.

Some UX features are left for subsequent PRs:

Provide more information about how the benchmark is setup to be on par with the v0 dashboard
Fix the issue where request_rate and tensor_parallel_size are missing when the former is set to Inf leading to an invalid JSON. This fix needs to be done on vLLM side

Preview

https://torchci-git-fork-huydhn-create-vllm-benchma-323632-fbopensource.vercel.app/benchmark/llms?repoName=vllm-project%2Fvllm

Last 1 day - All benchmarks are now running with the exception of speculative decoding serving benchmark, which is not yet supported in v1.
Last 7 days - There was llama3-8b model because of this issue [Bug]: Benchmark v1 on multi-gpu crashes with ValueError: Pointer argument (at 0) cannot be accessed from Triton vllm-project/vllm#13392, which was fixed last weekend.

vercel · 2025-02-20T05:09:22Z

@huydhn is attempting to deploy a commit to the Meta Open Source Team on Vercel.

A member of the Team first needs to authorize it.

vercel · 2025-02-20T05:09:29Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Updated (UTC)
torchci	✅ Ready (Inspect)	Visit Preview	Feb 21, 2025 6:20pm

huydhn · 2025-02-21T02:37:42Z

torchci/components/benchmark/llms/ModelGraphPanel.tsx

-                : `${model} (${dtype} / ${device})`;
+              if (repoName === "vllm-project/vllm") {
+                let requestRate = record.extra!["request_rate"];
+                // TODO (huydhn): Fix the invalid JSON on vLLM side


Here is the fix on vLLM side vllm-project/vllm#13641 under review. Once this lands, we can remove these hacks

torchci/lib/benchmark/aoUtils.ts

Co-authored-by: clee2000 <[email protected]>

yangw-dev · 2025-02-21T19:31:42Z

torchci/components/benchmark/CommitPanel.tsx

-        </a>
-        . {children}
-      </Typography>
+      {repoName !== "vllm-project/vllm" && (


option: maybe can place a list of repoName that has this exception

Yeah, the benchmark UX seems to be growing pretty fast recently, so I think I will take a step back to see if we could refactor the code here for better modularization

huydhn added 3 commits February 19, 2025 12:13

Add vLLM benchmark dashboard

a6ee19d

Merge branch 'main' into create-vllm-benchmark-dashboard

1287b6c

Add the initial version of vLLM v1 benchmark dashboard

e7032d2

huydhn requested review from clee2000 and yangw-dev February 20, 2025 05:09

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 20, 2025

vercel bot deployed to Preview February 20, 2025 05:11 View deployment

vLLM v1

746c486

vercel bot deployed to Preview February 20, 2025 05:14 View deployment

huydhn requested a review from seemethere February 20, 2025 05:26

huydhn commented Feb 21, 2025

View reviewed changes

clee2000 approved these changes Feb 21, 2025

View reviewed changes

torchci/lib/benchmark/aoUtils.ts Outdated Show resolved Hide resolved

torchci/lib/benchmark/aoUtils.ts Outdated Show resolved Hide resolved

Apply suggestions from code review

0fb730b

Co-authored-by: clee2000 <[email protected]>

vercel bot deployed to Preview February 21, 2025 18:20 View deployment

huydhn merged commit ab1f268 into pytorch:main Feb 21, 2025
5 of 6 checks passed

yangw-dev reviewed Feb 21, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Create vLLM v1 benchmark dashboard #6306

Create vLLM v1 benchmark dashboard #6306

Uh oh!

huydhn commented Feb 20, 2025 •

edited

Loading

Uh oh!

vercel bot commented Feb 20, 2025

Uh oh!

vercel bot commented Feb 20, 2025 •

edited

Loading

Uh oh!

huydhn Feb 21, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yangw-dev Feb 21, 2025

Uh oh!

huydhn Feb 21, 2025

Uh oh!

Uh oh!

Create vLLM v1 benchmark dashboard #6306

Create vLLM v1 benchmark dashboard #6306

Uh oh!

Conversation

huydhn commented Feb 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Preview

Uh oh!

vercel bot commented Feb 20, 2025

Uh oh!

vercel bot commented Feb 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

huydhn Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yangw-dev Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

huydhn Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

huydhn commented Feb 20, 2025 •

edited

Loading

vercel bot commented Feb 20, 2025 •

edited

Loading