Skip to content

Conversation

huydhn
Copy link
Contributor

@huydhn huydhn commented Feb 20, 2025

This is the initial version of vLLM v1 benchmark dashboard. The benchmark is run periodically on vLLM main commits. The script running the benchmark is at https://github.com/pytorch/pytorch-integration-testing/tree/master/vllm-benchmarks.

Besides all the custom logic for vllm-project/vllm, I also add a new extra map in the query to store arbitrary information about how the benchmark is setup.

Some UX features are left for subsequent PRs:

  • Provide more information about how the benchmark is setup to be on par with the v0 dashboard
  • Fix the issue where request_rate and tensor_parallel_size are missing when the former is set to Inf leading to an invalid JSON. This fix needs to be done on vLLM side

Preview

https://torchci-git-fork-huydhn-create-vllm-benchma-323632-fbopensource.vercel.app/benchmark/llms?repoName=vllm-project%2Fvllm

@huydhn huydhn requested review from clee2000 and yangw-dev February 20, 2025 05:09
Copy link

vercel bot commented Feb 20, 2025

@huydhn is attempting to deploy a commit to the Meta Open Source Team on Vercel.

A member of the Team first needs to authorize it.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 20, 2025
Copy link

vercel bot commented Feb 20, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Updated (UTC)
torchci ✅ Ready (Inspect) Visit Preview Feb 21, 2025 6:20pm

: `${model} (${dtype} / ${device})`;
if (repoName === "vllm-project/vllm") {
let requestRate = record.extra!["request_rate"];
// TODO (huydhn): Fix the invalid JSON on vLLM side
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here is the fix on vLLM side vllm-project/vllm#13641 under review. Once this lands, we can remove these hacks

@huydhn huydhn merged commit ab1f268 into pytorch:main Feb 21, 2025
5 of 6 checks passed
</a>
. {children}
</Typography>
{repoName !== "vllm-project/vllm" && (
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

option: maybe can place a list of repoName that has this exception

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, the benchmark UX seems to be growing pretty fast recently, so I think I will take a step back to see if we could refactor the code here for better modularization

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants