Skip to content

Conversation

WoosukKwon
Copy link
Collaborator

Example usage:

  • Generating a single completion: python benchmark/benchmark_text_completion.py --dataset alpaca_opt_text_completion.pkl --model facebook/opt-13b --request-rate 1.0 --duration 3600 --n1 1.0
  • Generating two completions in parallel: python benchmark/benchmark_text_completion.py --dataset alpaca_opt_text_completion.pkl --model facebook/opt-13b --request-rate 1.0 --duration 3600 --n2 1.0
  • Generating two completions with beam search: python benchmark/benchmark_text_completion.py --dataset alpaca_opt_text_completion.pkl --model facebook/opt-13b --request-rate 1.0 --duration 3600 --n2-beam 1.0

@WoosukKwon WoosukKwon requested a review from zhuohan123 April 6, 2023 09:46
@WoosukKwon
Copy link
Collaborator Author

Merging this PR to main, as we have too many branches.

@WoosukKwon WoosukKwon merged commit 84eee24 into main Apr 12, 2023
@WoosukKwon WoosukKwon deleted the experiment branch April 12, 2023 22:04
slyalin pushed a commit to slyalin/vllm that referenced this pull request Apr 19, 2024
…ce_artifacts

Revert "Produce artifacts for bare metal installation in Dockerfile.openvino"
dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024
This PR logs all errors during validation or generation
for a request like TGIS does. 

Signed-off-by: Joe Runde <[email protected]>
z103cb pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024
…ensions

Dockerfile.ubi: get rid of prebuilt-wheel stage
tianyil1 pushed a commit to tianyil1/vllm that referenced this pull request Jun 5, 2024
…um_wa

WA: Disable cumsum in HPU _prepare_prompt
fxmarty pushed a commit to fxmarty/vllm-public that referenced this pull request Jun 12, 2024
@alixiaodi alixiaodi mentioned this pull request Aug 2, 2024
wuhuikx pushed a commit to wuhuikx/vllm that referenced this pull request Mar 27, 2025
Some PR for plugin support is not merged by vllm yet. This PR add monkey
patch to vllm-ascend to make vllm-ascend work with vllm directly.

This patch code should be removed once the related function is supported
by vllm originally.

Signed-off-by: wangxiyuan <[email protected]>
zyongye added a commit to zyongye/vllm that referenced this pull request Aug 5, 2025
Signed-off-by: simon-mo <[email protected]>
Co-authored-by: simon-mo <[email protected]>
zyongye added a commit to zyongye/vllm that referenced this pull request Aug 6, 2025
Signed-off-by: simon-mo <[email protected]>
Co-authored-by: simon-mo <[email protected]>
heheda12345 pushed a commit to heheda12345/vllm that referenced this pull request Sep 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant