Hi, I am running a load test against a vLLM server. Here is how to reproduce:
instance: 1xRTX 3090
load test tool: k6
server command:
python -m vllm.entrypoints.api_server --model mistralai/Mistral-7B-v0.1 --disable-log-requests --port 9009 --max-num-seqs 500
Then run k6 with 100 virtual users (VUs):
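(The original k6 script is not included above. As a stand-in, here is a minimal sketch of an equivalent load generator in Python, assuming the legacy `vllm.entrypoints.api_server` `/generate` endpoint and its `{"prompt": ..., "max_tokens": ...}` JSON body; the endpoint path, payload fields, prompt, and request count are illustrative assumptions, so adjust them to your deployment.)

```python
# Hedged sketch: fan out ~100 concurrent requests to a local vLLM api_server,
# mirroring 100 k6 VUs. Uses only the standard library.
import asyncio
import json
from urllib import request as urllib_request

URL = "http://localhost:9009/generate"  # port taken from the server command above
CONCURRENCY = 100                       # mirrors the 100 k6 VUs

def make_payload(prompt: str, max_tokens: int = 128) -> bytes:
    """Build the JSON body assumed for the /generate endpoint."""
    return json.dumps({"prompt": prompt, "max_tokens": max_tokens}).encode()

def post_once(payload: bytes) -> int:
    """Send one blocking POST and return the HTTP status code."""
    req = urllib_request.Request(
        URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib_request.urlopen(req, timeout=120) as resp:
        return resp.status

async def run_load_test(n_requests: int = 1000) -> None:
    """Issue n_requests total, at most CONCURRENCY in flight at once."""
    sem = asyncio.Semaphore(CONCURRENCY)
    payload = make_payload("Write a haiku about GPUs.")

    async def one() -> int:
        async with sem:
            # Run the blocking HTTP call in a worker thread.
            return await asyncio.to_thread(post_once, payload)

    statuses = await asyncio.gather(*(one() for _ in range(n_requests)))
    ok = sum(1 for s in statuses if s == 200)
    print(f"{ok}/{n_requests} requests returned HTTP 200")

# With the server running, uncomment to start the test:
# asyncio.run(run_load_test())
```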
I tried adjusting `--max-num-seqs` and `--max-num-batched-tokens`, but the server still can't sustain 100 VUs. Is there a recommended configuration for this setup?
Any help is appreciated, thank you.