Hi,
I installed guidellm from source (commit hash 9108369).
I used vLLM to serve a model (Llama 3.1 8B) and ran this guidellm command:
export MODEL_NAME=/hf_models/meta-llama/Meta-Llama-3.1-8B-Instruct/
guidellm benchmark \
--target "http://localhost:8008" \
--model $MODEL_NAME \
--rate-type concurrent \
--rate 4.0 \
--max-seconds 15 \
--data "prompt_tokens=256,output_tokens=16"
The benchmark seems to run for 15 seconds, but then fails:
pydantic_core._pydantic_core.ValidationError: 1 validation error for BenchmarkRunStats
requests_made
  Input should be a valid dictionary or instance of StatusBreakdown[int, int, int, int] [type=model_type, input_value=StatusBreakdown(successfu...incomplete=4, total=144), input_type=StatusBreakdown]
    For further information visit https://errors.pydantic.dev/2.9/v/model_type
The Python and pydantic versions I use:
python --version
Python 3.10.12
pip show pydantic
Name: pydantic
Version: 2.9.2
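One possible reading of the error above, which I have not confirmed against the guidellm source: the field is typed as the parametrized generic StatusBreakdown[int, int, int, int], while the value passed in was built from the unparametrized StatusBreakdown class. Below is a minimal sketch of the pattern that does validate, where the value is constructed through the parametrized class. The two models are hypothetical stand-ins, not guidellm's actual definitions; the field names (in particular errored) and the counts are assumptions chosen for illustration.

```python
from typing import Generic, TypeVar

from pydantic import BaseModel

# Hypothetical type variables for the four counters.
SuccessT = TypeVar("SuccessT")
ErrorT = TypeVar("ErrorT")
IncompleteT = TypeVar("IncompleteT")
TotalT = TypeVar("TotalT")


class StatusBreakdown(BaseModel, Generic[SuccessT, ErrorT, IncompleteT, TotalT]):
    """Stand-in generic model; field names are assumed, not taken from guidellm."""

    successful: SuccessT
    errored: ErrorT
    incomplete: IncompleteT
    total: TotalT


class BenchmarkRunStats(BaseModel):
    """Stand-in model whose field expects the fully parametrized generic."""

    requests_made: StatusBreakdown[int, int, int, int]


# Build the value through the parametrized class so its type matches the
# field annotation exactly. The counts are illustrative only.
counts = StatusBreakdown[int, int, int, int](
    successful=10, errored=0, incomplete=2, total=12
)
stats = BenchmarkRunStats(requests_made=counts)
print(stats.requests_made.total)
```

Whether an instance of the unparametrized class is accepted for such a field may depend on the pydantic version, so this only shows the form that matches the annotation directly.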