Closed as not planned
Labels
bug (Something isn't working), stale (Over 90 days of inactivity)
Description
### Your current environment
In the example https://github.com/vllm-project/vllm/blob/main/examples/multilora_inference.py, if you pass two different LoRA adapters instead of the same adapter under two different names, the output for every request comes from the first adapter that was initialized.
### 🐛 Describe the bug
Give any two different LoRA adapters and run inference: the model picks only the first adapter in order, and the output for the second request also comes from the first adapter. For example:

    (
        "[user] Write a SQL query to answer the question based on the table schema.\n\n context: CREATE TABLE table_name_74 (icao VARCHAR, airport VARCHAR)\n\n question: Name the ICAO for lilongwe international airport [/user] [assistant]",
        SamplingParams(temperature=0.0,
                       logprobs=1,
                       prompt_logprobs=1,
                       max_tokens=128,
                       stop_token_ids=[32003]),
        LoRARequest("sql-lora2", 2, lora_path)),
    (
        "my nam is",
        SamplingParams(n=3,
                       best_of=3,
                       use_beam_search=True,
                       temperature=0,
                       max_tokens=128,
                       stop_token_ids=[32003]),
        LoRARequest("sql-lora", 1, 'timdettmers/qlora-flan-7b')),
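One way to confirm the symptom is to send the same prompt once per adapter with `temperature=0` and compare the generated text. The helper below is a minimal sketch of that comparison. It is not part of vLLM: the name `find_identical_outputs` and the input dictionary are assumptions made for illustration, and the caller is assumed to have already collected one output string per adapter.

```python
from collections import defaultdict
from typing import Dict, List


def find_identical_outputs(outputs_by_adapter: Dict[str, str]) -> List[List[str]]:
    """Group adapter names whose generated text is exactly identical.

    Hypothetical helper (not a vLLM API). `outputs_by_adapter` maps a LoRA
    name such as "sql-lora" to the text generated for one shared prompt.
    """
    groups = defaultdict(list)
    for name, text in outputs_by_adapter.items():
        groups[text].append(name)
    # Only groups containing more than one adapter are worth inspecting.
    return [sorted(names) for names in groups.values() if len(names) > 1]


if __name__ == "__main__":
    # Placeholder strings standing in for real generations.
    collected = {
        "sql-lora": "SELECT icao FROM table_name_74",
        "sql-lora2": "SELECT icao FROM table_name_74",
    }
    print(find_identical_outputs(collected))
```

An empty result means every adapter produced distinct text. A non-empty result is only a hint, since two genuinely different adapters can still agree on a short prompt, so it is worth checking several prompts before concluding that the second adapter is being ignored.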