Fix v1/chat/completions Gibberish API Responses #41
Conversation
@keldenl any hints where one could find an unfiltered vicuna grazing? asking for a friend ...
hug.. some.. faces?
Thanks for the contribution! I'll try to address this in a more general way with #17 by allowing you to load multiple models and set defaults based on the specific model.
Also, I haven't tested out the vicuna model yet, but it looks very promising; I've found using alpaca for chat is less than ideal.
Vicuña has given me some good results. I've tweaked chat-ui (a ChatGPT clone using the OpenAI API) and been able to run the FastAPI server against it! The chat is pretty good, other than the slower generation due to the lack of a chat mode :/
@keldenl awesome, yeah now that the mac install bugs ...
lmk if i can help in parallel in any way 😀
Related to this - currently the completion prompt returns gibberish if the system prompt "You are a helpful assistant." is not set. It would be great if this could be omitted, similar to the actual OpenAI API. |
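For illustration, the kind of call this refers to might look like the following: a minimal sketch assuming a local llama-cpp-python server on port 8000 and the openai 0.x Python client (the model name and key are placeholders and are ignored by the local server):

```python
import openai

# Point the openai 0.x client at the local llama-cpp-python server.
openai.api_base = "http://localhost:8000/v1"
openai.api_key = "sk-dummy"  # placeholder; not validated locally

resp = openai.ChatCompletion.create(
    model="vicuna-13b",  # placeholder model name
    messages=[
        # No "system" message: ideally this should still yield sensible
        # output, as it does with the real OpenAI API.
        {"role": "user", "content": "What is the capital of France?"},
    ],
)
print(resp["choices"][0]["message"]["content"])
```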
Update? |
I think the issue is you now need to specify the chat_format correctly ... it won't guess anymore.
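For example, a minimal sketch with a recent llama-cpp-python, passing chat_format explicitly (the model path is illustrative):

```python
from llama_cpp import Llama

# Select the prompt template explicitly; the library no longer guesses.
llm = Llama(
    model_path="./models/vicuna-13b.Q4_K_M.gguf",  # illustrative path
    chat_format="vicuna",
)

out = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Name the planets in the solar system."},
    ],
)
print(out["choices"][0]["message"]["content"])
```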
Force-pushed from 8c93cf8 to cc0fe43
@earonesty correct, this is all handled correctly now by the chat format and chat handler APIs. |
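For anyone who needs a template the library doesn't ship, here is a sketch of registering a custom chat format, assuming the register_chat_format decorator and ChatFormatterResponse from llama_cpp.llama_chat_format (the "my-vicuna" name and the template itself are hypothetical):

```python
from llama_cpp.llama_chat_format import ChatFormatterResponse, register_chat_format

@register_chat_format("my-vicuna")
def format_my_vicuna(messages, **kwargs) -> ChatFormatterResponse:
    # Flatten OpenAI-style chat messages into one Vicuna-style prompt.
    prompt = ""
    for m in messages:
        if m["role"] == "system":
            prompt += m["content"] + "\n\n"
        elif m["role"] == "user":
            prompt += "USER: " + m["content"] + "\n"
        elif m["role"] == "assistant":
            prompt += "ASSISTANT: " + m["content"] + "\n"
    prompt += "ASSISTANT:"  # cue the model to answer as the assistant
    return ChatFormatterResponse(prompt=prompt, stop="USER:")
```

A model constructed with Llama(model_path=..., chat_format="my-vicuna") would then route chat completions through this formatter.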
The chat completion API in the FastAPI server wasn't doing a very consistent job of completing chat. The results were consistently gibberish (like `\nA\n/imagine prompt: User is asking about`, or just references to the system message in general), so I went ahead and tweaked the prompt (it was also weirdly formatted, which probably confused the text generation even more). Here it is before and after with the default example (running vicuna-13B unfiltered):
Before: prompt and results (screenshots omitted)
After: prompt and results (screenshots omitted)
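As a rough, hypothetical illustration of the kind of prompt change involved (the exact strings in this PR differ):

```python
# Illustrative only: a loosely structured "before" prompt that mixes
# scaffolding with the conversation, inviting the model to continue the
# scaffolding instead of chatting.
before_prompt = (
    "### Instructions: You are a helpful assistant.\n"
    "### Inputs: user: What is the capital of France?\n"
    "### Response:"
)

# Illustrative only: an "after" prompt with one consistent turn format
# and a trailing assistant cue for the model to complete.
after_prompt = (
    "You are a helpful assistant.\n"
    "### Human: What is the capital of France?\n"
    "### Assistant:"
)
```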
I also followed the general guidance on default parameters for chatting from https://www.reddit.com/r/LocalLLaMA/comments/11o6o3f/how_to_install_llama_8bit_and_4bit/ to help with the results.
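As a sketch of what chat-oriented defaults in that spirit might look like (the values are illustrative, not necessarily the ones merged here):

```python
from llama_cpp import Llama

llm = Llama(model_path="./models/vicuna-13b.Q4_K_M.gguf")  # illustrative path

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Hi there!"}],
    temperature=0.7,     # some variety without rambling
    top_p=0.9,           # nucleus sampling
    top_k=40,            # cap the candidate token pool
    repeat_penalty=1.1,  # discourage repetition loops
    max_tokens=256,
)
print(out["choices"][0]["message"]["content"])
```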
Also added some macOS-specific entries to .gitignore, which helps with contributing.
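For reference, the usual macOS-specific ignore entry looks like this (the exact lines added in the PR may differ):

```
# macOS Finder metadata
.DS_Store
```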