Description
While troubleshooting #2678, I caught llama2 (or something else in the stack?) generating a bunch of non-printable characters (specifically ASCII 28, i.e. 0x1C) in my bot conversations. It is possible that this is the root cause of #2678.
Since I was feeding LLM responses back into a transcript in the context, these characters then pollute the conversation (server.cpp does not write responses directly into the context the way main.cpp does). This would cause my next round of token processing to go crazy and spin the GPU without returning anything (presumably trying to do something with the control characters).
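For what it's worth, a minimal sketch of the workaround on the client side: stripping control bytes from a response before appending it to the transcript. `sanitize_response` is my own hypothetical helper, not anything in llama.cpp:

```cpp
#include <string>

// Hypothetical helper: drop ASCII control bytes (e.g. 0x1C / ASCII 28)
// from a generated response, keeping newlines and tabs, before the
// response is appended to the conversation transcript.
std::string sanitize_response(const std::string & response) {
    std::string clean;
    clean.reserve(response.size());
    for (unsigned char c : response) {
        // bytes >= 0x20 cover printable ASCII and UTF-8 multi-byte sequences;
        // 0x7F (DEL) is excluded
        if ((c >= 0x20 && c != 0x7F) || c == '\n' || c == '\t') {
            clean += (char) c;
        }
    }
    return clean;
}
```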
I was using upstage-llama-2-70b-instruct-v2.ggmlv3.q5_K_M.bin on Metal (M1, 64 GB). I am not using MPS.
This was observed with build bf83bff.
I have not been able to reliably repro this yet with dadbed9, so take it with a grain of salt, but if anyone else is suffering from #2678, check whether the returned data contains non-printables. I'm not sure whether these are coming from code or data, but how they got into the 70B data in the first place is probably another question.
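If it helps anyone checking, here's roughly what I mean by scanning for non-printables (just a sketch, not part of llama.cpp):

```cpp
#include <cstdio>
#include <string>

// Scan returned data and report any control bytes, so you can tell
// whether #2678 hangs coincide with non-printables in the output.
void report_nonprintables(const std::string & data) {
    for (size_t i = 0; i < data.size(); ++i) {
        unsigned char c = (unsigned char) data[i];
        if (c < 0x20 && c != '\n' && c != '\t' && c != '\r') {
            printf("control byte 0x%02X at offset %zu\n", c, i);
        }
    }
}
```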
Example here: #2678 (comment)
Activity
lshzh-ww commented on Aug 24, 2023
Hi @ProjectAtlantis-dev ,
Could you attempt to backport #2699 to your code? I assume you are not using the latest llama.cpp code and are missing the correct fix for the problem you reported.
ProjectAtlantis-dev commented on Aug 24, 2023
I am testing with dadbed9 and have not gotten any more control characters.
In any case, it is a pre-GGUF model, so this has perhaps been overcome by events.
ProjectAtlantis-dev commented on Aug 29, 2023
Have not seen any more control chars, and since this is a pre-GGUF model, I am closing.