The initial token is always empty. #367

Closed
BadisG

Description

Hello,

I noticed something when trying the chat with Bob: the first token is always empty.

1 -> ''
4103 -> ' Trans'
924 -> 'cript'
310 -> ' of'
263 -> ' a'
7928 -> ' dialog'

So the result is this:

[screenshot: the output begins with a leading space before "Transcript of a dialog, where the User..."]

There's this little space at the beginning of the text. Maybe this alone can significantly impact the quality of the output, which is why I decided to post this issue.

I'm on Windows 10 using WSL to emulate the Linux environment (the main.exe build is not as good as the Linux main at the moment).

I'm using a model file that is the result of the following steps:

  1. I started with a llama-7b-4bit.pt file
  2. I converted it with the GPTQ-to-ggml converter (convert-gptq-to-ggml.py)
  3. I converted it again to the new ggml format with the script from "Breaking change of models since PR #252" (#324 (comment))

Here's the .sh command (7B_CHAT_Bob.sh):

#!/bin/bash
dos2unix 7B_CHAT_Bob.sh

./main -m ./models/llama7b-4bit-GPTQ.bin -t 14 -n 256 --repeat_penalty 1.0 --color -i -r "User:" -f prompts/chat-with-bob.txt

Everything in this repository is up to date, as I do a git pull every time I launch PowerShell.

Activity

Labels added on Mar 21, 2023: question (further information is requested), need more info (the OP should provide more details about the issue)
gjmulder (Collaborator) commented on Mar 21, 2023

Please review the issue reporting guidelines in #239 and provide a better description of the issue you are observing.

BadisG (Author) commented on Mar 21, 2023

> Please review the issue reporting guidelines in #239 and provide a better description of the issue you are observing.

I added more details based on your guidelines; I hope that helps.

PriNova commented on Mar 21, 2023

> I noticed something when trying the chat with Bob: the first token is always empty. [...]
> There's this little space at the beginning of the text. [...]

The token with ID 1 is a special control token, BOS (beginning of sequence), and is one of the two tokens required in the token vocabulary. The second is EOS (end of sequence), with ID 2.

That is to say, this is normal behaviour.
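
To make this concrete, here is a minimal, self-contained sketch (hypothetical names and a hand-copied vocabulary, not the llama.cpp API) of why the first printed token appears empty: the tokenizer prepends the BOS id, and the vocabulary maps that id to an empty string.

    // Minimal sketch, not the llama.cpp API: BOS_ID, EOS_ID, and the token
    // list are illustrative, copied from the dump earlier in this issue.
    #include <cstdio>
    #include <string>
    #include <utility>
    #include <vector>

    static const int BOS_ID = 1; // beginning-of-sequence control token
    static const int EOS_ID = 2; // end-of-sequence control token (unused here)

    int main() {
        // Token ids and their vocabulary strings, as reported above.
        std::vector<std::pair<int, std::string>> toks = {
            {BOS_ID, ""}, // the control token decodes to an empty string
            {4103, " Trans"}, {924, "cript"}, {310, " of"},
            {263, " a"}, {7928, " dialog"},
        };
        for (const auto & t : toks) {
            std::fputs(t.second.c_str(), stdout); // BOS contributes nothing visible
        }
        std::putchar('\n'); // prints " Transcript of a dialog" (leading space kept)
        return 0;
    }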

BadisG (Author) commented on Mar 21, 2023

@PriNova I see, thanks for your answer, I learned something today!
But I can still see a space at the beginning of the text; I don't think I had that before, and it's a bit ugly to look at... but if it doesn't change the output I'm OK with that.

mattsta commented on Mar 22, 2023

You can make token 1 go away by commenting out this line in llama_tokenize() in utils.cpp:

    if (bos) {
        // output.push_back(1); // don't prepend the BOS token (id 1)
    }

It's probably more correct with it there, but removing it doesn't seem to break anything (at least if you're only submitting one whole document per session).

As for the leading space, look at your initial tokens above:

4103 -> ' Trans'
924 -> 'cript'

The space is inside the first token, so it gets printed. Technically, if the first token starts with a space, the output could skip over it when printing; a sketch of that idea follows below.
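
A hedged sketch of that idea (illustrative only, not the actual llama.cpp printing code): drop a single leading space from the very first text token before echoing it.

    // Illustrative sketch, not the actual llama.cpp printing code.
    #include <cstdio>
    #include <string>
    #include <vector>

    int main() {
        std::vector<std::string> pieces = {" Trans", "cript", " of", " a", " dialog"};
        bool first = true;
        for (const auto & piece : pieces) {
            const char * s = piece.c_str();
            if (first && *s == ' ') {
                ++s; // skip the tokenizer-injected space, only on the first token
            }
            first = false;
            std::fputs(s, stdout);
        }
        std::putchar('\n'); // prints "Transcript of a dialog", no leading space
        return 0;
    }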

Green-Sky (Collaborator) commented on Mar 22, 2023

The leading space is intentional and a result of
https://github.com/ggerganov/llama.cpp/blob/d5850c53ca179b9674b98f35d359763416a3cc11/main.cpp#L232-L233

Not sure if we should just skip printing the first character (the space) or not.
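
For context, a minimal sketch of what the referenced lines do (a paraphrase; see the permalink above for the exact source): a space is prepended to the prompt before tokenization to match the original LLaMA tokenizer, which expects words to be space-prefixed.

    // Paraphrased sketch of the behaviour at the linked main.cpp lines.
    #include <cstdio>
    #include <string>

    int main() {
        std::string prompt = "Transcript of a dialog, where the User...";
        prompt.insert(0, 1, ' '); // prepend a space to match the OG LLaMA tokenizer
        std::printf("[%s]\n", prompt.c_str()); // "[ Transcript of a dialog, ...]"
        return 0;
    }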

added a commit that references this issue on Dec 19, 2023:

abf6d4a Merge pull request ggml-org#367 from ianscrivener/ianscrivener-macos-…
github-actions (Contributor) commented on Apr 10, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.
