Add classifier-free guidance #377

philpax · 2023-07-17T10:03:24Z

llama.cpp has recently developed support for CFG:

llama : add support for Classifier-Free Guidance (CFG) sampling to stay on topic better ggml-org/llama.cpp#2083
Implement classifier-free guidance ggml-org/llama.cpp#2135

We should mirror this support. I'm not sure how well it will apply to the other models; I haven't investigated too deeply into this.

Vermeille · 2023-07-21T00:13:26Z

Make sure to remove the smooth_factor and the last log_softmax in order to remain consistent with llama.cpp's and HF's implementation ( ggml-org/llama.cpp#2280 )

KerfuffleV2 · 2023-08-12T12:21:29Z

I'm going to try to look at how to add this to llm-samplers. It will need the CFG logits though, so llm will need to handle that itself. I guess it can be supplied as a sampler resource similar to the RNG and last tokens. I'd like to figure out a more general way to handle resources but in the worst case I can just add another type of resource to that trait.

llama.cpp CFG sampler for reference (doesn't look too complicated): https://github.com/ggerganov/llama.cpp/blob/b19edd54d51cef5e3616c18b1d0d8626895b2cba/llama.cpp#L2709-L2740

On the llm side it looks like you have to maintain a guidance context and run the model for both contexts every token — so using CFG means evaluating the model is twice as slow (also, I think you need two K/V caches). Main relevant sections from llama.cpp's main example: https://github.com/ggerganov/llama.cpp/blob/b19edd54d51cef5e3616c18b1d0d8626895b2cba/examples/main/main.cpp#L208-L215 and https://github.com/ggerganov/llama.cpp/blob/b19edd54d51cef5e3616c18b1d0d8626895b2cba/examples/main/main.cpp#L484-L523

philpax added the issue:enhancement label Jul 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add classifier-free guidance #377

Add classifier-free guidance #377

philpax commented Jul 17, 2023

Vermeille commented Jul 21, 2023

Uh oh!

KerfuffleV2 commented Aug 12, 2023

Uh oh!

Add classifier-free guidance #377

Add classifier-free guidance #377

Comments

philpax commented Jul 17, 2023

Vermeille commented Jul 21, 2023

Uh oh!

KerfuffleV2 commented Aug 12, 2023

Uh oh!