
llama : add support for Classifier-Free Guidance (CFG) sampling to stay on topic better #2083


Closed
lukestanley opened this issue Jul 3, 2023 · 19 comments · Fixed by #2135
Labels
enhancement (New feature or request) · generation quality (Quality of model output) · good first issue (Good for newcomers) · research 🔬

Comments

@lukestanley

lukestanley commented Jul 3, 2023

@ggerganov retweeted the "Stay on topic with Classifier-Free Guidance" paper, which shows that Classifier-Free Guidance (CFG) "can be used broadly as an inference-time technique in pure language modeling" and "brings improvements equivalent to a model with twice the parameter-count", with no retraining needed: https://arxiv.org/abs/2306.17806

I saw that the Transformers library has one of the paper's authors working on an implementation.

I didn't see an issue for it yet here so I figured pointing to it is the least I could do for this awesome library!

@Vermeille
Contributor

Hi, I'm here to help if needed.

@AlphaAtlas

AlphaAtlas commented Jul 3, 2023

How would this actually work? Would the entire prompt be scaled, or just the initial instruction as a separate block of text?

Would negative prompts (as described in the paper) be yet another input?

Multiple inputs seem like a fundamental change for llama.cpp (though not one I am opposed to).

@bullno1
Contributor

bullno1 commented Jul 3, 2023

If I understand this correctly, this looks deceptively simple.
The recent multi-context support makes this possible:

  • Have a context for generation
  • Have another context for guidance, seeded with the last token of the prompt.
    This is an "unconditional" context to contrast against a conditional context with a prompt.
    I believe this can also be seeded with a negative prompt.
  • Eval both contexts
  • Use the logits of the guidance context to modify the logits of the generation context, using the formula from huggingface/transformers#24536 (Add Classifier-Free Guidance sampling).
    This looks like it is just a log-softmax and a weighted sum (see the sketch after this list).
  • Do sampling as usual from these new logits
  • Append the next token to both contexts
  • Repeat
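
To make the merge step above concrete, here is a minimal NumPy sketch of the formula from that Transformers PR. It is only an illustration, not the code that later landed in llama.cpp; cond_logits, uncond_logits and cfg_scale are placeholder names.

import numpy as np

def log_softmax(logits):
    # Numerically stable log-softmax over the vocabulary axis.
    shifted = logits - np.max(logits)
    return shifted - np.log(np.sum(np.exp(shifted)))

def cfg_logits(cond_logits, uncond_logits, cfg_scale):
    # Interpolate/extrapolate in log-probability space:
    # cfg_scale = 1.0 reproduces plain sampling from the generation context;
    # larger values push toward the prompt and away from the guidance/negative prompt.
    cond = log_softmax(np.asarray(cond_logits, dtype=np.float64))
    uncond = log_softmax(np.asarray(uncond_logits, dtype=np.float64))
    return uncond + cfg_scale * (cond - uncond)

# Toy example over a 4-token vocabulary:
merged = cfg_logits([2.0, 0.5, 0.1, -1.0], [1.0, 1.0, 0.2, -0.5], cfg_scale=1.5)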

@Vermeille
Contributor

Vermeille commented Jul 3, 2023

@AlphaAtlas I'm not clear what distinction you're drawing between the "entire prompt" and the "initial instruction", but really it depends on the use case.

For base models, the results laid out in the paper contrast prompted and promptless completion. That is, we compute the conditional completion with the whole text, and the unconditional one starting only from the last token before the net has to continue the text.

In pseudocode:

prompt = tokenize(user_prompt)
uncond_prompt = prompt[:, -1:]  # keep only the last token

while we want to sample continuation:
    cfg_logits = cfg(cond=model(prompt), uncond=model(uncond_prompt))
    next_token = sample(cfg_logits)

    prompt.append(next_token)
    uncond_prompt.append(next_token)

In the case of multi-round chat, I guess we would empty uncond_prompt at each round.


For assistants, the story is a little bit different.

Let assistant_prompt = system_prompt (e.g. "answer the following question") + user_prompt (e.g. "tell me about alpacas").

From my preliminary experiments, promptless continuation with assistants is terrible (feel free to experiment and challenge this; we were heavily time-constrained and did not try it thoroughly), so the unconditional prompt as outlined above doesn't work. Instead, we set uncond_prompt (which really becomes a negative prompt) to the assistant prompt built from the default system_prompt and the user_prompt, and we build cond_prompt from a different system_prompt (most likely set by the app designer who wants to deviate from the default prompt for a specific tone or persona) and the same user_prompt. This setting emphasizes the change in system_prompt.

If we actually want to use the default system prompt and emphasize the user prompt, we have several options:

  1. We could try to search for a neutral, generic negative user_prompt ("tell me something"? We need something neutral, but we could not find a satisfying wording).
  2. Introduce a special syntax. For instance, "Tell me about alpacas {as rap lyrics}" would get split into the positive prompt "Tell me about alpacas as rap lyrics" and the negative prompt "Tell me about alpacas", thus emphasizing "as rap lyrics" (a toy sketch of this split follows the list).
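
A toy sketch of option 2, assuming the "{...}" marker syntax above; split_cfg_prompt is a hypothetical helper for illustration, not part of llama.cpp.

def split_cfg_prompt(user_prompt):
    # "Tell me about alpacas {as rap lyrics}" ->
    #   positive: "Tell me about alpacas as rap lyrics"
    #   negative: "Tell me about alpacas"
    if "{" not in user_prompt or "}" not in user_prompt:
        return user_prompt, user_prompt  # no marker: nothing to emphasize
    before, rest = user_prompt.split("{", 1)
    emphasized, after = rest.split("}", 1)
    positive = (before + emphasized + after).strip()
    negative = (before + after).strip()
    return positive, negative

print(split_cfg_prompt("Tell me about alpacas {as rap lyrics}"))
# -> ('Tell me about alpacas as rap lyrics', 'Tell me about alpacas')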

Hope that helps!

EDIT: @bullno1 nailed the explanation.

@bullno1
Contributor

bullno1 commented Jul 3, 2023

@Vermeille So if I understand correctly, given a model like Vicuna which was trained on:

A chat between a user and a helpful, polite assistant ...
USER: [User input]
ASSISTANT: [Use model to generate this]

To apply CFG and make the response rude, we would give the generation context the modified system prompt:

A chat between a user and a rude and obnoxious assistant ...
USER: Tell me about LLM.
ASSISTANT:

Then give the guidance context the original system prompt:

A chat between a user and a polite assistant ...
USER: Tell me about LLM.
ASSISTANT:

After applying the logit merging as above, the result will stay close to the persona set out in the generation prompt and far away from the guidance/negative prompt.

@Vermeille
Contributor

@bullno1 100% correct. We had extremely """great""" results when asking for inappropriate or angry responses.
100% hilariously unhinged and over the top.

@AlphaAtlas

Fascinating, I think I get it now 🤔

Many (most?) users are running instruction-tuned assistants out in the wild, so this is an interesting issue for the downstream UI devs.

@evanmiller
Contributor

@Vermeille Should this subscript in Equation (7) be $j < i$ rather than $i < j$? I'm trying to understand the paper and want to make sure it's not "peeking" into future tokens.

[screenshot of Equation (7) from the paper]
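
For reference, the logit merge discussed in this thread corresponds to the CFG rule below, where the conditioning runs only over already-generated tokens $w_{j<i}$ (hence $j < i$); whether this matches the exact notation of Equation (7) in the paper is the question above.

$$\log \tilde{P}(w_i \mid w_{j<i}, c) \;=\; \log P(w_i \mid w_{j<i}) \;+\; \gamma \,\big(\log P(w_i \mid w_{j<i}, c) - \log P(w_i \mid w_{j<i})\big)$$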

@Vermeille
Contributor

Vermeille commented Jul 5, 2023 via email

@ggerganov changed the title from "Feature request: Classifier-Free Guidance sampling to stay on topic better" to "llama : add support for Classifier-Free Guidance (CFG) sampling to stay on topic better" on Jul 5, 2023
@ggerganov added the enhancement (New feature or request), good first issue (Good for newcomers), generation quality (Quality of model output) and research 🔬 labels on Jul 5, 2023
@ggerganov
Member

We should add an example of this technique or try to straight up add it to the main / server examples if it does not complicate the logic too much. As noted by @bullno1, the multi-context support should make this relatively easy.

If I'm understanding correctly, there might be a more efficient way to evaluate both contexts with a single batched pass, but to do that we'll need some extra changes to handle multiple KV caches. We can think about this later, though.

@Vermeille
Contributor

@ggerganov If you give me some guidance so that it does not take me too much time to implement the feature ("it goes in this file, follow this example, the softmax function is this one, and the caveats are this and that"), I volunteer.

@ggerganov
Member

@Vermeille Going through the backlog of issues, I just reached this one, so sorry for the late reply.

I see that @bullno1 already started an implementation, and from a quick look it is pretty much what is needed.
Let's work on #2135 and merge it.

@Vermeille
Contributor

The proposed implementation looks pretty slick indeed!

@Mihaiii
Contributor

Mihaiii commented Aug 31, 2023

Should I be able to use CFG without a negative prompt, but still have cfg-scale enforce the prompt (or parts of the prompt)?

Edit: or will it work if I have a negative prompt that is the same as the main prompt, but has "don't" instead of "do" (where "do" is in the main prompt)?

@Vermeille
Contributor

Should I be able to use CFG without a negative prompt, but still have cfg-scale enforce the prompt (or parts of the prompt)?

@Mihaiii Yes. That's the primary usage advocated in the paper.

Edit: or will it work if I have a negative prompt that is the same as the main prompt, but has "don't" instead of "do" (where "do" is in the main prompt)?

Should do the trick as well.

@DenisSergeevitch

--cfg-negative-prompt usage heavily affects performance on the M2 Metal hardware. Is it the same with CUDA?

@bullno1
Contributor

bullno1 commented Nov 24, 2023

--cfg-negative-prompt usage heavily affects performance on the M2 Metal hardware. Is it the same with CUDA?

It runs inference twice, once for the normal prompt and once for the negative prompt.
Maybe the batch API could help.

@ggerganov
Member

Yes, with batched decoding you can run F16 + CFG at the same speed as regular F16 decoding.
For quantum models, there are some ifs and buts, but it should also be possible

@RafaAguilar

RafaAguilar commented Dec 2, 2023

Yes, with batched decoding you can run F16 + CFG at the same speed as regular F16 decoding.

For quantum models, there are some ifs and buts, but it should also be possible

It was expected to run slower, but from what I can see in the GPU history it also uses only half of the GPU capacity.

I'm using the command below:

./main --cfg-scale 4 --mirostat 2 \
 -n 500 -c 2048 -b 8 --temp 0.7 \
 --top-p 0.2 -ngl 99 -t 10 \
 -m "$LLAMACPP_MODEL" \
 --prompt "$LLAMACPP_PROMPT" \
 --cfg-negative-prompt "$LLAMACPP_NEGATIVE_PROMPT"

[screenshot of GPU utilization history]


9 participants