LoRA Model Support and Full Fine-tune options #4645
Comments
@the-crypt-keeper, would you know the answer to either question? You seem knowledgeable on the llama.cpp examples. Thanks.
Regarding full fine-tuning: on closer reading, I see that the train-text-from-scratch example does allow full fine-tuning (including training from scratch).
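For reference, the train-text-from-scratch example is driven entirely from the command line. A sketch of an invocation along the lines of the example's README is below; the exact flag names and defaults vary between llama.cpp versions, so treat these as assumptions and check `--help` against your build:

```shell
# Full training run on a small model defined from scratch (hypothetical
# sizes/paths; flags may differ by llama.cpp version -- verify with --help).
./train-text-from-scratch \
    --vocab-model ../models/ggml-vocab-llama.gguf \
    --ctx 64 --embd 256 --head 8 --layer 16 \
    --checkpoint-in  chk-shakespeare-256x16-LATEST.gguf \
    --checkpoint-out chk-shakespeare-256x16-ITERATION.gguf \
    --model-out ggml-shakespeare-256x16-f32.gguf \
    --train-data "shakespeare.txt" \
    -t 6 -b 16 --seed 1 --adam-iter 256 \
    --no-checkpointing
```

Because the checkpoint-in/checkpoint-out pair lets a run resume from a previous state, the same loop can be used either for training from scratch or for continuing training on an existing checkpoint.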
@RonanKMcGovern you found it.
I tried fine-tuning TinyLlama locally for chat and couldn't even load it; it was probably a bad model to pick: #4703. This is easy to do with transformers, but it would be nice to be able to do it on a Mac.
This issue is stale because it has been open for 30 days with no activity. |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
Full fine-tuning with llama.cpp
Are there any efforts to allow full fine-tuning with llama.cpp (not LoRA)?
If not, is this because there isn't a back-propagation capability? It would be great to at least be able to train the norm and embed modules so that model context can be extended. This also improves performance when chat fine-tuning models.
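The "train only the norm and embed modules" idea above amounts to freezing every other parameter before training. A minimal PyTorch sketch of that selection step is below (this is my own illustration, not llama.cpp code; the name-matching heuristic and the toy model are assumptions):

```python
import torch.nn as nn

def freeze_all_but_norm_and_embed(model: nn.Module) -> list:
    """Freeze every parameter except those belonging to normalization
    and embedding modules; return the names left trainable.

    Matching on parameter names containing 'norm'/'embed' is a
    heuristic that happens to fit common LLaMA-style naming
    (e.g. 'embed_tokens', 'input_layernorm'); adjust for your model.
    """
    trainable = []
    for name, param in model.named_parameters():
        keep = "norm" in name.lower() or "embed" in name.lower()
        param.requires_grad = keep
        if keep:
            trainable.append(name)
    return trainable

# Toy stand-in for a real LLM, just to show the selection behavior.
model = nn.Sequential()
model.add_module("embed_tokens", nn.Embedding(100, 32))
model.add_module("linear", nn.Linear(32, 32))
model.add_module("norm", nn.LayerNorm(32))

names = freeze_all_but_norm_and_embed(model)
print(sorted(names))
```

After this pass, an optimizer built over `filter(lambda p: p.requires_grad, model.parameters())` updates only the embedding and norm weights, which is far cheaper than full fine-tuning while still touching the modules most relevant to context extension.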
LoRA fine-tuning
I notice in the examples that only Llama models are supported, but I see in the issues that other models are being fine-tuned successfully. Is there a list of which models work and which don't? I'd be happy to open a PR against the example to document that.