
Fix missing rope_freqs with convert_hf_to_gguf #402


Merged
saood06 merged 3 commits into main on May 9, 2025
Conversation

saood06 (Collaborator) commented May 9, 2025

This ports ggml-org/llama.cpp#9396 and ggml-org/llama.cpp#9117 (I don't think the latter was actually needed, as its changes are essentially reverted by #9396).

The issue was that the convert script defined generate_extra_tensors for those tensors, but no code ever called that function.
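
Roughly, this is the shape of the fix (an illustrative sketch only, not the verbatim patch; the class and method names here stand in for the real converter classes in convert_hf_to_gguf.py): the tensors yielded by generate_extra_tensors() have to be chained into the tensor iteration, otherwise tensors like rope_freqs.weight are silently dropped from the output GGUF.

```python
# Hedged sketch, not the actual convert_hf_to_gguf.py code.
import itertools
from typing import Iterable, Tuple


class ModelSketch:
    def generate_extra_tensors(self) -> Iterable[Tuple[str, list]]:
        # Model subclasses override this to emit tensors that have no
        # counterpart in the HF checkpoint, e.g. rope_freqs.weight.
        yield ("rope_freqs.weight", [1.0, 0.5, 0.25])

    def get_tensors(self) -> Iterable[Tuple[str, list]]:
        # Stand-in for the tensors loaded from the HF checkpoint files.
        yield ("token_embd.weight", [0.0] * 8)

    def prepare_tensors(self) -> None:
        # The bug: iterating only over get_tensors() means the extra tensors
        # are defined but never requested. Chaining generate_extra_tensors()
        # in front is what makes rope_freqs.weight show up in the output.
        for name, tensor in itertools.chain(self.generate_extra_tensors(),
                                            self.get_tensors()):
            self.write_tensor(name, tensor)

    def write_tensor(self, name: str, tensor: list) -> None:
        print(f"writing {name} ({len(tensor)} values)")


if __name__ == "__main__":
    ModelSketch().prepare_tensors()
```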

I tested with Llama-3_1-Nemotron-51B-Instruct and it now generates the rope_freqs.weight tensor, which was previously missing.

See #377 for more information.

ngxson and others added 3 commits May 9, 2025 06:17
This should also fix vocab-only conversion for Phi-3.
MiniCPM3's tokenizer is treated as a SentencePiece tokenizer to avoid
having to run its custom Python code which mixes tokenization
in the same file as tool calls.

gguf-py : add long and short RoPE factors to tensor mappings

Empty, but the key names are used to populate the mappings.
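
As an aside, a hedged sketch (stand-in enum and dict names, not the verbatim gguf-py diff) of what "empty, but the key names are used to populate the mappings" means: the HF-side name tuples are empty because no checkpoint tensor maps to them, but registering the keys makes the GGUF-side names resolvable for tensors the converter generates itself.

```python
# Hedged sketch; MODEL_TENSOR and the dict names below are illustrative
# stand-ins for the structures in gguf-py's constants/tensor mappings.
from enum import Enum, auto


class MODEL_TENSOR(Enum):
    ROPE_FACTORS_LONG = auto()
    ROPE_FACTORS_SHORT = auto()


# GGUF-side base names (assumed here for illustration).
TENSOR_NAMES = {
    MODEL_TENSOR.ROPE_FACTORS_LONG: "rope_factors_long",
    MODEL_TENSOR.ROPE_FACTORS_SHORT: "rope_factors_short",
}

# HF-side source names: intentionally empty, since no HF checkpoint tensor
# maps to them -- the converter computes them -- but the keys still populate
# the mapping so the GGUF names can be handed out.
mappings_cfg = {
    MODEL_TENSOR.ROPE_FACTORS_LONG: (),
    MODEL_TENSOR.ROPE_FACTORS_SHORT: (),
}
```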
saood06 requested a review from ikawrakow May 9, 2025 12:23
saood06 merged commit 967a2e1 into main May 9, 2025