You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
So I compared the two versions, and the only changes I can see are
they renamed the "su" scaling method to "longrope"
they removed the yarn implementation from the modeling_phi3.py
If you wouldn't mind, could you try just changing the name to "su" in the config? If that works I can just add an alias and it shouldn't need any other changes.
Seems to be caused by:
Useful references:
ggml-org/llama.cpp#8262
ggml-org/llama.cpp#6849 (comment)
Conversion log:
The text was updated successfully, but these errors were encountered: