STILL no way to convert phi-3-small to GGUF #8241


Closed
0wwafa opened this issue Jul 1, 2024 · 10 comments
Labels
duplicate This issue or pull request already exists

Comments

@0wwafa

0wwafa commented Jul 1, 2024

Why is that?
The Phi-3 models are the best around at the moment (for their size).

@foldl
Contributor

foldl commented Jul 2, 2024

It uses a different model architecture: Triton block sparse attention.

It would take a lot of effort. Is the work worth doing? I don't think so: Medium is better than Small, and Mini is faster than Small.
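To illustrate what makes this architecture different: in block sparse attention, each query block only attends to a small set of key blocks instead of the full causal context. The sketch below is a minimal illustration of the idea only; the parameter names and the exact sparsity pattern are illustrative, not Phi-3-small's actual configuration.

```python
def block_sparse_mask(seq_len, block_size, local_blocks):
    """Build a causal block-sparse attention mask (True = may attend).

    Each query block attends to its own block and the `local_blocks`
    preceding blocks; all other key positions are masked out.
    """
    n_blocks = seq_len // block_size
    mask = [[False] * seq_len for _ in range(seq_len)]
    for qb in range(n_blocks):
        first_kb = max(0, qb - local_blocks)
        for kb in range(first_kb, qb + 1):
            for q in range(qb * block_size, (qb + 1) * block_size):
                for k in range(kb * block_size, (kb + 1) * block_size):
                    mask[q][k] = k <= q  # stay causal inside allowed blocks
    return mask

mask = block_sparse_mask(seq_len=8, block_size=2, local_blocks=1)
```

Supporting this in llama.cpp would mean implementing such a masking scheme (and its Triton-kernel equivalent) natively, which is why the conversion is not just a tensor-renaming exercise.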

@0wwafa
Author

0wwafa commented Jul 2, 2024

@foldl

It uses a different model architecture: Triton block sparse attention.

It would take a lot of effort. Is the work worth doing? I don't think so: Medium is better than Small, and Mini is faster than Small.

I tested the Small and it's not bad at all...
Sincerely, perhaps implementing GLM-4 would be more important than this, and the team behind GLM-4 used a modified version of llama.cpp, so it should not be difficult to port.

BUT
the Phi-3 family is the best I have seen so far, and it won on the leaderboard against models twice its size... it would be interesting to test the Small as well.

@foldl
Contributor

foldl commented Jul 2, 2024

Regarding GLM-4, there is #8031.

Or, you can try chatglm.cpp and chatllm.cpp. Tool calling is also supported.

@0wwafa
Author

0wwafa commented Jul 2, 2024

See, it WAS important! Now the new mini-128k-instruct does not convert either!

@foldl
Contributor

foldl commented Jul 3, 2024

It will convert after #8262 is merged, or if you change 'longrope' to 'su' in 'config.json'.

@0wwafa
Author

0wwafa commented Jul 3, 2024

change 'longrope' to 'su' in 'config.json'.

Yep, that did it.
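For anyone applying the same workaround, the one-word edit to config.json can be scripted. This is a minimal sketch under the assumption that the model's rope_scaling section carries a "type" field; the path handling is illustrative.

```python
import json

def patch_rope_scaling(config_path):
    """Rewrite rope_scaling type "longrope" -> "su" in a config.json.

    "longrope" is the same scaling scheme under a new name, so a
    converter that only recognizes "su" accepts the file after this
    one-word change (see the discussion above).
    """
    with open(config_path) as f:
        cfg = json.load(f)
    scaling = cfg.get("rope_scaling") or {}
    if scaling.get("type") == "longrope":
        scaling["type"] = "su"
        with open(config_path, "w") as f:
            json.dump(cfg, f, indent=2)
    return cfg
```

Note the caveat raised below: this only makes the file convert; it does not by itself guarantee correct long-context behavior.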

@bartowski1182
Contributor

@foldl I'm pretty sure that change alone is not enough to make these models work past 4k; we need an actual longrope implementation, which is not yet supported. The older 128k method from Microsoft was added, but longrope wasn't. That change just allows conversion to fall through, likely uses the wrong method, and will result in a broken model.

@foldl
Contributor

foldl commented Jul 3, 2024

@bartowski1182 I am sure Phi3LongRoPEScaledRotaryEmbedding is just Phi3SuScaledRotaryEmbedding renamed: a new name and nothing else. I am not sure about the status of Phi3SuScaledRotaryEmbedding in llama.cpp. If it is supported, then the June 2024 Update will just work too.
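To make the equivalence concrete, the 'su'/'longrope' scheme rescales each rotary (RoPE) inverse frequency by a learned per-dimension factor and applies an extra attention scale when the context is extended beyond the original training length. The sketch below is an illustration of that idea only; the function names and signatures are assumptions, not the actual Hugging Face or llama.cpp API.

```python
import math

def su_scaled_inv_freq(dim, base, factors):
    """Per-dimension scaled RoPE inverse frequencies.

    Each standard RoPE frequency base**(-2i/dim) is divided by a
    learned rescale factor (the model's short_factor/long_factor
    lists). Identical under both the "su" and "longrope" names.
    """
    inv_freq = [base ** (-2 * i / dim) for i in range(dim // 2)]
    return [f / s for f, s in zip(inv_freq, factors)]

def attention_scale(max_pos, orig_max_pos):
    """Extra attention scaling applied when extending past the
    original training context; 1.0 when no extension is needed."""
    scale = max_pos / orig_max_pos
    if scale <= 1.0:
        return 1.0
    return math.sqrt(1 + math.log(scale) / math.log(orig_max_pos))
```

If this is all the rename changed, then a backend that already handles the 'su' factors correctly would handle 'longrope' checkpoints too, which is exactly the question about llama.cpp's status above.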

@HanClinto
Collaborator

HanClinto commented Jul 3, 2024

Duplicate of #7922 and #6849. Please refer to #6849, #7705 or #8031 to contribute.

Creating intentionally duplicate issues every few days is splitting the discussion across an unhelpful number of threads and making work more difficult. Please search for previously created issues before opening new ones.

Closing this one as duplicate.

Thank you.

@HanClinto closed this as not planned (duplicate) Jul 3, 2024
@HanClinto added the "duplicate" label Jul 3, 2024
@0wwafa
Author

0wwafa commented Jul 3, 2024

Duplicate of #7922 and #6849. Please refer to #6849, #7705 or #8031 to contribute.

Creating intentionally duplicate issues every few days is splitting the discussion across an unhelpful number of threads and making work more difficult. Please search for previously created issues before opening new ones.

Closing this one as duplicate.

Thank you.

I did not do it intentionally. Sorry.


4 participants