
Releases: l3utterfly/llama.cpp

b6029

30 Jul 06:56
a118d80
embeddings: fix extraction of CLS pooling results (#14927)

* embeddings: fix extraction of CLS pooling results

* merge RANK pooling into CLS case for inputs
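For readers unfamiliar with the term, CLS pooling takes the embedding of the first token of each sequence as that sequence's embedding. The sketch below is a generic illustration of the idea in plain Python, not the llama.cpp implementation; the function name `cls_pool` and the flat `seq_ids` layout are assumptions made for the example.

```python
def cls_pool(token_embeddings, seq_ids):
    """Return {sequence id: embedding of the first token of that sequence}.

    token_embeddings: list of per-token vectors for a whole batch
    seq_ids:          list of sequence ids, one per token (hypothetical layout)
    """
    pooled = {}
    for emb, sid in zip(token_embeddings, seq_ids):
        # Only the first token encountered for each sequence is kept.
        if sid not in pooled:
            pooled[sid] = emb
    return pooled


if __name__ == "__main__":
    batch = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    ids = [0, 0, 1]
    print(cls_pool(batch, ids))
```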

b5891

14 Jul 10:13
0d92267
llama : add jinja template for rwkv-world (#14665)

* llama : add jinja template for rwkv-world

Signed-off-by: Molly Sophia

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret

---------

Signed-off-by: Molly Sophia
Co-authored-by: Sigbjørn Skjæret

b5871

11 Jul 15:06
aaa088d
readme : add hot PRs (#14636)

* readme : add hot PRs

* cont

* readme : update title

* readme : hot PRs links

* cont

b5835

06 Jul 11:50
6491d6e
vulkan: increase LOAD_VEC_A to 8 (IQ1/IQ2) or 4 (IQ3) (#14485)

Commit taken from remyoudompheng's PR https://github.com/ggml-org/llama.cpp/pull/12260

Co-authored-by: Rémy Oudompheng

b5581

03 Jun 09:07
71e74a3
opencl: add `backend_synchronize` (#13939)

* This is not needed in normal use, where the result is read
  using `tensor_get`, but it allows the perf mode of `test-backend-ops`
  to measure performance properly.
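The reason a synchronize step matters for benchmarking can be shown with a toy asynchronous backend: if work is only enqueued, stopping the timer without waiting measures submission time rather than execution time. The sketch below is a generic Python illustration under that assumption; `ToyAsyncBackend` and `timed_run` are hypothetical names and do not correspond to the ggml or OpenCL API.

```python
import threading
import time


class ToyAsyncBackend:
    """Hypothetical backend that runs each submitted op on its own thread."""

    def __init__(self):
        self._threads = []
        self._lock = threading.Lock()
        self.completed = 0

    def submit(self, seconds):
        # Returns immediately; the "kernel" finishes later.
        def work():
            time.sleep(seconds)
            with self._lock:
                self.completed += 1

        t = threading.Thread(target=work)
        t.start()
        self._threads.append(t)

    def synchronize(self):
        # Block until every outstanding op has finished.
        for t in self._threads:
            t.join()
        self._threads.clear()


def timed_run(backend, n_ops, seconds_per_op, sync):
    """Time n_ops submissions, optionally waiting for completion."""
    start = time.perf_counter()
    for _ in range(n_ops):
        backend.submit(seconds_per_op)
    if sync:
        backend.synchronize()
    return time.perf_counter() - start


if __name__ == "__main__":
    print("with sync:   ", timed_run(ToyAsyncBackend(), 3, 0.05, sync=True))
    unsynced = ToyAsyncBackend()
    print("without sync:", timed_run(unsynced, 3, 0.05, sync=False))
    unsynced.synchronize()  # clean up outstanding work before exiting
```

Without the synchronize call the measured time typically reflects only the cost of launching the work, which is why a benchmark harness needs an explicit wait.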

b5416

19 May 08:46
33d7aed
CANN: Support MOE Model MUL_MAT_ID (#13042)

Signed-off-by: noemotiovon

b5158

20 Apr 05:37
0013715
Disable CI cross-compile builds (#13022)

b5061

07 Apr 10:07
916c83b
musa: fix compilation warnings in mp_22/31 (#12780)

Signed-off-by: Xiaodong Ye

b4959

26 Mar 05:23
53af4db
convert: fix Mistral3/Gemma3 model hparams init (#12571)

* Fix Mistral3/Gemma3 model hparams init

* set positional args correctly

* use existing hparams if passed
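The "use existing hparams if passed" pattern can be sketched as a constructor that only loads hyperparameters from disk when the caller did not supply them. This is a minimal illustration, not the code in the conversion script; `ToyModel`, `load_hparams`, and the `config.json` layout are assumptions made for the example.

```python
import json
from pathlib import Path


def load_hparams(dir_model):
    """Hypothetical loader: read hyperparameters from <dir_model>/config.json."""
    with open(Path(dir_model) / "config.json", encoding="utf-8") as f:
        return json.load(f)


class ToyModel:
    """Hypothetical model wrapper illustrating the optional-hparams pattern."""

    def __init__(self, dir_model, hparams=None):
        self.dir_model = Path(dir_model)
        # Reuse hparams supplied by the caller; fall back to disk otherwise.
        self.hparams = load_hparams(self.dir_model) if hparams is None else hparams


if __name__ == "__main__":
    # No file access happens here because hparams are passed explicitly.
    model = ToyModel("some/model/dir", hparams={"num_hidden_layers": 2})
    print(model.hparams)
```

Passing `hparams` by keyword, as in the usage above, also avoids the kind of positional-argument mix-up that the second bullet refers to.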

b4913

18 Mar 11:32
35cae5b
SYCL: using graphs is configurable by environment variable and compil…