Merged
1167 commits
19f6518
cmake: add ggml find package (#11369)
bandoti Jan 26, 2025
6f53d8a
docker: add missing vulkan library to base layer and update to 24.04 …
rare-magma Jan 26, 2025
178a7eb
metal : use residency sets (#11427)
ggerganov Jan 26, 2025
caf773f
docker : fix ARM build and Vulkan build (#11434)
ngxson Jan 26, 2025
acd38ef
metal: Handle null returned from MTLCreateSystemDefaultDevice() (#11441)
Jan 27, 2025
df984e0
llama: refactor llama_decode_impl (#11381)
JohannesGaessler Jan 27, 2025
a5203b4
llama : minor fixes for up llama load model speed (#11448)
lexasub Jan 27, 2025
d6d24cd
AMD: parse the architecture as supplied by gcnArchName (#11244)
Haus1 Jan 27, 2025
a4417dd
Add new hf protocol for ollama (#11449)
ericcurtin Jan 27, 2025
2b8525d
Handle missing model in CLI parameters for llama-run (#11399)
engelmi Jan 28, 2025
6e84b0a
SYCL : SOFTMAX F16 mask support and other fixes (#11261)
qnixsynapse Jan 28, 2025
f643120
docker: add perplexity and bench commands to full image (#11438)
rare-magma Jan 28, 2025
4bf3119
cmake : don't fail on `GGML_CPU=OFF` (#11457)
someone13574 Jan 28, 2025
d7d1ecc
docker: allow installing pip packages system-wide (#11437)
rare-magma Jan 28, 2025
7fee288
Add github protocol pulling and http:// (#11465)
ericcurtin Jan 28, 2025
cae9fb4
HIP: Only call rocblas_initialize on rocblas versions with the multip…
sARY77 Jan 28, 2025
be5ef79
HIP: Supress transformation warning in softmax.cu
IMbackK Jan 28, 2025
d0c0804
ci : fix build CPU arm64 (#11472)
ngxson Jan 28, 2025
cf8cc85
server : Fixed wrong function name in llamacpp server unit test (#11473)
peidaqi Jan 28, 2025
794fe23
cmake: add hints for locating ggml on Windows using Llama find-packag…
Emreerdog Jan 28, 2025
325afb3
llama: fix missing k_cache store for rwkv6qwen2 (#11445)
MollySophia Jan 29, 2025
b636228
embedding : enable --no-warmup option (#11475)
danbev Jan 29, 2025
d2e518e
ggml-cpu : fix ggml_graph_compute_thread did not terminate on abort. …
issixx Jan 17, 2025
1a0e87d
ggml : add option to not print stack on abort (ggml/1081)
WilliamTambellini Jan 23, 2025
8158577
sync : ggml
ggerganov Jan 29, 2025
f0d4b29
Parse https://ollama.com/library/ syntax (#11480)
ericcurtin Jan 29, 2025
2711d02
vulkan: Catch pipeline creation failure and print an error message (#…
jeffbolznv Jan 29, 2025
e51c47b
server : update auto gen files comments [no ci] (#11484)
danbev Jan 29, 2025
66ee4f2
vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)
remyoudompheng Jan 29, 2025
eb7cf15
server : add /apply-template endpoint for additional use cases of Min…
pnb Jan 29, 2025
e044976
server : update json snippets in README.md [no ci] (#11492)
danbev Jan 30, 2025
7919256
readme : reference examples relative links (#11505)
guspan-tanadi Jan 30, 2025
496e5bf
server : (docs) added response format for /apply-template [no ci] (#1…
isaac-mcfadyen Jan 30, 2025
4314e56
server : use lambda instead of std::bind (#11507)
danbev Jan 30, 2025
ffd0821
vocab : correctly identify LF token for GPT-2 style BPE tokenizer (#1…
mgroeber9110 Jan 30, 2025
3d804de
sync: minja (#11499)
ochafik Jan 30, 2025
c300e68
CUDA/HIP: add warp_size to cuda_device_info
IMbackK Jan 29, 2025
6af1ca4
HIP: Prepare reduction operators for wave 64
IMbackK Jan 29, 2025
27d135c
HIP: require at least HIP 5.5
IMbackK Jan 29, 2025
8b576b6
Tool call support (generic + native for Llama, Functionary, Hermes, M…
ochafik Jan 30, 2025
553f1e4
`ci`: ccache for all github worfklows (#11516)
ochafik Jan 30, 2025
a2df278
server : update help metrics processing/deferred (#11512)
danbev Jan 31, 2025
1bd3047
common: Add missing va_end (#11529)
stevegrubb Jan 31, 2025
4a2b196
server : fix --jinja when there's no tools or schema (typo was forcin…
ochafik Jan 31, 2025
5783575
Fix chatml fallback for unsupported builtin templates (when --jinja n…
ochafik Jan 31, 2025
b1bcd30
fix stop regression (#11543)
ochafik Jan 31, 2025
a83f528
`tool-call`: fix llama 3.x and functionary 3.2, play nice w/ pydantic…
ochafik Jan 31, 2025
aa6fb13
`ci`: use sccache on windows instead of ccache (#11545)
ochafik Jan 31, 2025
5bbc736
ci: simplify cmake build commands (#11548)
ochafik Feb 1, 2025
ecef206
Implement s3:// protocol (#11511)
ericcurtin Feb 1, 2025
cfd74c8
`sync`: minja (https://github.com/google/minja/commit/418a2364b56dc9b…
ochafik Feb 1, 2025
53debe6
ci: use sccache on windows HIP jobs (#11553)
ochafik Feb 1, 2025
0cec062
llama : add support for GLM-Edge and GLM-Edge-V series models (#10573)
piDack Feb 2, 2025
ff22770
sampling : support for llguidance grammars (#10224)
mmoskal Feb 2, 2025
6980448
Fix exotic ci env that lacks ostringstream::str (#11581)
ochafik Feb 2, 2025
bfcce4d
`tool-call`: support Command R7B (+ return tool_plan "thoughts" in AP…
ochafik Feb 2, 2025
84ec8a5
Name colors (#11573)
ericcurtin Feb 2, 2025
864a0b6
CUDA: use mma PTX instructions for FlashAttention (#11583)
JohannesGaessler Feb 2, 2025
90f9b88
nit: more informative crash when grammar sampler fails (#11593)
ochafik Feb 2, 2025
4d0598e
HIP: add GGML_CUDA_CC_IS_* for amd familys as increasing cc archtectu…
IMbackK Feb 2, 2025
396856b
CUDA/HIP: add support for selectable warp size to mmv (#11519)
IMbackK Feb 2, 2025
6eecde3
HIP: fix flash_attn_stream_k_fixup warning (#11604)
JohannesGaessler Feb 2, 2025
d92cb67
server : (webui) Fix Shift+Enter handling (#11609)
mashdragon Feb 3, 2025
21c84b5
CUDA: fix Volta FlashAttention logic (#11615)
JohannesGaessler Feb 3, 2025
8ec0583
sync : ggml
ggerganov Feb 3, 2025
5598f47
server : remove CPPHTTPLIB_NO_EXCEPTIONS define (#11622)
danbev Feb 3, 2025
1d1e6a9
server : (webui) allow typing and submitting during llm response (#11…
woof-dog Feb 3, 2025
b345178
server : (webui) revert hacky solution from #11626 (#11634)
ngxson Feb 3, 2025
cde3833
`tool-call`: allow `--chat-template chatml` w/ `--jinja`, default to …
ochafik Feb 3, 2025
b34aedd
ci : do not stale-close roadmap issues
ggerganov Feb 4, 2025
8f8290a
cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096)
ckastner Feb 3, 2025
7c9e0ca
sync : ggml
ggerganov Feb 4, 2025
387a159
authors : update
ggerganov Feb 4, 2025
534c46b
metal : use residency set for other platforms (#11648)
jhen0409 Feb 4, 2025
f117d84
swift : fix llama-vocab api usage (#11645)
jhen0409 Feb 4, 2025
106045e
readme : add llm_client Rust crate to readme bindings (#11628)
ShelbyJenkins Feb 4, 2025
db288b6
`tool-call`: command r7b fix for normal responses (#11608)
ochafik Feb 4, 2025
1bef571
arg : list RPC devices first when using --list-devices (#11655)
rgerganov Feb 4, 2025
3962fc1
server : add try..catch to places not covered by set_exception_handle…
ngxson Feb 4, 2025
3ec9fd4
HIP: force max threads per block to be 1024 (#11621)
fxzjshm Feb 4, 2025
fd08255
CUDA: non-contiguous (RMS) norm support (#11659)
JohannesGaessler Feb 4, 2025
9f4cc8f
`sync`: minja (#11641)
ochafik Feb 5, 2025
1ec2080
llava: add quantization for the visual projector LLAVA, Qwen2VL (#11644)
samkoesnadi Feb 5, 2025
fa62da9
CUDA: support for mat. mul. with ne03 != ne13 (#11656)
JohannesGaessler Feb 5, 2025
d774ab3
metal : adjust support conditions for norm operators (#11671)
ggerganov Feb 5, 2025
c3db048
readme : add link to Autopen under UIs (#11684)
blackhole89 Feb 6, 2025
902368a
metal : avoid breaking build when metal API predates TARGET_OS_VISION…
charles-dyfis-net Feb 6, 2025
1b598b3
vulkan: use smaller combined allocations to avoid fragmentation (#11551)
jeffbolznv Feb 6, 2025
8a7e3bf
vulkan: initial support for IQ4_XS quantization (#11501)
remyoudompheng Feb 6, 2025
2c6c8df
vulkan: optimize coopmat2 iq2/iq3 callbacks (#11521)
jeffbolznv Feb 6, 2025
8d4d2be
ggml : fix LoongArch compile error with 128-bit SIMD (#11701)
junchao-loongson Feb 6, 2025
c0d4843
build : fix llama.pc (#11658)
angt Feb 6, 2025
9dd7a03
llama : add log about loading model tensors (#11699)
ggerganov Feb 6, 2025
194b2e6
SYCL: Adjust support condition for norm operators (#11674)
qnixsynapse Feb 6, 2025
9ab42dc
docs: update fedora cuda guide for 12.8 release (#11393)
teihome Feb 6, 2025
2fb3c32
server : (webui) migrate project to ReactJS with typescript (#11688)
ngxson Feb 6, 2025
1d20e53
rpc: fix known RCE in rpc-server (ggml/1103)
retr0reg Feb 6, 2025
8a59053
sync : ggml
ggerganov Feb 6, 2025
855cd07
llama : fix old glm4 models (#11670)
tv1wnd Feb 6, 2025
225bbbf
ggml : optimize and build warning fix for LoongArch (#11709)
MQ-mengqing Feb 7, 2025
b7552cf
common : add default embeddings presets (#11677)
danbev Feb 7, 2025
ec3bc82
SYCL: remove XMX info from print devices (#11712)
qnixsynapse Feb 7, 2025
7ee953a
llama : add llama_sampler_init for safe usage of llama_sampler_free (…
cfillion Feb 7, 2025
c026ba3
vulkan: print shared memory size (#11719)
jeffbolznv Feb 7, 2025
333820d
llama : fix progress dots (#11730)
magicse Feb 7, 2025
2d219b3
vocab : ignore invalid UTF-8 input in the BPE tokenizer (#11729)
cfillion Feb 7, 2025
ed926d8
llama : fix defrag logic (#11707)
ggerganov Feb 7, 2025
d2fe216
Make logging more verbose (#11714)
ericcurtin Feb 7, 2025
0cf8671
server : (webui) fix numeric settings being saved as string (#11739)
ngxson Feb 8, 2025
3ab410f
readme : update front-end framework (#11753)
pothitos Feb 8, 2025
d80be89
CUDA: fix min. version for movmatrix (#11751)
JohannesGaessler Feb 8, 2025
4d3465c
ggml: Fix data race in ggml threadpool (#11736)
kkontny Feb 8, 2025
bdcf8b6
cont : fix mmap flag print (#11699)
ggerganov Feb 8, 2025
aaa5505
server : minor log updates (#11760)
ggerganov Feb 8, 2025
e6e6583
server : (webui) increase edit textarea size (#11763)
woof-dog Feb 8, 2025
55ac8c7
server : (webui) revamp Settings dialog, add Pyodide interpreter (#11…
ngxson Feb 8, 2025
98f6b0f
vulkan: account for lookup tables when checking shared memory size (#…
jeffbolznv Feb 9, 2025
19d3c82
There's a better way of clearing lines (#11756)
ericcurtin Feb 9, 2025
b044a0f
vulkan: add environment variable GGML_VK_PREFER_HOST_MEMORY to avoid …
wbruna Feb 10, 2025
c2a67ef
vulkan: Make Vulkan optional at runtime (#11493). (#11494)
daym Feb 10, 2025
9ac3457
Update README.md [no ci] (#11781)
pascal-lc Feb 10, 2025
d7b31a9
sync: minja (https://github.com/google/minja/commit/a72057e5190de2c61…
ochafik Feb 10, 2025
0893e01
server : correct signal handler (#11795)
ngxson Feb 10, 2025
19b392d
llama-mmap: fix missing include (#11796)
wgottwalt Feb 10, 2025
507f917
server : (webui) introduce conversation branching + idb storage (#11792)
ngxson Feb 10, 2025
8173261
docs: utilize the forward slash (/) as the path separator for Unix-li…
MambaWong Feb 10, 2025
7b891bd
fix: typos in documentation files (#11791)
maximevtush Feb 10, 2025
b9ab0a4
CUDA: use arch list for compatibility check (#11775)
JohannesGaessler Feb 10, 2025
a18f481
server : use common_token_to_piece instead of common_detokenize (#11740)
danbev Feb 11, 2025
90e4dba
Fix #11802: Compile bug - RegQueryValueExA changed to RegQueryValueEx…
sheldonrobinson Feb 11, 2025
4078c77
docs: add OpenCL (#11697)
lhez Feb 11, 2025
369be55
llama : fix typo in llama-grammar.h [no ci] (#11816)
danbev Feb 12, 2025
c3d6af7
CUDA: fix CUDART_VERSION checks (#11821)
JohannesGaessler Feb 12, 2025
198b1ec
ggml-cpu: Fix duplicate MATMUL_INT8 (#11817)
ownia Feb 12, 2025
748ee9f
ggml : fix multi-threaded clamp_f32 (#11824)
Burton2000 Feb 12, 2025
fef0cbe
cleanup: fix compile warnings associated with gnu_printf (#11811)
bandoti Feb 12, 2025
e598697
HIP: Switch to std::vector in rocblas version check (#11820)
IMbackK Feb 12, 2025
0fb77f8
sync : ggml
ggerganov Feb 12, 2025
bfd11a2
Fix: Compile failure due to Microsoft STL breaking change (#11836)
MrSMlT Feb 12, 2025
5c4284d
HIP: Remove GCN from list of devices that avoid MMQ (#11831)
IMbackK Feb 12, 2025
31afcbe
server : (webui) Give copy button back to all message bubbles (#11814)
woof-dog Feb 12, 2025
be3bbd6
ggml : x2 speed for WASM by optimizing SIMD (#11453)
ngxson Feb 12, 2025
a394039
ggml-cpu : add chunking support to mul_mat_id (#11666)
slaren Feb 13, 2025
3e69319
llama : update llama_decode_internal ref [no ci] (#11840)
danbev Feb 13, 2025
e437627
llama.cpp: fix warning message (#11839)
okuvshynov Feb 13, 2025
27e8a23
sampling: add Top-nσ sampler (#11223)
VJHack Feb 13, 2025
c7f460a
`server`: fix tool-call of DeepSeek R1 Qwen, return reasoning_content…
ochafik Feb 13, 2025
bd6e55b
musa: bump MUSA SDK version to rc3.1.1 (#11822)
yeahdongcn Feb 13, 2025
c48f630
llama : add --completion-bash option (#11846)
danbev Feb 13, 2025
c1f958c
server : (docs) Update wrong tool calling example (#11809)
RezaRahemtola Feb 13, 2025
8a8c4ce
llamafile: use member variable instead of constant for iq4nlt (#11780)
jmorganca Feb 13, 2025
04045bb
readme : minor
ggerganov Feb 13, 2025
a7b8ce2
llama-bench : fix unexpected global variable initialize sequence issu…
theraininsky Feb 14, 2025
a4f011e
vulkan: linux builds + small subgroup size fixes (#11767)
netrunnereve Feb 14, 2025
38e32eb
ggml: optimize some vec dot functions for LoongArch ASX (#11842)
MQ-mengqing Feb 14, 2025
3d68f03
llama : add completion for --chat-template-file (#11860)
danbev Feb 14, 2025
dbc2ec5
docker : drop to CUDA 12.4 (#11869)
ggerganov Feb 14, 2025
94b87f8
cuda : add ampere to the list of default architectures (#11870)
slaren Feb 14, 2025
300907b
opencl: Fix rope and softmax (#11833)
lhez Feb 14, 2025
89daa25
llguidance build fixes for Windows (#11664)
mmoskal Feb 14, 2025
fc1b0d0
vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)
remyoudompheng Feb 15, 2025
f355229
server: fix type promotion typo causing crashes w/ --jinja w/o tools …
ochafik Feb 15, 2025
68ff663
repo : update links to new url (#11886)
ggerganov Feb 15, 2025
c2cd24f
readme : add notice about new package registry (#11890)
ggerganov Feb 15, 2025
2288510
metal : optimize dequant q6_K kernel (#11892)
akretz Feb 15, 2025
fc10c38
examples: fix typo in imatrix/README.md (#11884)
708-145 Feb 15, 2025
6dde178
scripts: fix compare-llama-bench commit hash logic (#11891)
JohannesGaessler Feb 15, 2025
c2ea16f
metal : fix the crash caused by the lack of residency set support on …
halechan Feb 16, 2025
bf42a23
vulkan: support multi/vision rope, and noncontiguous rope (#11902)
jeffbolznv Feb 16, 2025
818a340
ci : fix (again) arm64 build fails (#11895)
ngxson Feb 16, 2025
fe163d5
common : Fix a typo in help (#11899)
standby24x7 Feb 16, 2025
0f2bbe6
server : bump httplib to 0.19.0 (#11908)
ngxson Feb 16, 2025
2eea03d
vulkan: implement several ops relevant for ggml_opt (#11769)
remyoudompheng Feb 17, 2025
c4d29ba
server : fix divide-by-zero in metrics reporting (#11915)
aviallon Feb 17, 2025
f7b1116
update release requirements (#11897)
netrunnereve Feb 17, 2025
73e2ed3
CUDA: use async data loading for FlashAttention (#11894)
JohannesGaessler Feb 17, 2025
09aaf4f
docs : Fix duplicated file extension in test command (#11935)
xiaobing318 Feb 18, 2025
5137da7
scripts: corrected encoding when getting chat template (#11866) (#11907)
MoonRide303 Feb 18, 2025
63ac128
server : add TEI API format for /rerank endpoint (#11942)
ngxson Feb 18, 2025
63e489c
tool-call: refactor common chat / tool-call api (+ tests / fixes) (#1…
ochafik Feb 18, 2025
b58934c
server : (webui) Enable communication with parent html (if webui is i…
igardev Feb 18, 2025
9626d93
llama : fix indentation in llama-grammar [no ci] (#11943)
danbev Feb 19, 2025
abd4d0b
speculative : update default params (#11954)
ggerganov Feb 19, 2025
d07c621
common : add llama.vim preset for Qwen2.5 Coder (#11945)
danbev Feb 19, 2025
d04e716
doc: add links to ggml examples [no ci] (#11958)
JohannesGaessler Feb 19, 2025
0d55958
run : add --chat-template-file (#11961)
engelmi Feb 20, 2025
4806498
ggml: aarch64: implement SVE kernels for q3_K_q8_K vector dot (#11917)
Vithulep Feb 20, 2025
c5d91a7
ggml-cpu: Add CPU backend support for KleidiAI library (#11390)
chaxu01 Feb 20, 2025
c392e50
server (webui): Fix Premature Submission During IME Conversion (#11971)
mmngays Feb 20, 2025
ee02ad0
clip : fix visual encoders with no CLS (#11982)
alex-jw-brooks Feb 21, 2025
0b3863f
MUSA: support ARM64 and enable dp4a .etc (#11843)
BodhiHu Feb 21, 2025
ecc8e3a
CUDA: correct the lowest Maxwell supported by CUDA 12 (#11984)
PureJourney Feb 21, 2025
586d5fe
doc: update contributing guidelines [no ci] (#11969)
JohannesGaessler Feb 21, 2025
51f311e
llama : skip loading unused tensors (#12004)
ggerganov Feb 21, 2025
de8b5a3
llama.swiftui : add "Done" dismiss button to help view (#11998)
danbev Feb 22, 2025
d709084
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#12000)
gcp Feb 22, 2025
cf756d6
server : disable Nagle's algorithm (#12020)
ggerganov Feb 22, 2025
335eb04
ci : Build on Github-hosted arm64 runners (#12009)
Rohanjames1997 Feb 22, 2025
5fa07c2
CUDA: optimize FA for GQA + large batches (#12014)
JohannesGaessler Feb 22, 2025
f3e6485
ci : fix arm upload artifacts (#12024)
ggerganov Feb 22, 2025
36c258e
llava: build clip image from pixels (#11999)
tinglou Feb 22, 2025
a28e0d5
CUDA: app option to compile without FlashAttention (#12025)
JohannesGaessler Feb 22, 2025
af7747c
ggml-cpu: Support s390x SIMD Instruction Set (#12019)
taronaeo Feb 22, 2025
f777a73
Some llama-run cleanups (#11973)
ericcurtin Feb 23, 2025
7ad0779
run: allow to customize prompt by env var LLAMA_PROMPT_PREFIX (#12041)
benoitf Feb 23, 2025
8303e8b
SYCL: Fix GGML_SYCL_DEBUG macro (#11995)
qnixsynapse Feb 24, 2025
651adf4
gguf_convert_endian.py: implement byteswapping for q4_k and q6_k (#11…
AlekseiNikiforovIBM Feb 24, 2025
08d5986
[SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035)
NeoZhangJianyu Feb 24, 2025
7a2c913
llava : Add Granite Vision Support (#11794)
alex-jw-brooks Feb 24, 2025
34a846b
opencl: fix for small models (#11950)
lhez Feb 24, 2025
58d07a8
metal : copy kernels for quant to F32/F16 conversions (#12017)
gcp Feb 25, 2025
3e9a286
llama : expose llama_model_n_head_kv in the API (#11997)
vlovich Feb 25, 2025
4d1051a
Add Doc for Converting Granite Vision -> GGUF (#12006)
alex-jw-brooks Feb 25, 2025
0b52745
server: support add_generation_prompt query param (#12062)
ochafik Feb 25, 2025
61d4f39
vulkan: implement more backpropagation operators (#11914)
remyoudompheng Feb 25, 2025
393fca6
ggml-cpu: Fix build with sve (#12059)
MollySophia Feb 25, 2025
c132239
add OP sigmoid (#12056)
foldl Feb 25, 2025
401af80
server: handle echo=false on /v1/completions (#12060)
rhjdvsgsgks Feb 25, 2025
a82c9e7
vulkan: fix assertion when qy_needs_dequant (#12068)
jeffbolznv Feb 25, 2025
d7cfe1f
docs: add docs/function-calling.md to lighten server/README.md's plig…
ochafik Feb 25, 2025
53e4db1
readme : update infra list (#9096)
kerthcet Feb 26, 2025
3567ee3
gguf-py: enable reading non-native endian files (#12081)
AlekseiNikiforovIBM Feb 26, 2025
69050a1
Refactor gguf scripts to improve metadata handling (#11909)
CISC Feb 26, 2025
a800ae4
llava : add struct for FFI bindgen (#12079)
tinglou Feb 26, 2025
b95c8af
cmake: Fix ggml backend dependencies and installation (#11818)
vvuksanovic Feb 27, 2025
581650b
vulkan: improve im2col (#11826)
daniandtheweb Feb 28, 2025
fbeda90
vulkan: matmul dequantization improvements (#12015)
netrunnereve Feb 28, 2025
673cfef
CANN: Fix build error with GCC 13 (#11990)
hipudding Feb 28, 2025
05e6f5a
ggml: aarch64: implement SVE kernels for q2_k_q8_k vector dot (#12064)
Vithulep Feb 28, 2025
9c42b17
CUDA: fix logic for V100 + GGML_CUDA_FORCE_MMQ (#12098)
JohannesGaessler Feb 28, 2025
438a839
vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizatio…
remyoudompheng Feb 28, 2025
84d5f4b
Update granite vision docs for 3.2 model (#12105)
alex-jw-brooks Feb 28, 2025
c43a3e7
llama : add Phi-4-mini support (supersede #12099) (#12108)
ngxson Feb 28, 2025
70680c4
ggml : upgrade init_tensor API to return a ggml_status (#11854)
WilliamTambellini Feb 28, 2025
06c2b15
convert : fix Norway problem when parsing YAML (#12114)
ngxson Feb 28, 2025
abdab54
Manually updates examples/CMakeLists.txt
xsxszab Mar 3, 2025
249b205
Fully Updates llama.cpp version.
xsxszab Mar 3, 2025
12d7d06
Updates android example.
xsxszab Mar 3, 2025
e022286
Updating examples.
xsxszab Mar 3, 2025
67d62a5
Partially updated examples.
xsxszab Mar 3, 2025
5f72744
Nexa models currently not working, disable them for now.
xsxszab Mar 3, 2025
27e84d8
update
xsxszab Mar 3, 2025
4f75568
Fixes omni vlm.
xsxszab Mar 4, 2025
5f306d1
Updates README.
xsxszab Mar 4, 2025
22abab5
Removed unnecessary kompute dependency.
xsxszab Mar 4, 2025
d6f8965
Adds update date to readme.
xsxszab Mar 4, 2025
0ee75e7
Merges master into current branch.
xsxszab Mar 5, 2025
4c7c557
Simplifies code base.
xsxszab Mar 5, 2025
f6c655b
Adds todos.
xsxszab Mar 5, 2025
41aa79a
Merge pull request #47 from NexaAI/yifei/sync_upstream
Davidqian123 Mar 5, 2025
161 changes: 161 additions & 0 deletions .clang-format
@@ -0,0 +1,161 @@
---
Language: Cpp
AlignAfterOpenBracket: Align
AlignArrayOfStructures: Left
AlignConsecutiveAssignments: AcrossComments
AlignConsecutiveBitFields: AcrossComments
AlignConsecutiveDeclarations: AcrossComments
AlignConsecutiveMacros: AcrossComments
# AlignConsecutiveShortCaseStatements: AcrossComments
AlignEscapedNewlines: Left # LeftWithLastLine
AlignOperands: Align
AlignTrailingComments:
  Kind: Always
  OverEmptyLines: 1
AllowAllArgumentsOnNextLine: true
AllowAllParametersOfDeclarationOnNextLine: false
# AllowBreakBeforeNoexceptSpecifier: OnlyWithParen
AllowShortBlocksOnASingleLine: Never
AllowShortCaseLabelsOnASingleLine: false
AllowShortFunctionsOnASingleLine: Inline
AllowShortIfStatementsOnASingleLine: Never
AllowShortLambdasOnASingleLine: Inline
AllowShortLoopsOnASingleLine: false
AlwaysBreakBeforeMultilineStrings: true
BinPackArguments: true
BinPackParameters: true # OnePerLine
BitFieldColonSpacing: Both
BreakBeforeBraces: Custom # Attach
BraceWrapping:
  AfterCaseLabel: true
  AfterClass: false
  AfterControlStatement: false
  AfterEnum: false
  AfterFunction: false
  AfterNamespace: false
  AfterObjCDeclaration: false
  AfterStruct: false
  AfterUnion: false
  AfterExternBlock: false
  BeforeCatch: false
  BeforeElse: false
  BeforeLambdaBody: false
  BeforeWhile: false
  IndentBraces: false
  SplitEmptyFunction: false
  SplitEmptyRecord: false
  SplitEmptyNamespace: false
# BreakAdjacentStringLiterals: true
BreakAfterAttributes: Never
BreakBeforeBinaryOperators: None
BreakBeforeInlineASMColon: OnlyMultiline
BreakBeforeTernaryOperators: false
# BreakBinaryOperations: Never
BreakConstructorInitializers: AfterColon
# BreakFunctionDefinitionParameters: false
BreakInheritanceList: AfterComma
BreakStringLiterals: true
# BreakTemplateDeclarations: Yes
ColumnLimit: 120
CommentPragmas: '^ IWYU pragma:'
CompactNamespaces: false
ConstructorInitializerIndentWidth: 4
ContinuationIndentWidth: 4
Cpp11BracedListStyle: false
DerivePointerAlignment: false
DisableFormat: false
EmptyLineBeforeAccessModifier: Leave
EmptyLineAfterAccessModifier: Never
ExperimentalAutoDetectBinPacking: false
FixNamespaceComments: true
IncludeBlocks: Regroup
IncludeCategories:
  - Regex: '^<.*\.h>'
    Priority: 1
    SortPriority: 0
  - Regex: '^<.*'
    Priority: 2
    SortPriority: 0
  - Regex: '.*'
    Priority: 3
    SortPriority: 0
IncludeIsMainRegex: '([-_](test|unittest))?$'
IncludeIsMainSourceRegex: ''
IndentAccessModifiers: false
IndentCaseBlocks: true
IndentCaseLabels: true
IndentExternBlock: NoIndent
IndentGotoLabels: false
IndentPPDirectives: AfterHash
IndentWidth: 4
IndentWrappedFunctionNames: false
InsertBraces: true # NOTE: may lead to incorrect formatting
InsertNewlineAtEOF: true
JavaScriptQuotes: Leave
JavaScriptWrapImports: true
KeepEmptyLinesAtTheStartOfBlocks: false
LambdaBodyIndentation: Signature
LineEnding: LF
MacroBlockBegin: ''
MacroBlockEnd: ''
MaxEmptyLinesToKeep: 1
NamespaceIndentation: None
ObjCBinPackProtocolList: Auto
ObjCBlockIndentWidth: 4
ObjCSpaceAfterProperty: true
ObjCSpaceBeforeProtocolList: true
PPIndentWidth: -1
PackConstructorInitializers: CurrentLine
PenaltyBreakAssignment: 2
PenaltyBreakBeforeFirstCallParameter: 1
PenaltyBreakComment: 300
PenaltyBreakFirstLessLess: 120
PenaltyBreakString: 1000
PenaltyBreakTemplateDeclaration: 10
PenaltyExcessCharacter: 1000000
PenaltyReturnTypeOnItsOwnLine: 200
PointerAlignment: Middle
QualifierAlignment: Left
#QualifierOrder: ['static', 'inline', 'friend', 'constexpr', 'const', 'volatile', 'type', 'restrict']
RawStringFormats:
  - Language: Cpp
    Delimiters:
      - cc
      - CC
      - cpp
      - Cpp
      - CPP
      - 'c++'
      - 'C++'
    CanonicalDelimiter: ''
ReferenceAlignment: Middle
ReflowComments: false # IndentOnly
SeparateDefinitionBlocks: Always
SortIncludes: CaseInsensitive
SortUsingDeclarations: LexicographicNumeric
SpaceAfterCStyleCast: true
SpaceAfterLogicalNot: false
SpaceAfterTemplateKeyword: true
SpaceBeforeAssignmentOperators: true
SpaceBeforeCpp11BracedList: false
SpaceBeforeCtorInitializerColon: true
SpaceBeforeInheritanceColon: true
SpaceBeforeParens: ControlStatements
SpaceBeforeRangeBasedForLoopColon: true
SpaceInEmptyBlock: false
SpaceInEmptyParentheses: false
SpacesBeforeTrailingComments: 2
SpacesInAngles: Never
SpacesInContainerLiterals: true
SpacesInLineCommentPrefix:
  Minimum: 1
  Maximum: -1
SpacesInParentheses: false
SpacesInSquareBrackets: false
SpaceBeforeSquareBrackets: false
Standard: c++17
TabWidth: 4
UseTab: Never
WhitespaceSensitiveMacros: ['STRINGIZE']
...
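
To make a few of the `.clang-format` options above concrete, here is a small hand-formatted sketch of C++ written the way those settings would be expected to lay it out. It was not produced by running clang-format, and the function names `count_spaces` and `space_percent` are hypothetical, chosen only for illustration. The options it tries to show are `IndentWidth: 4`, `PointerAlignment: Middle`, `ReferenceAlignment: Middle`, `QualifierAlignment: Left`, `InsertBraces: true`, `SpaceAfterCStyleCast: true`, `SpaceBeforeParens: ControlStatements`, attached function braces (`AfterFunction: false`), and the `IncludeCategories` ordering that puts `<*.h>` headers in a block ahead of other angle-bracket headers.

```cpp
#include <stddef.h>

#include <string>

// PointerAlignment: Middle -> spaces on both sides of '*'.
// QualifierAlignment: Left -> 'const' is written before the type.
size_t count_spaces(const char * text) {
    size_t n = 0;
    for (const char * p = text; *p != '\0'; ++p) {
        // InsertBraces: true -> braces even around a single statement.
        if (*p == ' ') {
            n++;
        }
    }
    return n;
}

// ReferenceAlignment: Middle -> spaces on both sides of '&'.
// SpaceAfterCStyleCast: true -> a space follows each C-style cast.
int space_percent(const std::string & text) {
    if (text.empty()) {
        return 0;
    }
    return (int) (100.0 * (double) count_spaces(text.c_str()) / (double) text.size());
}
```

Running `clang-format -i` on a file in a directory covered by this `.clang-format` is the authoritative way to see the real output; the sketch above only approximates it.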

26 changes: 26 additions & 0 deletions .clang-tidy
@@ -0,0 +1,26 @@
---
Checks: >
    bugprone-*,
    -bugprone-easily-swappable-parameters,
    -bugprone-implicit-widening-of-multiplication-result,
    -bugprone-misplaced-widening-cast,
    -bugprone-narrowing-conversions,
    readability-*,
    -readability-avoid-unconditional-preprocessor-if,
    -readability-function-cognitive-complexity,
    -readability-identifier-length,
    -readability-implicit-bool-conversion,
    -readability-magic-numbers,
    -readability-uppercase-literal-suffix,
    -readability-simplify-boolean-expr,
    clang-analyzer-*,
    -clang-analyzer-security.insecureAPI.DeprecatedOrUnsafeBufferHandling,
    performance-*,
    portability-*,
    -portability-simd-intrinsics,
    misc-*,
    -misc-const-correctness,
    -misc-non-private-member-variables-in-classes,
    -misc-no-recursion,
    -misc-use-anonymous-namespace,
FormatStyle: none
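
The `Checks` value above is a comma-separated list of globs that clang-tidy processes in order: a glob without a `-` prefix adds matching checks to the enabled set, and a glob with a `-` prefix removes them. The following is a minimal sketch of that selection rule, not clang-tidy's actual implementation. It assumes the set starts empty, supports only the `*` wildcard, and uses the hypothetical helper names `glob_match` and `check_enabled`.

```cpp
#include <stddef.h>

#include <string>
#include <vector>

// Returns true if `name` matches `pattern`, where '*' matches any run of characters.
bool glob_match(const std::string & pattern, const std::string & name) {
    size_t p    = 0;
    size_t n    = 0;
    size_t star = std::string::npos;  // position of the most recent '*' in pattern
    size_t mark = 0;                  // position in name where that '*' started matching
    while (n < name.size()) {
        if (p < pattern.size() && pattern[p] == '*') {
            star = p++;
            mark = n;
        } else if (p < pattern.size() && pattern[p] == name[n]) {
            ++p;
            ++n;
        } else if (star != std::string::npos) {
            // Backtrack: let the last '*' absorb one more character.
            p = star + 1;
            n = ++mark;
        } else {
            return false;
        }
    }
    while (p < pattern.size() && pattern[p] == '*') {
        ++p;
    }
    return p == pattern.size();
}

// Applies the globs in order; the last glob that matches decides the outcome.
bool check_enabled(const std::vector<std::string> & globs, const std::string & check) {
    bool enabled = false;  // assumption: start from an empty set of checks
    for (const std::string & glob : globs) {
        const bool negative = !glob.empty() && glob[0] == '-';
        if (glob_match(negative ? glob.substr(1) : glob, check)) {
            enabled = !negative;
        }
    }
    return enabled;
}
```

Under this model, with the globs `bugprone-*` followed by `-bugprone-narrowing-conversions`, a check such as `bugprone-use-after-move` stays enabled while `bugprone-narrowing-conversions` is switched off, which is the pattern the configuration above relies on for each check family.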
50 changes: 0 additions & 50 deletions .github/ISSUE_TEMPLATE/01-bug-low.yml

This file was deleted.

50 changes: 0 additions & 50 deletions .github/ISSUE_TEMPLATE/02-bug-medium.yml

This file was deleted.

50 changes: 0 additions & 50 deletions .github/ISSUE_TEMPLATE/03-bug-high.yml

This file was deleted.

50 changes: 0 additions & 50 deletions .github/ISSUE_TEMPLATE/04-bug-critical.yml

This file was deleted.
