Skip to content

ggml-cuda : perform cublas mat mul of quantized types as f16#3412

Merged
slaren merged 3 commits intomasterfrom
cublas-q-f16
Sep 30, 2023
Merged

ggml-cuda : perform cublas mat mul of quantized types as f16#3412
slaren merged 3 commits intomasterfrom
cublas-q-f16

Commits