Clean C language version of quantizing llama2 model and running quantized llama2 model
quantization google-colab quantization-algorithms quantization-efficient-network large-language-models
-
Updated
Sep 8, 2023 - C