1 parent 36b904e commit bc9d3e3
README.md
@@ -557,6 +557,10 @@ python3 convert.py models/7B/
# quantize the model to 4-bits (using q4_0 method)
./quantize ./models/7B/ggml-model-f16.gguf ./models/7B/ggml-model-q4_0.gguf q4_0

+# update the gguf filetype to current if older version is unsupported by another application
+./quantize ./models/7B/ggml-model-q4_0.gguf ./models/7B/ggml-model-q4_0-v2.gguf COPY
+

# run the inference
./main -m ./models/7B/ggml-model-q4_0.gguf -n 128
```
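A minimal end-to-end sketch of the workflow these README lines document, assembled from the commands shown in the diff (model paths and binary names are taken from the surrounding README context and assume the binaries have been built in the repo root):

```bash
# convert the weights to an f16 GGUF file (path from the README example)
python3 convert.py models/7B/

# quantize the f16 model to 4-bits (using q4_0 method)
./quantize ./models/7B/ggml-model-f16.gguf ./models/7B/ggml-model-q4_0.gguf q4_0

# update the gguf filetype to current if an older version is unsupported by another application;
# COPY rewrites the file's metadata without re-quantizing the tensors
./quantize ./models/7B/ggml-model-q4_0.gguf ./models/7B/ggml-model-q4_0-v2.gguf COPY

# run the inference on the updated file
./main -m ./models/7B/ggml-model-q4_0-v2.gguf -n 128
```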