Skip to content

The quantize.py script is not needed anymore. Just fetch the latest code and do this as a quantization step: #641

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
huangzhimin4read opened this issue Mar 31, 2023 · 4 comments

Comments

@huangzhimin4read
Copy link

          The `quantize.py` script is not needed anymore. Just fetch the latest code and do this as a quantization step:
./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin 2

Originally posted by @prusnak in #621 (comment)

I tried this method in Colab, but it still reports an error:
/bin/bash: ./quantize: No such file or directory

@CoderRC
Copy link

CoderRC commented Mar 31, 2023

You have to compile the library first. Try: #103 (comment)

@prusnak
Copy link
Collaborator

prusnak commented Mar 31, 2023

Follow the instructions in the README file which tells you to compile the things first using make

@prusnak prusnak closed this as not planned Won't fix, can't repro, duplicate, stale Mar 31, 2023
@montvid
Copy link

montvid commented Jul 12, 2023

I used cmake to build and have to write the path to quantize:
./build/bin/quantize models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_1.bin q4_1

@xxxwp123
Copy link

          The `quantize.py` script is not needed anymore. Just fetch the latest code and do this as a quantization step:
./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin 2

Originally posted by @prusnak in #621 (comment)

I tried this method in Colab, but it still reports an error: /bin/bash: ./quantize: No such file or directory

Have you solved this problem?I have the same problem.thanks

AAbushady pushed a commit to AAbushady/llama.cpp that referenced this issue Jan 28, 2024
…puts. Passthrough now passes the proper variables. (ggml-org#641)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants