Skip to content

Conversation

danielzgtg
Copy link
Collaborator

As requested by #93 (comment) . This tests both the aliased (e.g. "Q4") and unaliased (e.g. "Q4_0") versions.

It is single-threaded as it should be IO-bound for most users. Outputing to ramdisk (default) and inputting from SSD, it takes more than 2 minutes.

The differences from my version in a comment are: CMake integration, sha256sum replacing md5sum to read from the Git cache, automating the venv/mount/git-clone initial setup, and optimization via caching or skipping gguf-dump.

This contains #93, which should be merged first.

@danielzgtg
Copy link
Collaborator Author

Maybe I can create a GitHub Action for the test.

@ecyht2 I'm not sure if that's worth it. The git clone takes a long time and end users just download from Hugging Face.

@ecyht2
Copy link
Collaborator

ecyht2 commented Jul 25, 2025

Hmm, maybe we can hold off on this and let @mmwillet decide.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants