Optimize context computing (GEMM) for metal backend #106

Merged

li-plus merged 1 commit into main from opt-metal

Aug 22, 2023

Owner

li-plus commented Aug 22, 2023

Ported from llama.cpp (ggml-org/llama.cpp#2615). Great work! Now we can use the Apple GPU to accelerate context computing. Will resolve #98.


Optimize context computing (GEMM) for metal backend

2fdc86a

li-plus merged commit 9de6e6f into main

li-plus deleted the opt-metal branch

August 22, 2023 16:30
