-
Notifications
You must be signed in to change notification settings - Fork 12.7k
Closed
Labels
good first issueGood for newcomersGood for newcomersperformanceSpeed related topicsSpeed related topics
Description
With the recent support for running convolutions on the GPU (#4060) we should be able to offload CLIP to run fully on the GPU.
- Implement
ggml_acc
CUDA / Metal kernels - Avoid
ggml_repeat
where possible using broadcast - Should use the new
ggml-backend
API (see https://github.com/ggerganov/ggml/blob/master/examples/gpt-2/main-backend.cpp)
Piezoid, y10ab1, jjiteshh, NXTler and lin72hcmp-nct, monatis, y10ab1, abc2cba, lin72h and 1 more
Metadata
Metadata
Assignees
Labels
good first issueGood for newcomersGood for newcomersperformanceSpeed related topicsSpeed related topics