Clip performance on Apple Silicon Macs #1392

**Describe the Issue**

The GPU is not used for CLIP when running the Qwen2-VL-7B-Instruct model with an mmproj vision projector. The image encoder falls back to the CPU.

I get:
attempting to apply Multimodal Projector: /Users/xxx/Documents/Llava/LLavaImageTagger/mmproj-Qwen2-VL-7B-Instruct-f16.gguf
Clip will use CPU for this model!
clip_model_load: model name:   Qwen2-VL-7B-Instruct
clip_model_load: description:  image encoder for Qwen2VL
clip_model_load: GGUF version: 3
clip_model_load: alignment:    32
clip_model_load: n_tensors:    521
clip_model_load: n_kv:         20
clip_model_load: ftype:        f16

I use the following arguments to start the compiled macOS build, koboldcpp-mac-arm64:
"$KOBOLDCPP_BINARY" "$TEXT_MODEL" --mmproj "$IMAGE_PROJECTOR" --flashattention --contextsize 4096 --visionmaxres 9999 --noblas --gpulayers 200 --threads 11 --blasthreads 11  --quiet &
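
To confirm the fallback from a captured log rather than by eye, something like the following could work. This is only a sketch: the function name `clip_on_cpu` and the log file name `kobold.log` are hypothetical names I made up, and the only thing taken from the actual output is the "Clip will use CPU for this model!" line.

```shell
# Hypothetical helper (not part of koboldcpp): succeeds if the given log file
# contains the CPU fallback message printed when the projector is loaded.
clip_on_cpu() {
  # -q: no output, -F: match the message as a fixed string, not a regex
  grep -qF "Clip will use CPU for this model!" "$1"
}

# Example usage, assuming the startup output was saved to kobold.log
# (an assumed file name), e.g. by appending `2>&1 | tee kobold.log`
# to the launch command:
#   clip_on_cpu kobold.log && echo "CLIP is running on the CPU"
```

On my machine this check reports the fallback every time with the arguments above.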

Is it possible to address this? 

**Additional Information:**
Apple Mac Studio M2 Max 



