Skip to content

Clip performance on Mac Silicone #1392

@foggyghost0

Description

@foggyghost0

Describe the Issue

GPU for clip is not being used with Qwen2-VL-7B-Instruct Model with an mmproj visual module.

I get:
attempting to apply Multimodal Projector: /Users/xxx/Documents/Llava/LLavaImageTagger/mmproj-Qwen2-VL-7B-Instruct-f16.gguf
Clip will use CPU for this model!
clip_model_load: model name: Qwen2-VL-7B-Instruct
clip_model_load: description: image encoder for Qwen2VL
clip_model_load: GGUF version: 3
clip_model_load: alignment: 32
clip_model_load: n_tensors: 521
clip_model_load: n_kv: 20
clip_model_load: ftype: f16

I use the following args to start the compiled Mac os version koboldcpp-mac-arm64:
"$KOBOLDCPP_BINARY" "$TEXT_MODEL" --mmproj "$IMAGE_PROJECTOR" --flashattention --contextsize 4096 --visionmaxres 9999 --noblas --gpulayers 200 --threads 11 --blasthreads 11 --quiet &

Is it possible to address this?

Additional Information:
Apple Mac Studio M2 Max

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions