Add support for running on with Apple GPUs

### Is your feature request related to a problem? Please describe.

Currently to test the model one needs to use Nvidia GPU. In principale it should be possible to run it on with Apple GPU via Metal as well. It's implemented in [llama.cpp](https://github.com/ggml-org/llama.cpp/pull/5021)

### Describe the solution you'd like

When model is installed on a Mac it should use this dependency: https://github.com/philipturner/metal-flash-attention

### Describe alternatives you've considered

_No response_

### Additional context

_No response_

### Organisation

AWI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for running on with Apple GPUs #181

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Organisation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add support for running on with Apple GPUs #181

Description

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Organisation

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions