Closed
Description
Name and Version
The latest Vulkan patches cause problems in KoboldCpp on a Radeon RX 6900 XT.
Operating systems
Windows
Which llama.cpp modules do you know to be affected?
llama-quantize
Command line
Problem description & steps to reproduce
I reported a problem with Vulkan on the KoboldCpp project page and was asked to report it here. The problem appears in the newly released KoboldCpp version, which includes the latest fixes for Vulkan.
As I wrote in the bug report on KoboldCpp, I have two cards: a GTX 1080 Ti and a Radeon RX 6900 XT. When the creator of KoboldCpp included the latest Vulkan patches, my Radeon stopped working properly with Vulkan.
I have tested various models (Llama 3.1, Nemo, Mistral Small 22B/24B), and Vulkan on the Radeon RX 6900 XT no longer works correctly with any of them: either random characters are generated from the very beginning, or the response very quickly falls into a loop and repeats a single word.
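For anyone trying to reproduce or bisect this, the looping symptom can be flagged automatically in saved output. The following is only a rough sketch: `looks_looped` and its `min_repeats` threshold are hypothetical names chosen for illustration, not part of llama.cpp or KoboldCpp, and it only catches the "same word repeated" case, not the random-character case.

```python
def looks_looped(text: str, min_repeats: int = 5) -> bool:
    """Return True if the text ends with one word repeated min_repeats times.

    Hypothetical helper for triage only; the threshold is an assumption.
    """
    words = text.split()
    if len(words) < min_repeats:
        return False
    tail = words[-min_repeats:]
    # The output counts as looped when every word in the tail is identical.
    return all(word == tail[0] for word in tail)


if __name__ == "__main__":
    # Example strings are made up for illustration, not real model output.
    print(looks_looped("Sure, here is the the the the the the"))
    print(looks_looped("Sure, here is a short answer to your question."))
```

Running the same prompt through a known-good build and a suspect build and comparing the results of such a check could help narrow down the first bad commit.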
First Bad Commit
No response