cuda : fix rope pos data #7452


Merged
merged 4 commits into master on May 22, 2024

Conversation

ggerganov
Member

ref #7225
fix #7451

#7225 accidentally broke CUDA RoPE for all models that do not use NeoX-style RoPE.

@github-actions github-actions bot added the Nvidia GPU (Issues specific to Nvidia GPUs) and ggml (changes relating to the ggml tensor library for machine learning) labels May 22, 2024
@ggerganov ggerganov merged commit 3e5faa8 into master May 22, 2024
24 of 79 checks passed
@ggerganov ggerganov deleted the gg/cuda-fix-rope-pos branch May 22, 2024 08:01
@github-actions github-actions bot added the testing (Everything test related) label May 22, 2024
teleprint-me pushed a commit to teleprint-me/llama.cpp that referenced this pull request May 23, 2024
* cuda : fix rope pos data

ggml-ci

* ggml : drop mode & 1 == 1 support for ggml_rope

ggml-ci

* ggml : support freq_factors for f16 rope (CPU)

ggml-ci

* tests : add rope tests using frequency factors

ggml-ci
Labels
ggml (changes relating to the ggml tensor library for machine learning), Nvidia GPU (Issues specific to Nvidia GPUs), testing (Everything test related)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

regression: output is nonsense with latest commit and CUDA support enabled
2 participants