Bug: Inconsistent ggml-4-x86-cuda-v100 ci failures on master #7613

Closed
mofosyne opened this issue May 29, 2024 · 3 comments
Labels
bug-unconfirmed, low severity (Used to report low severity bugs in llama.cpp, e.g. cosmetic issues, non-critical UI glitches), stale

Comments

mofosyne (Collaborator) commented May 29, 2024

Note: this is only one data point of CI failure, but it would be important to keep track of this behavior over the next few commits.

What happened?

Noticed that CI reported a failure in 20 - test-backend-ops; it would be good to identify the cause of this issue and potential ways to fix it. The failure in test #20 of test-backend-ops looked like the log below, which doesn't explain much to me, but hopefully it makes sense to someone else here.

The `[CPY] NMSE = 0.000003149 > 0.000000100` line looks interesting, however.
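
For reference, here is a minimal sketch of how a normalized mean squared error (NMSE) check like the one in the log could work. This is an assumption for illustration, not the actual test-backend-ops code: the backend's output is compared against a reference result, and the case fails when the NMSE exceeds the maximum allowed error (0.000000100 in the log).

```cpp
// Minimal sketch (assumed, not the actual test-backend-ops implementation):
// NMSE = sum((ref - out)^2) / sum(ref^2), failing when it exceeds a threshold.
#include <cstddef>
#include <cstdio>
#include <vector>

static double nmse(const std::vector<float> & ref, const std::vector<float> & out) {
    double err    = 0.0;
    double ref_sq = 0.0;
    for (size_t i = 0; i < ref.size(); ++i) {
        const double d = (double) ref[i] - (double) out[i];
        err    += d * d;
        ref_sq += (double) ref[i] * (double) ref[i];
    }
    return err / ref_sq;
}

int main() {
    const double max_nmse = 0.000000100; // threshold seen in the log
    // ref would be the CPU backend result, out the CUDA backend result
    std::vector<float> ref = {1.0f, 2.0f, 3.0f, 4.0f};
    std::vector<float> out = {1.0f, 2.0002f, 3.0f, 4.0f};
    const double e = nmse(ref, out);
    printf("[CPY] NMSE = %.9f %s\n", e, e > max_nmse ? "> threshold -> FAIL" : "-> OK");
    return 0;
}
```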

Name and Version

between commit 504f0c3 and 0e8d8bf

What operating system are you seeing the problem on?

Other? (Please let us know in description)

Relevant log output

OK
  CPY(type_src=f32,type_dst=q4_1,ne=[256,4,4,4]): ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
[CPY] NMSE = 0.000003149 > 0.000000100 ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
FAIL
  CPY(type_src=f32,type_dst=q5_0,ne=[256,4,4,4]): ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
ggml_backend_cuda_graph_compute: disabling CUDA graphs due to GPU architecture
OK
mofosyne added the bug-unconfirmed and low severity labels on May 29, 2024
ggerganov (Member) commented

See this comment for explanation: #7425 (comment)

mofosyne (Collaborator, Author) commented May 29, 2024

Is there a way to reasonably suppress this, change the rounding approach, increase the threshold, or otherwise adapt to this error (e.g. only trigger a failure if a test trips two or more times)?
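
As a rough illustration of that last idea (purely hypothetical, not existing test-backend-ops behavior), a wrapper could rerun a flaky case and only report it as failed if it trips more than once:

```cpp
// Hypothetical sketch: rerun a case and only count it as a real failure
// if it trips at least twice (run_case would regenerate its random inputs
// internally on each call). Names here are illustrative, not llama.cpp API.
#include <cstdio>
#include <functional>

static bool fails_repeatedly(const std::function<bool()> & run_case, int max_runs = 2) {
    int trips = 0;
    for (int i = 0; i < max_runs; ++i) {
        if (!run_case()) {
            ++trips;
        }
    }
    return trips >= 2; // a single trip is treated as noise, not a failure
}

int main() {
    // Stand-in for a single CPY(type_src=f32,type_dst=q4_1) check.
    auto run_cpy_case = []() -> bool { return true; };
    printf("treat as failure: %s\n", fails_repeatedly(run_cpy_case) ? "yes" : "no");
    return 0;
}
```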

github-actions bot added the stale label on Jun 29, 2024
github-actions bot (Contributor) commented

This issue was closed because it has been inactive for 14 days since being marked as stale.
