musa: update compile flags #16265

yeahdongcn · 2025-09-26T01:29:27Z

Make sure to read the contributing guidelines before submitting a PR

This PR introduces a minor performance improvement on MTGPU by applying updated compiler flags. It also addresses build warnings in recently updated files.

yeahdongcn · 2025-09-28T02:42:55Z

There are 3 failed CI tests, but they don’t seem related to this PR.

ggml/src/ggml-cuda/topk-moe.cu

Signed-off-by: Xiaodong Ye <[email protected]>

yeahdongcn · 2025-10-02T11:47:07Z

Just rebased on upstream/master to see if CI passes.

yeahdongcn · 2025-10-02T12:59:00Z

@JohannesGaessler Could you please help merge this? The failed CI cases don’t appear to be related to my changes.

ggml/src/ggml-musa/CMakeLists.txt

This reverts commit 91a2a56.

* origin/master: (124 commits) metal : fix loop bound in ggml_mem_ranges (ggml-org#16412) llama : fix shapes for bert/mpt q/k norm (ggml-org#16409) ggml : fix graph reallocation with multiple chunks (ggml-org#16396) Fix missing messages on sibling navigation (ggml-org#16408) vulkan: Replace uses of maxMemoryAllocationSize and VK_WHOLE_SIZE (ggml-org#16354) vulkan: Fix FA coopmat1 invalid array indexing (ggml-org#16365) ci : change macos-13 to macos-15-intel (ggml-org#16401) Capture model name only after first token (streaming) or completed request (ggml-org#16405) vulkan: in flash attention, bounds check against nem1 (don't rely on GGML_KQ_MASK_PAD) (ggml-org#16316) webui : Fix messages payload sent to chat completions (ggml-org#16402) fix: track viewportHeight via window.innerHeight to avoid unwanted scrolling (ggml-org#16356) test-barrier : do not use more threads than physically available (ggml-org#16389) ggml webgpu: add support for soft_max, optimize rms_norm (ggml-org#16357) model : Apertus model implementation (ggml-org#15852) musa: update compile flags (ggml-org#16265) ci : fix ubuntu-latest-cmake-rpc (disable ccache) (ggml-org#16388) ci: update vulkan ci (ggml-org#16294) ci : fix clean-up of old logs (ggml-org#16381) SYCL: Update to oneAPI 2025.2 (ggml-org#16371) HIP: add IMbackK to codeowner (ggml-org#16375) ...

This reverts commit 91a2a56.

yeahdongcn requested a review from JohannesGaessler September 26, 2025 01:29

github-actions bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Sep 26, 2025

yeahdongcn force-pushed the xd/musa_compile_flags branch 2 times, most recently from 3db0b7e to 766bbab Compare September 28, 2025 01:25

JohannesGaessler reviewed Sep 29, 2025

View reviewed changes

ggml/src/ggml-cuda/topk-moe.cu Outdated Show resolved Hide resolved

yeahdongcn force-pushed the xd/musa_compile_flags branch from 766bbab to 5c4459a Compare September 29, 2025 13:53

JohannesGaessler approved these changes Sep 29, 2025

View reviewed changes

musa: update compile flags

896455b

Signed-off-by: Xiaodong Ye <[email protected]>

yeahdongcn force-pushed the xd/musa_compile_flags branch from 5c4459a to 896455b Compare October 2, 2025 11:45

ggerganov reviewed Oct 2, 2025

View reviewed changes

ggml/src/ggml-musa/CMakeLists.txt Show resolved Hide resolved

ggerganov merged commit 91a2a56 into ggml-org:master Oct 2, 2025
63 of 68 checks passed

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 2, 2025

Revert "musa: update compile flags (ggml-org#16265)"

ac2ae40

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 3, 2025

Revert "musa: update compile flags (ggml-org#16265)"

4da0bbe

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 4, 2025

Revert "musa: update compile flags (ggml-org#16265)"

75d4572

This reverts commit 91a2a56.

yeahdongcn mentioned this pull request Oct 5, 2025

musa: support new SDK rc4.3.0 MooreThreads/ollama-musa#28

Merged

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 5, 2025

Revert "musa: update compile flags (ggml-org#16265)"

4060dbb

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 7, 2025

Revert "musa: update compile flags (ggml-org#16265)"

a7b2d20

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 7, 2025

Revert "musa: update compile flags (ggml-org#16265)"

1326d48

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 9, 2025

Revert "musa: update compile flags (ggml-org#16265)"

5bc6d9b

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 9, 2025

Revert "musa: update compile flags (ggml-org#16265)"

c48b9a7

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 11, 2025

Revert "musa: update compile flags (ggml-org#16265)"

14cab36

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 11, 2025

Revert "musa: update compile flags (ggml-org#16265)"

fa8250d

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 11, 2025

Revert "musa: update compile flags (ggml-org#16265)"

29e5067

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 12, 2025

Revert "musa: update compile flags (ggml-org#16265)"

023e52e

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 13, 2025

Revert "musa: update compile flags (ggml-org#16265)"

470abba

This reverts commit 91a2a56.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 13, 2025

Revert "musa: update compile flags (ggml-org#16265)"

3927c8c

This reverts commit 91a2a56.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

musa: update compile flags #16265

musa: update compile flags #16265

yeahdongcn commented Sep 26, 2025

Uh oh!

yeahdongcn commented Sep 28, 2025

Uh oh!

Uh oh!

yeahdongcn commented Oct 2, 2025

Uh oh!

yeahdongcn commented Oct 2, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

musa: update compile flags #16265

musa: update compile flags #16265

Conversation

yeahdongcn commented Sep 26, 2025

Uh oh!

yeahdongcn commented Sep 28, 2025

Uh oh!

Uh oh!

yeahdongcn commented Oct 2, 2025

Uh oh!

yeahdongcn commented Oct 2, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants