[CI] Fix broken compile tests due to unsupported SiluMul+Nvfp4Quant fusion #23973

sarckk · 2025-08-30T00:04:35Z

Purpose

Fix failing tests compile/test_pass_manager.py::test_pass_manager_uuid and compile/test_full_graph.py::test_custom_compile_config.

The error is:

[2025-08-29T20:36:33Z] (EngineCore_0 pid=14176)   File "/usr/local/lib/python3.12/dist-packages/vllm/compilation/activation_quant_fusion.py", line 174, in __init__
--
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176)     pattern_silu_mul_nvfp4 = SiluMulNvfp4QuantPattern()
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176)                              ^^^^^^^^^^^^^^^^^^^^^^^^^^
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176)   File "/usr/local/lib/python3.12/dist-packages/vllm/compilation/activation_quant_fusion.py", line 116, in __init__
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176)     super().__init__(kNvfp4Quant)
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176)   File "/usr/local/lib/python3.12/dist-packages/vllm/compilation/activation_quant_fusion.py", line 55, in __init__
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176)     assert self.quant_key in FUSED_OPS, \
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176)            ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176) torch._dynamo.exc.BackendCompilerFailed: backend='<vllm.compilation.backends.VllmBackend object at 0x7f27e72f50a0>' raised:
  | [2025-08-29T20:36:33Z] (EngineCore_0 pid=14176) AssertionError: unsupported fusion scheme QuantKey(u8,scale(f8e4m3fn,dynamic,GroupShape(row=1, col=16)),scale2(f32,static,per_tensor),symmetric)

This can happen when hasattr(torch.ops._C, "silu_and_mul_nvfp4_quant") is false. Only enable SiluMul+Nvfp4Quant fusion added in #23671 if supported.

Test Plan

pytest tests/compile/test_pass_manager.py::test_pass_manager_uuid
pytest tests/compile/test_full_graph.py::test_custom_compile_config

Test Result

All tests passed locally, check passes in CI as well.

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Yong Hoon Shin <[email protected]>

gemini-code-assist

Code Review

This pull request correctly fixes a crash that occurs when the SiluMul+Nvfp4Quant fusion is attempted on a platform where it's not supported. The change adds a check to ensure the fusion is only registered if the required silu_and_mul_nvfp4_quant op is available. The fix is sound. I have added one comment regarding code duplication which could pose a maintenance risk.

vllm/compilation/activation_quant_fusion.py

Signed-off-by: Yong Hoon Shin <[email protected]>

…usion (vllm-project#23973) Signed-off-by: Yong Hoon Shin <[email protected]> Co-authored-by: Roger Wang <[email protected]>

Fix broken pass manager test

25dd34d

Signed-off-by: Yong Hoon Shin <[email protected]>

sarckk requested review from ProExpertProg, youkaichao and zou3519 as code owners August 30, 2025 00:04

gemini-code-assist bot reviewed Aug 30, 2025

View reviewed changes

vllm/compilation/activation_quant_fusion.py Outdated Show resolved Hide resolved

Unify silu_and_mul_nvfp4_quant check

1a57333

Signed-off-by: Yong Hoon Shin <[email protected]>

ywang96 added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 30, 2025

Merge branch 'main' into fix-fusion-pass-test

6a1f060

ywang96 approved these changes Aug 30, 2025

View reviewed changes

ywang96 enabled auto-merge (squash) August 30, 2025 04:09

vllm-bot merged commit 9748c51 into vllm-project:main Aug 30, 2025
37 of 39 checks passed

DarkLight1337 mentioned this pull request Aug 30, 2025

[CI Failure] Skip failing nvfp4 silu test #23959

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[CI] Fix broken compile tests due to unsupported SiluMul+Nvfp4Quant fusion #23973

[CI] Fix broken compile tests due to unsupported SiluMul+Nvfp4Quant fusion #23973

Uh oh!

sarckk commented Aug 30, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[CI] Fix broken compile tests due to unsupported SiluMul+Nvfp4Quant fusion #23973

[CI] Fix broken compile tests due to unsupported SiluMul+Nvfp4Quant fusion #23973

Uh oh!

Conversation

sarckk commented Aug 30, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sarckk commented Aug 30, 2025 •

edited by github-actions bot

Loading