Conversation


@louiswang524 louiswang524 commented Sep 7, 2025

Summary

Fixes a PyTorch 2.8 API compatibility issue in the collective fusion patterns by adding the missing orig_scatter_dim parameter to torch.ops.symm_mem.fused_scaled_matmul_reduce_scatter calls.

Problem

PyTorch 2.8 introduced a breaking change where fused_scaled_matmul_reduce_scatter now requires an
orig_scatter_dim parameter. This was causing compilation test failures in tests/compile/test_async_tp.py.

Solution

Added the orig_scatter_dim=0 parameter to both instances of the function call (a sketch of the updated call follows this list) in:

  • ScaledMMReduceScatterPattern.replacement() (line 168)
  • CutlassScaledMMReduceScatterPattern.replacement() (line 281)
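
A minimal sketch of what the updated call could look like, assuming the keyword names and surrounding pattern context (input, mat2, scale_a, scale_b, self.dtype, self.tp) quoted in the review comments below; this is an illustration, not the exact diff:

gemm_rs = torch.ops.symm_mem.fused_scaled_matmul_reduce_scatter(
    input=input,
    mat2=mat2,
    scale_a=scale_a,
    scale_b=scale_b,
    reduce_op="avg",
    orig_scatter_dim=0,  # parameter this PR adds for PyTorch 2.8
    scatter_dim=0,
    out_dtype=self.dtype,
    group_name=self.tp.device_group.group_name,
)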

Testing

The fix maintains backward compatibility and addresses the specific error mentioned in issue #24376.

Fixes #24376

github-actions bot commented Sep 7, 2025

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only fastcheck CI runs, which covers a small, essential subset of CI tests to quickly catch errors.

You can ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

@louiswang524 louiswang524 mentioned this pull request Sep 7, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request aims to fix an API compatibility issue with PyTorch 2.8 by adding the orig_scatter_dim parameter to fused_scaled_matmul_reduce_scatter calls. While the changes are correct for PyTorch 2.8, they break backward compatibility with older versions of PyTorch, which will cause runtime errors. My review includes critical feedback on how to address this by adding the new parameter conditionally based on the PyTorch version, ensuring the fix works across different environments.

Contributor


critical

While this change fixes compatibility with PyTorch 2.8, it introduces a backward compatibility issue with older PyTorch versions (e.g., 2.7) that do not have the orig_scatter_dim argument. This will raise a TypeError on older versions.

To ensure backward compatibility, this argument should be added conditionally. I recommend refactoring the call to use a kwargs dictionary and add orig_scatter_dim only if the PyTorch version is 2.8 or newer.

Example:

from vllm.utils import is_torch_equal_or_newer

kwargs = {
    "input": input,
    "mat2": mat2,
    "scale_a": scale_a,
    "scale_b": scale_b,
    "reduce_op": "avg",
    "scatter_dim": 0,
    "out_dtype": self.dtype,
    "group_name": self.tp.device_group.group_name,
}
if is_torch_equal_or_newer("2.8"):
    kwargs["orig_scatter_dim"] = 0

gemm_rs = torch.ops.symm_mem.fused_scaled_matmul_reduce_scatter(**kwargs)

Contributor


critical

This change, while fixing compatibility for PyTorch 2.8, breaks backward compatibility for older versions like 2.7. The orig_scatter_dim argument does not exist in fused_scaled_matmul_reduce_scatter in older PyTorch versions, which will lead to a TypeError. This argument should be added conditionally based on the PyTorch version to maintain compatibility.

Example:

from vllm.utils import is_torch_equal_or_newer

kwargs = {
    # ... other args
}
if is_torch_equal_or_newer("2.8"):
    kwargs["orig_scatter_dim"] = 0

gemm_rs = torch.ops.symm_mem.fused_scaled_matmul_reduce_scatter(**kwargs)

Correct the function signature to match PyTorch 2.8 requirements.

The issue was not a missing orig_scatter_dim parameter, but rather:
1. Wrong parameter order (using named parameters instead of positional)
2. Missing required parameters: bias_node, result_scale_node, use_fast_accum

PyTorch 2.8 signature:
fused_scaled_matmul_reduce_scatter(A, B, A_scale, B_scale, reduce_op,
                                   scatter_dim, group_name, bias_node,
                                   result_scale_node, out_dtype, use_fast_accum)

Changes (a sketch of the resulting call follows this commit message):
- Convert scatter_dim=0 to positional argument 0
- Move group_name to correct position (7th parameter)
- Add missing bias_node=None (8th parameter)
- Add missing result_scale_node=None (9th parameter)
- Move out_dtype to correct position (10th parameter)
- Add missing use_fast_accum=False (11th parameter)

Fixes vllm-project#24376
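
For reference, a minimal sketch of a call matching the positional signature described above, reusing the pattern variables (input, mat2, scale_a, scale_b, self.dtype, self.tp) assumed in the review examples; an illustration of the described signature, not the actual commit diff:

gemm_rs = torch.ops.symm_mem.fused_scaled_matmul_reduce_scatter(
    input,                            # A
    mat2,                             # B
    scale_a,                          # A_scale
    scale_b,                          # B_scale
    "avg",                            # reduce_op
    0,                                # scatter_dim
    self.tp.device_group.group_name,  # group_name
    None,                             # bias_node
    None,                             # result_scale_node
    self.dtype,                       # out_dtype
    False,                            # use_fast_accum
)
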
@louiswang524 louiswang524 force-pushed the fix-pytorch-28-symm-mem-api branch from 975b032 to 27798fb on September 7, 2025 at 08:23
@louiswang524
Author

Closing due to incorrect analysis of the issue

@louiswang524 louiswang524 deleted the fix-pytorch-28-symm-mem-api branch September 7, 2025 08:32
@andoorve
Collaborator

Hey @louiswang524, why was the analysis incorrect?

cc: @jasonlizhengjian
