@WBobby WBobby commented Jun 29, 2022

Fixes #ISSUE_NUMBER

@jithunnair-amd jithunnair-amd merged commit 572c17c into ROCm:rocm5.2_internal_testing Jun 29, 2022
jithunnair-amd pushed a commit that referenced this pull request Jul 12, 2022
jithunnair-amd pushed a commit that referenced this pull request Aug 30, 2022
pruthvistony pushed a commit that referenced this pull request Nov 10, 2022
akashveramd pushed a commit that referenced this pull request Jun 13, 2025
… now that it's supported in core (#1031)

## Summary

In pytorch#149876 I found there was a
problem with per op SAC, per layer SAC, and no AC: all of these
settings saved reduce_scatter_tensor for backward, which
broke the async TP pattern matching, since it expects the reduce scatter node to
have only 1 user (wait_tensor), not 2 (wait_tensor and output_node).
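To make the failure mode concrete, here is a minimal, purely illustrative sketch (not the actual Inductor pattern matcher; `Node` and `matches_async_tp_pattern` are hypothetical) of why a second user on the reduce_scatter node defeats a matcher that requires exactly one user:

```python
# Illustrative model of a graph node that records its users.
class Node:
    def __init__(self, name):
        self.name = name
        self.users = []

def matches_async_tp_pattern(reduce_scatter):
    # Original-style matcher: reduce_scatter must feed exactly one
    # user, its wait_tensor, and nothing else.
    return (len(reduce_scatter.users) == 1
            and reduce_scatter.users[0].name == "wait_tensor")

rs = Node("reduce_scatter_tensor")
rs.users.append(Node("wait_tensor"))
print(matches_async_tp_pattern(rs))   # matches: only user is wait_tensor

# SAC / no-AC also saves reduce_scatter for backward -> a second user
# (the graph output), so the strict pattern no longer fires.
rs.users.append(Node("output"))
print(matches_async_tp_pattern(rs))   # no match with 2 users
```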

In pytorch#149946 I addressed this by:

1) Adding new graph patterns to match on that allow reduce_scatter to
have 2 users.
2) Updating the subgraph replacement logic to save the "fused matmul
reduce scatter" node for backward instead of the reduce scatter node, if
it detects the graph is saving reduce_scatter for backward. This allows
the original matmul reduce scatter graph to be replaced and erased
correctly.
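The two steps above can be sketched as follows. This is a hedged toy model of the replacement logic, not the real fx subgraph-rewriting code; `Node`, `replace_with_fused`, and the saved-for-backward list are all hypothetical stand-ins:

```python
class Node:
    def __init__(self, name, users=None):
        self.name = name
        self.users = users or []

def replace_with_fused(reduce_scatter, saved_for_backward):
    # Step 1: the relaxed pattern tolerates a second user, as long as
    # that user is the graph output saving the value for backward.
    allowed = {"wait_tensor", "output"}
    if not {u.name for u in reduce_scatter.users} <= allowed:
        return None  # pattern does not apply
    fused = Node("fused_matmul_reduce_scatter")
    # Step 2: if the graph saved reduce_scatter for backward, save the
    # fused node instead, so reduce_scatter loses its extra user and the
    # original matmul + reduce_scatter subgraph can be erased.
    if reduce_scatter in saved_for_backward:
        idx = saved_for_backward.index(reduce_scatter)
        saved_for_backward[idx] = fused
    return fused

saved = [Node("some_activation")]
rs = Node("reduce_scatter_tensor", [Node("wait_tensor"), Node("output")])
saved.append(rs)
fused = replace_with_fused(rs, saved)
print(fused.name)         # fused_matmul_reduce_scatter
print(saved[1] is fused)  # the fused node is now saved for backward
```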

Once pytorch#149946 lands, we can
add reduce_scatter_tensor back to the op save list for SAC in
torchtitan, and it will no longer break SAC or no AC 👍
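For context, a per-op SAC policy of this kind boils down to a save list consulted per operator. The sketch below is hypothetical (op-name strings and `sac_policy` are illustrative, not torchtitan's actual API); it only shows the shape of "add reduce_scatter_tensor back to the save list":

```python
# Hypothetical per-op SAC save list; names are illustrative only.
_save_list = {
    "aten.mm.default",
    "_c10d_functional.reduce_scatter_tensor.default",  # safe again after the fix
}

def sac_policy(op_name):
    # Save the op's output for backward if listed, otherwise recompute it.
    return "save" if op_name in _save_list else "recompute"

print(sac_policy("_c10d_functional.reduce_scatter_tensor.default"))  # save
print(sac_policy("aten.relu.default"))                               # recompute
```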