Skip to content

DISABLED test_post_localSGD_optimizer_parity_with_hierarchical_sgd (__main__.TestDistBackendWithSpawn) #75052

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
malfet opened this issue Apr 1, 2022 · 3 comments
Labels
module: rocm AMD GPU support for Pytorch oncall: distributed Add this issue/PR to distributed oncall triage queue skipped Denotes a (flaky) test currently skipped in CI.

Comments

@malfet
Copy link
Contributor

malfet commented Apr 1, 2022

Platforms: rocm
This test was disabled because it is failing on master (recent examples).

cc @pietern @mrshenli @pritamdamania87 @zhaojuanmao @satgera @rohan-varma @gqchen @aazzolini @osalpekar @jiayisuse @SciPioneer @H-Huang @kwen2501 @jeffdaily @sunway513 @jithunnair-amd @ROCmSupport @KyleCZH

@pytorch-bot pytorch-bot bot added the skipped Denotes a (flaky) test currently skipped in CI. label Apr 1, 2022
@pytorch-bot
Copy link

pytorch-bot bot commented Apr 1, 2022

Hello there! From the DISABLED prefix in this issue title, it looks like you are attempting to disable a test in PyTorch CI. The information I have parsed is below:
  • Test name: test_post_localSGD_optimizer_parity_with_hierarchical_sgd (__main__.TestDistBackendWithSpawn)
  • Platforms for which to skip the test: rocm

Within ~15 minutes, test_post_localSGD_optimizer_parity_with_hierarchical_sgd (__main__.TestDistBackendWithSpawn) will be disabled in PyTorch CI for these platforms: rocm. Please verify that your test name looks correct, e.g., test_cuda_assert_async (__main__.TestCuda).

To modify the platforms list, please include a line in the issue body, like below. The default action will disable the test for all platforms if no platforms list is specified.

Platforms: case-insensitive, list, of, platforms

We currently support the following platforms: asan, linux, mac, macos, rocm, win, windows.

@KyleCZH
Copy link
Contributor

KyleCZH commented Apr 27, 2022

@malfet please close this issue as we have a fix (#76136) to deal with an odd number of world_size now

@janeyx99 janeyx99 added oncall: distributed Add this issue/PR to distributed oncall triage queue module: rocm AMD GPU support for Pytorch labels Jun 22, 2022
Copy link

pytorch-bot bot commented Jan 15, 2025

Resolving the issue because the test is not flaky anymore after 5655 reruns without any failures and the issue hasn't been updated in 14 days. Please reopen the issue to re-disable the test if you think this is a false positive

@pytorch-bot pytorch-bot bot closed this as completed Jan 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: rocm AMD GPU support for Pytorch oncall: distributed Add this issue/PR to distributed oncall triage queue skipped Denotes a (flaky) test currently skipped in CI.
Projects
None yet
Development

No branches or pull requests

3 participants