-
Notifications
You must be signed in to change notification settings - Fork 24.2k
[Model Averaging] Add a unit test that launches hierarchical SGD by PostLocalSGDOptimizer #74668
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit 4fe68da (more details on the Dr. CI page): 💚 💚 Looks good so far! There are no failures yet. 💚 💚 This comment was automatically generated by Dr. CI (expand for details).Please report bugs/suggestions to the (internal) Dr. CI Users group. |
The failure on Windows seems to be irrelevant. |
@mrshenli has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
@mrshenli has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
…ostLocalSGDOptimizer (#74668) Summary: As title. The added unit test requires 4 GPUs. Please add `ciflow/all` to enable this test. Proposal: #73382 Pull Request resolved: #74668 Reviewed By: albanD Differential Revision: D35173938 Pulled By: mrshenli fbshipit-source-id: b6d61822bfa12c793050af96a8baa4fc92f6b120
Hey @wayi1. |
@pytorchbot revert this |
Reverting PR 74668 failed due to Can't revert PR that was landed via phabricator as D35173938 |
@zengk95 No need to revert PR #74668. Just adding Curious why these failure were not detected when the PR was submitted. The PR has already labeled "ci/master" and "ci/all". cc: @mrshenli @rohan-varma |
Reverting PR 74668 failed due to Comment @zengk95 No need to revert PR #74668. Just add @skip_if_rocm to these tests should work. |
@wayi1 Oh hmm. I was looking at https://hud.pytorch.org/pytorch/pytorch/pull/74668?sha=3491f4c36f63b174a768354af4f1edb8f66f4d38 which had some failures (my mistake). I don't think it got reverted anyways. As for why they didn't run, I think it did run, since it had trunk workflows. |
As title.
The added unit test requires 4 GPUs. Please add
ciflow/all
to enable this test.Proposal: #73382
Parent proposal: #71325