-
Notifications
You must be signed in to change notification settings - Fork 25.6k
[Elastic] Skip store barrier and store get in host assign #136579
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/136579
Note: Links to docs will display an error until the docs builds have been completed. ❌ 9 New Failures, 1 Unrelated FailureAs of commit eba30de with merge base f0a9254 ( NEW FAILURES - The following jobs have failed:
FLAKY - The following job failed but was likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This pull request was exported from Phabricator. Differential Revision: D63351662 |
This pull request was exported from Phabricator. Differential Revision: D63351662 |
95c24f7
to
413e204
Compare
413e204
to
6d9b37f
Compare
This pull request was exported from Phabricator. Differential Revision: D63351662 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
6d9b37f
to
272345d
Compare
This pull request was exported from Phabricator. Differential Revision: D63351662 |
This pull request was exported from Phabricator. Differential Revision: D63351662 |
272345d
to
49f89b3
Compare
This pull request was exported from Phabricator. Differential Revision: D63351662 |
49f89b3
to
e0065ba
Compare
This pull request was exported from Phabricator. Differential Revision: D63351662 |
e0065ba
to
3d4518a
Compare
This pull request was exported from Phabricator. Differential Revision: D63351662 |
…6579) Summary: Pull Request resolved: pytorch#136579 Sometimes, when the worker and role are the same, users want to skip TCPStore in `_assign_worker_ranks` and barrier in rendezvous Test Plan: unit test Reviewed By: d4l3k Differential Revision: D63351662
3d4518a
to
c0e9eb5
Compare
This pull request was exported from Phabricator. Differential Revision: D63351662 |
c0e9eb5
to
5c09c58
Compare
…6579) Summary: Pull Request resolved: pytorch#136579 Sometimes, when the worker and role are the same, users want to skip TCPStore in `_assign_worker_ranks` and barrier in rendezvous Test Plan: unit test Reviewed By: d4l3k Differential Revision: D63351662
5c09c58
to
eba30de
Compare
This pull request was exported from Phabricator. Differential Revision: D63351662 |
@pytorchbot merge (Initiating merge automatically since Phabricator Diff has merged) |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
As title this is to reland #136579 as it broke some OSS CI Differential Revision: [D63542918](https://our.internmc.facebook.com/intern/diff/D63542918/) [ghstack-poisoned]
reland in #136865 |
… assign" As title this is to reland #136579 as it broke some OSS CI Differential Revision: [D63542918](https://our.internmc.facebook.com/intern/diff/D63542918/) cc XilunWu H-Huang awgu kwen2501 wanchaol fegin wz337 wconstab d4l3k c-p-i-o [ghstack-poisoned]
Pull Request resolved: #136865 As title this is to reland #136579 as it broke some OSS CI ghstack-source-id: 245131066 Differential Revision: [D63542918](https://our.internmc.facebook.com/intern/diff/D63542918/)
…tore get in host assign" As title this is to reland #136579 as it broke some OSS CI Differential Revision: [D63542918](https://our.internmc.facebook.com/intern/diff/D63542918/) cc XilunWu H-Huang awgu kwen2501 wanchaol fegin wz337 wconstab d4l3k c-p-i-o [ghstack-poisoned]
… assign" As title this is to reland #136579 as it broke some OSS CI Differential Revision: [D63542918](https://our.internmc.facebook.com/intern/diff/D63542918/) cc XilunWu H-Huang awgu kwen2501 wanchaol fegin wz337 wconstab d4l3k c-p-i-o [ghstack-poisoned]
…36865) As title this is to reland #136579 as it broke some OSS CI Differential Revision: [D63542918](https://our.internmc.facebook.com/intern/diff/D63542918/) Pull Request resolved: #136865 Approved by: https://github.com/atalman
Summary: Sometimes, when the worker and role are the same, users want to skip TCPStore in
_assign_worker_ranks
and barrier in rendezvousTest Plan: unit test
Differential Revision: D63351662
cc @XilunWu @H-Huang @awgu @kwen2501 @wanchaol @fegin @wz337 @wconstab @d4l3k @c-p-i-o