New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[NCCL] Support NCCL Send/Recv #44921
Conversation
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
💊 CI failures summary and remediationsAs of commit d98ef7e (more details on the Dr. CI page): Commit d98ef7e was recently pushed. Waiting for builds... This comment was automatically generated by Dr. CI (expand for details).Follow this link to opt-out of these comments for your Pull Requests.Please report bugs/suggestions on the GitHub issue tracker or post in the (internal) Dr. CI Users group. This comment has been revised 53 times. |
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
Pull Request resolved: #44921 This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. ghstack-source-id: 112391583 Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/)
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Todo: add more tests. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
Pull Request resolved: #44921 This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. ghstack-source-id: 112545024 Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/)
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
Pull Request resolved: #44921 This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. ghstack-source-id: 112722354 Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D23709848/)!
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
Pull Request resolved: #44921 This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. ghstack-source-id: 112735516 Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D23709848/)!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, @jiayisuse can you take another look at the PR too?
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
Pull Request resolved: #44921 This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. ghstack-source-id: 112769784 Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D23709848/)!
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
Pull Request resolved: #44921 This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. ghstack-source-id: 112835901 Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D23709848/)!
Codecov Report
@@ Coverage Diff @@
## gh/mingzhe09088/1/base #44921 +/- ##
==========================================================
- Coverage 68.32% 68.16% -0.16%
==========================================================
Files 410 410
Lines 52981 53165 +184
==========================================================
+ Hits 36199 36240 +41
- Misses 16782 16925 +143
Continue to review full report at Codecov.
|
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
Pull Request resolved: #44921 This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. ghstack-source-id: 112935178 Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D23709848/)!
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context. Differential Revision: [D23709848](https://our.internmc.facebook.com/intern/diff/D23709848/) [ghstack-poisoned]
This pull request has been merged in 59083d6. |
#65601) Summary: Pull Request resolved: #65601 I believe this feature was supported one year ago: #44921 #Closes: #65525 ghstack-source-id: 138918961 Test Plan: N/A Reviewed By: pritamdamania87, mingzhe09088 Differential Revision: D31163535 fbshipit-source-id: 9321a0a5137a3e265e2b54bd78730ac28c7acd55
Stack from ghstack:
This diff adds support for Process Group point-to-point operations on NCCL backend based on ncclSend/ncclRecv. See #43995 for more context.
Differential Revision: D23709848