[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implementations #83859

H-Huang · 2022-08-22T17:50:42Z

Stack from ghstack:

[9/N] [Dispatchable Collectives] Update reduce_scatter with CPU / CUDA implementations #86166 [9/N] [Dispatchable Collectives] Update reduce_scatter with CPU / CUDA implementations
[8/N] [Dispatchable Collectives] Update allgather with CPU / CUDA implementations #84423 [8/N] [Dispatchable Collectives] Update allgather with CPU / CUDA implementations
[7/N] [Dispatchable Collectives] Update reduce with CPU / CUDA implementations #83916 [7/N] [Dispatchable Collectives] Update reduce with CPU / CUDA implementations
[6/N] [Dispatchable Collectives] Update recv with CPU / CUDA implementations #83876 [6/N] [Dispatchable Collectives] Update recv with CPU / CUDA implementations
[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implementations #83859 [5/N] [Dispatchable Collectives] Update send with CPU / CUDA implementations

Changes

Updates for the send collective

Context

#86225

Differential Revision: D40044550

…tations [ghstack-poisoned]

facebook-github-bot · 2022-08-22T17:50:49Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/83859
✖️ Python docs build was skipped
✖️ C++ docs build was skipped
❓Need help or want to give feedback on the CI? Visit our office hours

✅ No Failures (0 Pending)

As of commit cdd1470 (more details on the Dr. CI page):

Expand to see more

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

…tations ghstack-source-id: 5d7d8b0090cb412069937d2b056b32647f8cce9b Pull Request resolved: #83859

…DA implementations" [ghstack-poisoned]

…tations ghstack-source-id: f205305654549ae23eab9f8ab04405d7749dcc89 Pull Request resolved: #83859

…DA implementations" [ghstack-poisoned]

pytorch-bot · 2022-09-13T17:24:49Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/83859

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failures, 12 Pending

As of commit 82deaae:

The following jobs have failed:

linux-bionic-py3_7-clang8-xla / test (xla, 1, 1, linux.2xlarge)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…DA implementations" [ghstack-poisoned]

kwen2501

LGTM! Thanks!

nit: missing _ at the end of function name ("send_cpu", "send_cuda").

H-Huang · 2022-09-30T21:29:33Z

nit: missing _ at the end of function name ("send_cpu", "send_cuda").

The convention is that _ after operation means that the tensor is modified in-place. For barrier and send we do not modify the tensor, therefore those ops dont have "_" after

kwen2501 · 2022-09-30T22:42:50Z

I see, thanks for the education!

…DA implementations" [ghstack-poisoned]

H-Huang · 2022-10-03T23:47:50Z

@H-Huang has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2022-10-04T00:35:42Z

/easycla

As part of the transition to the PyTorch Foundation, this project now requires contributions be covered under the new CLA. See #85559 for additional details.

This comment will trigger a new check of this PR. If you are already covered, you will simply see a new "EasyCLA" check that passes. If you are not covered, a bot will leave a new comment with a link to sign.

H-Huang · 2022-10-04T14:31:02Z

@pytorchbot merge -f "xla failure is unrelated and due to #86093"

pytorchmergebot · 2022-10-04T14:32:31Z

@pytorchbot successfully started a merge job. Check the current status here.
The merge job was triggered with the force (-f) flag. This means your change will be merged immediately, bypassing any CI checks (ETA: 1-5 minutes). If this is not the intended behavior, feel free to use some of the other merge options in the wiki.
Please reach out to the PyTorch DevX Team with feedback or questions!

[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implemen…

dbb6a30

…tations [ghstack-poisoned]

H-Huang requested review from mrshenli, pritamdamania87, zhaojuanmao, rohan-varma, awgu and mingzhe09088 as code owners August 22, 2022 17:50

facebook-github-bot added the cla signed label Aug 22, 2022

H-Huang mentioned this pull request Aug 22, 2022

[4/N] [Dispatchable Collectives] Update all_reduce_ with CPU / CUDA implementations #83810

Closed

facebook-github-bot added the oncall: distributed Add this issue/PR to distributed oncall triage queue label Aug 22, 2022

H-Huang added a commit that referenced this pull request Aug 22, 2022

[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implemen…

a0d2ccb

…tations ghstack-source-id: 5d7d8b0090cb412069937d2b056b32647f8cce9b Pull Request resolved: #83859

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

fbd7ea6

…DA implementations" [ghstack-poisoned]

H-Huang added a commit that referenced this pull request Aug 22, 2022

[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implemen…

92fafef

…tations ghstack-source-id: f205305654549ae23eab9f8ab04405d7749dcc89 Pull Request resolved: #83859

H-Huang requested a review from kwen2501 August 22, 2022 23:04

H-Huang added module: c10d Issues/PRs related to collective communications and process groups release notes: distributed (c10d) release notes category topic: new features topic category labels Aug 22, 2022

This was referenced Aug 22, 2022

[6/N] [Dispatchable Collectives] Update recv with CPU / CUDA implementations #83876

Closed

[7/N] [Dispatchable Collectives] Update reduce with CPU / CUDA implementations #83916

Closed

H-Huang added 5 commits August 31, 2022 11:42

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

fe864bc

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

6387abb

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

789ae46

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

3d4e4d1

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

cdd1470

…DA implementations" [ghstack-poisoned]

H-Huang mentioned this pull request Sep 1, 2022

[8/N] [Dispatchable Collectives] Update allgather with CPU / CUDA implementations #84423

Closed

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

db7f9c3

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

11ca616

…DA implementations" [ghstack-poisoned]

H-Huang added 4 commits September 13, 2022 12:07

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

0fd71ab

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

48795ca

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

65e9cca

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

b6612e9

…DA implementations" [ghstack-poisoned]

kwen2501 approved these changes Sep 30, 2022

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 30, 2022

H-Huang added 2 commits October 3, 2022 07:08

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

f6964a4

…DA implementations" [ghstack-poisoned]

Update on "[5/N] [Dispatchable Collectives] Update send with CPU / CU…

82deaae

…DA implementations" [ghstack-poisoned]

H-Huang mentioned this pull request Oct 3, 2022

[9/N] [Dispatchable Collectives] Update reduce_scatter with CPU / CUDA implementations #86166

Closed

pytorchmergebot added the Merged label Oct 4, 2022

pytorchmergebot closed this in 3f2e7d5 Oct 4, 2022

facebook-github-bot deleted the gh/H-Huang/77/head branch June 8, 2023 14:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implementations #83859

[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implementations #83859

H-Huang commented Aug 22, 2022 •

edited

facebook-github-bot commented Aug 22, 2022 •

edited

pytorch-bot bot commented Sep 13, 2022 •

edited

kwen2501 left a comment

H-Huang commented Sep 30, 2022

kwen2501 commented Sep 30, 2022

H-Huang commented Oct 3, 2022

facebook-github-bot commented Oct 4, 2022

H-Huang commented Oct 4, 2022

pytorchmergebot commented Oct 4, 2022

[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implementations #83859

[5/N] [Dispatchable Collectives] Update send with CPU / CUDA implementations #83859

Conversation

H-Huang commented Aug 22, 2022 • edited

Changes

Context

facebook-github-bot commented Aug 22, 2022 • edited

🔗 Helpful links

✅ No Failures (0 Pending)

pytorch-bot bot commented Sep 13, 2022 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/83859

❌ 1 Failures, 12 Pending

kwen2501 left a comment

Choose a reason for hiding this comment

H-Huang commented Sep 30, 2022

kwen2501 commented Sep 30, 2022

H-Huang commented Oct 3, 2022

facebook-github-bot commented Oct 4, 2022

H-Huang commented Oct 4, 2022

pytorchmergebot commented Oct 4, 2022

H-Huang commented Aug 22, 2022 •

edited

facebook-github-bot commented Aug 22, 2022 •

edited

pytorch-bot bot commented Sep 13, 2022 •

edited