Fix distributed documentation for asynchronous collective Work objects #45709
Conversation
Codecov Report
@@           Coverage Diff            @@
##           master   #45709   +/-   ##
=======================================
  Coverage    68.25%   68.25%
=======================================
  Files          410      410
  Lines        53246    53246
=======================================
+ Hits         36343    36344       +1
+ Misses       16903    16902       -1
Continue to review full report at Codecov.
Thanks for fixing!
docs/source/distributed.rst
Outdated
- asynchronous operation - when ``async_op`` is set to True. The collective operation function
+ Every collective operation function supports the following two kinds of operations, depending on the setting of the ``async_op`` flag passed into the collective:
let's break this into shorter lines
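For context, the behavior the reworded sentence describes can be sketched roughly as follows. This is a minimal illustration, assuming a process group has already been initialized on every rank (e.g. with the Gloo backend and CPU tensors); the tensor values are made up:

```python
import torch
import torch.distributed as dist

# Assumes dist.init_process_group(...) has already been called on every rank.
rank = dist.get_rank()

# Synchronous operation (the default, async_op=False): for CPU collectives,
# the call returns only after the collective has completed.
t_sync = torch.ones(4) * rank
dist.all_reduce(t_sync, op=dist.ReduceOp.SUM)

# Asynchronous operation (async_op=True): the call returns a Work object
# immediately; wait() blocks until the operation has finished (for CPU collectives).
t_async = torch.ones(4) * rank
work = dist.all_reduce(t_async, op=dist.ReduceOp.SUM, async_op=True)
work.wait()
```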
docs/source/distributed.rst
Outdated
further function calls utilizing the output of the collective call will behave as expected. For CUDA collectives,
function calls utilizing the output on the same CUDA stream will behave as expected. Users must take care of
synchronization under the scenario of running under different streams. For details on CUDA semantics such as stream
synchronization, see `cuda semantics <https://pytorch.org/docs/stable/autograd.html#profiler>`__.
cuda semantics points to profiler, is this intentional?
Thanks for the catch! It should actually point to https://pytorch.org/docs/stable/notes/cuda.html
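To make the stream-semantics point concrete, here is a rough sketch of the NCCL case described above, assuming a NCCL process group is already initialized and each rank has selected its own GPU; variable names are illustrative:

```python
import torch
import torch.distributed as dist

# Assumes dist.init_process_group("nccl", ...) has been called and
# torch.cuda.set_device(...) has selected this rank's GPU.
output = torch.ones(4, device="cuda") * dist.get_rank()
work = dist.all_reduce(output, async_op=True)

# For CUDA collectives, wait() only ensures the operation has been enqueued
# on the current CUDA stream; kernels launched later on that same stream are
# ordered after the collective and will see the reduced values.
work.wait()
on_same_stream = output * 2  # safe: runs on the current stream

# Using the result on a different stream requires explicit synchronization.
side_stream = torch.cuda.Stream()
side_stream.wait_stream(torch.cuda.current_stream())
with torch.cuda.stream(side_stream):
    on_side_stream = output * 2  # safe only because of wait_stream() above
```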
@rohan-varma has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@rohan-varma merged this pull request in 154347d.
Closes #42247. Clarifies documentation related to `Work` object semantics (the outputs of async collective functions). Clarifies the difference between CPU operations and CUDA operations (on the Gloo or NCCL backend), and adds an example where understanding the CUDA-specific `wait()` semantics is necessary to write correct code.