Skip to content

Commit

Permalink
[Docs] Minor doc fixes for init_process_group
Browse files Browse the repository at this point in the history
Pull Request resolved: #47644

Minor Update to the init_process_group docs.
ghstack-source-id: 116337449

Differential Revision: [D24633432](https://our.internmc.facebook.com/intern/diff/D24633432/)
  • Loading branch information
osalpekar committed Nov 10, 2020
1 parent abae12b commit 0353911
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions torch/distributed/distributed_c10d.py
Expand Up @@ -401,11 +401,11 @@ def init_process_group(backend,
asynchronously and the process will crash. ``NCCL_BLOCKING_WAIT``
will provide errors to the user which can be caught and handled,
but due to its blocking nature, it has a performance overhead. On
the other hand, ``NCCL_ASYNC_ERROR_HANDLING`` has little
the other hand, ``NCCL_ASYNC_ERROR_HANDLING`` has very little
performance overhead, but crashes the process on errors. This is
done since CUDA execution is async and it is no longer safe to
continue executing user code since failed async NCCL operations
might result in subsequent CUDA operations to run on corrupted
might result in subsequent CUDA operations running on corrupted
data. Only one of these two environment variables should be set.
group_name (str, optional, deprecated): Group name.
Expand Down

0 comments on commit 0353911

Please sign in to comment.