Commit
[NCCL] Add Error log when ProcessGroupNCCL takes down process upon timeout/error

Pull Request resolved: #44988

The new NCCL async error handling feature throws an exception from the workCleanup Thread if one of the NCCL operations encounters an error or times out. This PR adds an error log to make it more clear to the user why the training process crashed.

ghstack-source-id: 113876146

Differential Revision: [D23794801](https://our.internmc.facebook.com/intern/diff/D23794801/)