v2.4.2-1
Add tree algorithms for allreduce to improve performance at scale. Add ncclCommAbort() and ncclCommGetAsyncError() to properly handle network errors and permit recovery. Detect initial CPU affinity and no longer escape it.
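To illustrate why a tree helps at scale, here is a minimal host-side sketch in plain C. It is not NCCL's implementation: it simulates a sum-allreduce over `n` ranks with a single binary tree in heap order (an assumption made for simplicity) and counts latency steps only, ignoring bandwidth and pipelining. The function names `tree_allreduce_sum` and `ring_allreduce_steps` are hypothetical and are not part of the NCCL API.

```c
#include <assert.h>
#include <stddef.h>

/* Simulated in-place sum-allreduce over n ranks using one binary tree
 * rooted at rank 0, where the children of rank r are 2r+1 and 2r+2.
 * vals[r] is rank r's contribution; on return every vals[r] holds the total.
 * Returns the number of latency steps: tree depth for the reduce phase
 * plus tree depth for the broadcast phase. */
int tree_allreduce_sum(double *vals, int n)
{
    assert(vals != NULL && n > 0);

    /* Reduce phase: visit ranks from last to first, so each rank has
     * already absorbed its children before it is added into its parent. */
    for (int r = n - 1; r > 0; --r) {
        vals[(r - 1) / 2] += vals[r];
    }

    /* Broadcast phase: visit ranks from first to last, so each parent
     * already holds the final total before its children copy it. */
    for (int r = 1; r < n; ++r) {
        vals[r] = vals[(r - 1) / 2];
    }

    /* Depth of the tree: number of hops from the last rank up to the root. */
    int depth = 0;
    for (int r = n - 1; r > 0; r = (r - 1) / 2) {
        ++depth;
    }
    return 2 * depth;
}

/* A ring allreduce needs n-1 reduce-scatter steps plus n-1 allgather steps. */
int ring_allreduce_steps(int n)
{
    assert(n > 0);
    return 2 * (n - 1);
}
```

Under this simplified model, 1024 ranks need 20 latency steps with the tree versus 2046 with the ring, so the step count grows logarithmically rather than linearly with the number of ranks.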
2019-01-29T23:19:27Z -