Release v2.19.4-1: 2.19.4-1 · NVIDIA/nccl

v2.19.4-1
88d44d7
Compare

Choose a tag to compare

View all tags

v2.19.4-1
88d44d7
Compare

Choose a tag to compare

View all tags

sjeaugey tagged this 13 Nov 18:36

Split transport connect phase into multiple steps to avoid port
exhaustion when connecting alltoall at large scale. Defaults to 128
peers per round.
Fix memory leaks on CUDA graph capture.
Fix alltoallv crash on self-sendrecv.
Make topology detection more deterministic when PCI speeds are not
available (fix issue #1020).
Properly close shared memory in NVLS resources.
Revert proxy detach after 5 seconds.
Add option to print progress during transport connect.
Add option to set NCCL_DEBUG to INFO on first WARN.

Assets 2

Source code (zip)

2023-11-13T18:36:12Z
Source code (tar.gz)

2023-11-13T18:36:12Z

Provide feedback

Saved searches

Use saved searches to filter your results more quickly