Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Kineto][NCCL][5/n] Populate in/out split size info for all_to_all fr…
…om CPU to CUDA kernel Summary: This diff populates all_to_all input and out split size from CPU op to GPU kernel when valid. Test Plan: **Trace example**: - For non all_to_all collective functions: https://fburl.com/perfdoctor/4nobsu15 https://pxl.cl/3GNVb - For all_to_all: https://fburl.com/perfdoctor/f418goys https://pxl.cl/3H2nd Differential Revision: D50762093
- Loading branch information