NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
-
Updated
Nov 15, 2023 - C++
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
Blood Cell Simulation server
Advanced High Performance Computing in C with OpenMP, CUDA, MPI and NCCL. The folder project includes my final project for the special course. I implemented a Jacobi-solver for the Poisson partial differential problem both using OpenMP in the CPU, using CUDA on the GPU and using CUDA, MPI and NCCL on multiple GPUs.
Add a description, image, and links to the nccl topic page so that developers can more easily learn about it.
To associate your repository with the nccl topic, visit your repo's landing page and select "manage topics."