NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
-
Updated
Nov 15, 2023 - C++
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
Advanced High Performance Computing in C with OpenMP, CUDA, MPI and NCCL. The folder project includes my final project for the special course. I implemented a Jacobi-solver for the Poisson partial differential problem both using OpenMP in the CPU, using CUDA on the GPU and using CUDA, MPI and NCCL on multiple GPUs.
Blood Cell Simulation server
Add a description, image, and links to the nccl topic page so that developers can more easily learn about it.
To associate your repository with the nccl topic, visit your repo's landing page and select "manage topics."