Skip to content

LenaO/tutorial-multi-gpu

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SC24 Tutorial: Efficient Distributed GPU Programming for Exascale

DOI

Repository with talks and exercises of our Efficient GPU Programming for Exascale tutorial, to be held at SC24.

Coordinates

  • Date: 17 November 2024
  • Occasion: SC24 Tutorial
  • Tutors: Simon Garcia de Gonzalo (SNL), Andreas Herten (JSC), Markus Hrywniak (NVIDIA), Jiri Kraus (NVIDIA), Lena Oden (Uni Hagen)

Setup

The tutorial is an interactive tutorial with introducing lectures and practical exercises to apply knowledge. The exercises have been derived from the Jacobi solver implementations available in NVIDIA/multi-gpu-programming-models.

Walk-through:

Curriculum:

  1. Lecture: Tutorial Overview, Introduction to System + Onboarding Andreas
  2. Lecture: MPI-Distributed Computing with GPUs Simon
  3. Hands-on: Multi-GPU Parallelization
  4. Lecture: Performance / Debugging Tools Markus
  5. Lecture: Optimization Techniques for Multi-GPU Applications Simon
  6. Hands-on: Overlap Communication and Computation with MPI
  7. Lecture: Overview of NCCL and NVSHMEN in MPI Lena
  8. Hands-on: Using NCCL and NVSHMEM
  9. Lecture: Device-initiated Communication with NVSHMEM Jiri
  10. Hands-on: Using Device-Initiated Communication with NVSHMEM
  11. Lecture: Conclusion and Outline of Advanced Topics Andreas

About

SC21 Tutorial on Multi GPU Usage at Exascale

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Cuda 44.0%
  • C++ 33.9%
  • Jupyter Notebook 12.2%
  • Makefile 8.0%
  • Shell 1.1%
  • TeX 0.6%
  • Python 0.2%