Libipc is a small library containing example code that illustrates how to share GPU data between MPI processes sharing a GPU, using Inter-Process Communication (IPC). The code uses the ROCm HIP API and is based on published CUDA API examples converted to HIP (and back to CUDA for the CUDA version).
This work was undertaken by the staff of the ARCHER2 Centre of Excellence and was motivated by a challenge from one user community.
- Nodes with multiple GPUs
  - We assume a single GPU per rank (via the `XXX_VISIBLE_DEVICES` environment variable)
- Multiple MPI ranks attached to the same GPU
  - We could use sub-communicators (`gpu_node_communicator`)
- Rank 0 of each `gpu_node_communicator` allocates data
  - Data are shared with the other ranks of `gpu_node_communicator` with direct access (see the sketch below)
  - Minimal synchronization implied (at the level of MPI or GPU?)
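The scheme above can be illustrated with a short HIP + MPI sketch. This is a minimal illustration under stated assumptions, not the library's actual code: the shared-memory communicator split (which assumes all ranks on a node share one GPU) stands in for `gpu_node_communicator`, and the `HIP_CHECK` helper is introduced here for error handling.

```cpp
// Minimal sketch: rank 0 of the per-GPU sub-communicator allocates device
// memory and exports an IPC handle; the other ranks open the handle and
// gain direct access to the same allocation.
#include <mpi.h>
#include <hip/hip_runtime.h>
#include <cstdio>

// Hypothetical error-checking helper for this example.
#define HIP_CHECK(call)                                             \
  do {                                                              \
    hipError_t err_ = (call);                                       \
    if (err_ != hipSuccess) {                                       \
      fprintf(stderr, "HIP error: %s\n", hipGetErrorString(err_));  \
      MPI_Abort(MPI_COMM_WORLD, 1);                                 \
    }                                                               \
  } while (0)

int main(int argc, char** argv) {
  MPI_Init(&argc, &argv);
  int world_rank;
  MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);

  // Assumption: all ranks on a node share one GPU, so a shared-memory
  // split stands in for the gpu_node_communicator described above.
  // Each rank sees a single device via the visible-devices variable,
  // so device 0 is used implicitly.
  MPI_Comm gpu_node_communicator;
  MPI_Comm_split_type(MPI_COMM_WORLD, MPI_COMM_TYPE_SHARED, world_rank,
                      MPI_INFO_NULL, &gpu_node_communicator);
  int node_rank;
  MPI_Comm_rank(gpu_node_communicator, &node_rank);

  const size_t nbytes = 1024 * sizeof(double);
  double* dptr = nullptr;
  hipIpcMemHandle_t handle;

  if (node_rank == 0) {
    // Rank 0 owns the allocation and exports an IPC handle for it.
    HIP_CHECK(hipMalloc(&dptr, nbytes));
    HIP_CHECK(hipIpcGetMemHandle(&handle, dptr));
  }

  // Share the opaque handle with the other ranks of the sub-communicator.
  MPI_Bcast(&handle, sizeof(handle), MPI_BYTE, 0, gpu_node_communicator);

  if (node_rank != 0) {
    // Map rank 0's allocation into this process; dptr is now usable
    // directly in kernels and memory copies on this rank.
    HIP_CHECK(hipIpcOpenMemHandle(reinterpret_cast<void**>(&dptr), handle,
                                  hipIpcMemLazyEnablePeerAccess));
  }

  // ... use dptr on all ranks of the sub-communicator ...

  // Synchronize before tear-down so no rank frees the allocation while
  // another still holds a mapping.
  MPI_Barrier(gpu_node_communicator);
  if (node_rank != 0) {
    HIP_CHECK(hipIpcCloseMemHandle(dptr));
  } else {
    HIP_CHECK(hipFree(dptr));
  }

  MPI_Comm_free(&gpu_node_communicator);
  MPI_Finalize();
  return 0;
}
```

The MPI barrier before the clean-up is one way to provide the minimal synchronization mentioned above; any additional ordering of reads and writes to the shared buffer still has to be enforced by the application, at the MPI or GPU level.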
First, you need to set the proper environment for CUDA or HIP. We provide two corresponding scripts (`sourceme_cuda.sh` for A100 nodes and `sourceme_hip.sh` for MI250X nodes) that can be used on HPE Cray EX systems.
Then, you can use `make ACC=cuda` or `make ACC=hip` to compile the library and the examples. By default, the Fortran example is compiled; you can add `EXT=cpp` to compile the C++ example.
For the execution, we provide an example batch script based on SLURM (`run.sh`).
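Putting these steps together, a typical build-and-run session on an MI250X system might look like the following (submitting `run.sh` with `sbatch` is an assumption; adapt the script to your system and account):

```sh
source sourceme_hip.sh   # set the ROCm/HIP environment (use sourceme_cuda.sh on A100 nodes)
make ACC=hip EXT=cpp     # build the library and the C++ example (omit EXT=cpp for Fortran)
sbatch run.sh            # submit the provided SLURM example job
```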
The original contributors to this work were:
- Alfio Lazzaro
- Douglas Shanks
- Harvey Richardson