This issue defines the integration of GPU support in MPICH. Currently we enable GPU-to-GPU buffer transfers through the MPI interface via the UCX netmod in the CH4 device (the actual transport is provided by UCX). However, this works only for NVIDIA GPUs and contiguous datatypes. Non-contiguous datatype buffers are packed in host memory and then transferred, which involves a device-to-host memory copy (sketched below). Additionally, collective reductions are not supported, and data on intermediate nodes has to be staged to host memory to perform the operations.
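To make the cost of the current fallback concrete, here is a minimal sketch of the host-staging path described above, assuming a staging copy of the full datatype extent followed by a host-side pack; the function name and buffer handling are illustrative, not MPICH internals:

```c
#include <mpi.h>
#include <cuda_runtime.h>
#include <stdlib.h>

/* Illustrative sketch: sending a non-contiguous datatype from a GPU buffer
 * by first staging the whole datatype extent in host memory (the
 * device-to-host copy mentioned above), then packing and sending on the host. */
void send_noncontig_from_gpu(const void *gpu_buf, int count, MPI_Datatype dt,
                             int dest, int tag, MPI_Comm comm)
{
    MPI_Aint lb, extent;
    MPI_Type_get_extent(dt, &lb, &extent);

    /* Device-to-host staging copy of the full extent. */
    void *host_stage = malloc(count * extent);
    cudaMemcpy(host_stage, gpu_buf, count * extent, cudaMemcpyDeviceToHost);

    /* Pack on the host, then send the contiguous packed bytes. */
    int pack_size, pos = 0;
    MPI_Pack_size(count, dt, comm, &pack_size);
    void *packed = malloc(pack_size);
    MPI_Pack(host_stage, count, dt, packed, pack_size, &pos, comm);
    MPI_Send(packed, pos, MPI_PACKED, dest, tag, comm);

    free(packed);
    free(host_stage);
}
```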
The following features will be needed to enable full GPU support:
- Contiguous data movement for inter-node communication using GPU Direct 3.0 (GPUDirect RDMA) for pinned buffers (likely no changes needed in MPICH).
- Contiguous data movement for inter-node communication using GPU Direct 3.0 (GPUDirect RDMA) for unified memory buffers; either raise an error or perform an explicit memcpy (see the pointer-classification sketch after this list). (Ticket: GPU-aware contiguous data movement optimization #3583)
- [Generic fallback] Non-contiguous data movement support through contiguous copies, i.e. updates to `MPIR_Localcopy` (see the contiguous-copy sketch below). (Ticket: Enable GPU-aware generic fallback #3582)
- Non-contiguous data movement support through vector-copy mechanisms (e.g., `cudaMemcpy2D`/`cudaMemcpy3D`; see the sketch below).
- Non-contiguous data movement support through kernel offload (see the pack-kernel sketch below).
- [Generic fallback] Reduction operation support through CPU computation, where the CPU fetches the GPU data and performs the operation (see the reduction sketch below). (Ticket: Enable GPU-aware generic fallback #3582)
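For the unified-memory item above, the transport first needs to know what kind of pointer it was handed before choosing a path. A minimal sketch of such a check, assuming CUDA 10+ where `cudaPointerAttributes` exposes a `type` field; the `classify_buffer` name and the enum are made up for illustration:

```c
#include <cuda_runtime.h>

typedef enum { BUF_HOST, BUF_DEVICE, BUF_MANAGED } buf_kind_t;

/* Classify a user pointer so the transport can pick a path:
 * direct RDMA for device memory, error-or-memcpy for managed memory. */
buf_kind_t classify_buffer(const void *ptr)
{
    struct cudaPointerAttributes attr;
    if (cudaPointerGetAttributes(&attr, ptr) != cudaSuccess) {
        cudaGetLastError();  /* clear the error; treat as plain host memory */
        return BUF_HOST;
    }
    switch (attr.type) {
        case cudaMemoryTypeDevice:  return BUF_DEVICE;
        case cudaMemoryTypeManaged: return BUF_MANAGED;
        default:                    return BUF_HOST;
    }
}
```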
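For the `MPIR_Localcopy` item, the generic fallback boils down to a contiguous copy routine that tolerates device pointers. A sketch under the assumption that unified virtual addressing is available, so `cudaMemcpyDefault` can infer the copy direction (`localcopy_contig` is a hypothetical helper, not the real `MPIR_Localcopy`):

```c
#include <cuda_runtime.h>
#include <string.h>

/* Hypothetical contiguous-copy helper: with UVA, cudaMemcpyDefault works for
 * any host/device pointer pair, so non-contiguous datatypes can then be
 * handled as a loop of such contiguous copies. */
void localcopy_contig(void *dst, const void *src, size_t bytes, int maybe_gpu)
{
    if (maybe_gpu)
        cudaMemcpy(dst, src, bytes, cudaMemcpyDefault);
    else
        memcpy(dst, src, bytes);
}
```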
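For the vector-copy item, a strided layout such as an `MPI_Type_vector` (blocklen bytes every stride bytes, nblocks times) maps onto a single `cudaMemcpy2D` instead of one copy per block. A minimal sketch with illustrative parameter names:

```c
#include <cuda_runtime.h>

/* Gather a strided device layout into a contiguous device buffer with one
 * call: dpitch == blocklen makes the destination dense, spitch == stride
 * walks the source in extent-sized steps. */
void pack_vector_d2d(void *contig_dst, const void *strided_src,
                     size_t blocklen, size_t stride, size_t nblocks)
{
    cudaMemcpy2D(contig_dst, blocklen, strided_src, stride,
                 blocklen, nblocks, cudaMemcpyDeviceToDevice);
}
```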
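For the kernel-offload item, packing can instead run entirely on the device: each thread moves one byte of the strided layout into the packed buffer, so no host staging is needed. A sketch of such a pack kernel; the names and the byte-granularity choice are illustrative:

```cuda
/* One thread per packed byte: thread i finds its (block, offset) position in
 * the strided source and writes it densely into dst. */
__global__ void pack_vector_kernel(char *dst, const char *src,
                                   size_t blocklen, size_t stride,
                                   size_t nblocks)
{
    size_t i = (size_t)blockIdx.x * blockDim.x + threadIdx.x;
    if (i < blocklen * nblocks) {
        size_t block = i / blocklen;
        size_t off   = i % blocklen;
        dst[i] = src[block * stride + off];
    }
}
```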
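Finally, for the CPU-computation reduction fallback, a sketch of the staging pattern: both device operands are copied to the host, reduced there with `MPI_Reduce_local`, and the result copied back. This assumes a contiguous datatype whose size equals its extent; the function name is illustrative:

```c
#include <mpi.h>
#include <cuda_runtime.h>
#include <stdlib.h>

/* Reduce two device buffers on the CPU: stage both to host, apply the MPI
 * reduction op there, and copy the result back to the device. */
void reduce_gpu_on_cpu(const void *gpu_in, void *gpu_inout,
                       int count, MPI_Datatype dt, MPI_Op op)
{
    int tsize;
    MPI_Type_size(dt, &tsize);
    size_t bytes = (size_t)count * tsize;

    void *h_in = malloc(bytes);
    void *h_inout = malloc(bytes);
    cudaMemcpy(h_in, gpu_in, bytes, cudaMemcpyDeviceToHost);
    cudaMemcpy(h_inout, gpu_inout, bytes, cudaMemcpyDeviceToHost);

    MPI_Reduce_local(h_in, h_inout, count, dt, op);  /* arithmetic on the CPU */

    cudaMemcpy(gpu_inout, h_inout, bytes, cudaMemcpyHostToDevice);
    free(h_in);
    free(h_inout);
}
```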
@gcongiu I am trying to note the FY19Q2 progress. Could you please give me an update, or point me to the correct PR I need to look at for the progress of this issue?