@devreal
Contributor

@devreal commented Jun 21, 2019

This PR changes the shared memory allocation in the rdma osc so that error codes are synchronized among the ranks in the shared memory communicator, which prevents rank 0 from aborting without notifying the other ranks. The changes add two allreduce calls on the shared memory communicator and one on the window communicator, and replace two barrier calls with the error code synchronization.
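
To illustrate why an allreduce helps where a barrier does not, here is a minimal plain-C model of the agreement step. It is not the code from this PR and does not call MPI: the ranks are modelled as array entries, and `model_allreduce_max`, `synchronize_status`, and the `STATUS_*` values are hypothetical names chosen for this sketch. In a real MPI program the reduction would presumably be an allreduce on the communicator (for example `MPI_Allreduce` with a max or min operation).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical status values for this sketch; not Open MPI's real error codes. */
enum { STATUS_OK = 0, STATUS_ALLOC_FAILED = 1 };

/* Models an allreduce with a MAX operation: every rank contributes its local
 * status and every rank receives the same combined value. */
static int model_allreduce_max(const int *local_status, size_t nranks)
{
    int combined = STATUS_OK;
    for (size_t i = 0; i < nranks; ++i) {
        if (local_status[i] > combined) {
            combined = local_status[i];
        }
    }
    return combined;
}

/* Overwrites each rank's local status with the agreed status and returns it.
 * After this call, either all ranks see STATUS_OK or all ranks see the error,
 * so no rank continues while another one bails out. */
int synchronize_status(int *status, size_t nranks)
{
    assert(status != NULL && nranks > 0);
    int agreed = model_allreduce_max(status, nranks);
    for (size_t i = 0; i < nranks; ++i) {
        status[i] = agreed;
    }
    return agreed;
}
```

A barrier only tells each rank that the others have arrived; it carries no value, so a failure on rank 0 stays local. Reducing the status instead lets every rank take the same error path.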

Cherry-pick of #6746 to v4.0.x

Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>

@ompiteam-bot

Can one of the admins verify this patch?

@jsquyres
Member

ok to test

@devreal force-pushed the osc_winalloc_err_v4.0.x branch from d97a21b to bbbfd0a Compare June 24, 2019 15:45
…roup

Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>
(cherry picked from commit 8f27cc2)
@devreal force-pushed the osc_winalloc_err_v4.0.x branch from bbbfd0a to c5cf343 Compare June 24, 2019 15:49
@hppritcha
Member

bot:ompi:retest

@gpaulsen modified the milestones: v4.0.x, v4.0.2 Jun 24, 2019
@hppritcha self-requested a review June 24, 2019 19:25
@hppritcha
Member

bot:ompi:retest

@gpaulsen merged commit 514e273 into open-mpi:v4.0.x Jun 28, 2019