Use NEVER_ALONE for chained MPI calls #184

Merged: 3 commits merged into master on Nov 29, 2021

Conversation

@csegarragonz (Collaborator, Author) commented on Nov 25, 2021

In this PR I set MPI functions to use the NEVER_ALONE scheduling topology hint by default.

Preliminary results show that this policy reduces the number of cross-host messages by 20% in our baseline experiment.
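For context, here is a minimal standalone sketch of what the NEVER_ALONE placement rule amounts to; the `Host` struct, the `scheduleRanks` function and the slot layout are made up for illustration and are not Faabric's actual scheduler code:

```cpp
// Standalone sketch (not Faabric's scheduler; names are illustrative only).
// The NEVER_ALONE hint means an MPI rank is never scheduled on its own on a
// host: if spilling over to the next host would leave a single rank there,
// that rank is kept with the previous group instead, so more pairs of ranks
// share a host and fewer point-to-point messages have to cross hosts.
#include <algorithm>
#include <string>
#include <vector>

struct Host
{
    std::string name;
    int slots;
};

std::vector<std::string> scheduleRanks(int nRanks,
                                       const std::vector<Host>& hosts,
                                       bool neverAlone)
{
    std::vector<std::string> placement;

    for (const Host& h : hosts) {
        int remaining = nRanks - (int)placement.size();
        if (remaining <= 0) {
            break;
        }

        int group = std::min(remaining, h.slots);

        // With NEVER_ALONE, refuse to start a new host with a single rank;
        // over-subscribe the previous host by one instead.
        if (neverAlone && group == 1 && !placement.empty()) {
            placement.push_back(placement.back());
            break;
        }

        for (int i = 0; i < group; i++) {
            placement.push_back(h.name);
        }
    }

    return placement;
}

// e.g. scheduleRanks(5, {{"hostA", 4}, {"hostB", 4}}, true) keeps all five
// ranks on hostA, whereas the default policy would place a lone rank on
// hostB, making every message to or from it a cross-host message.
```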

I have had to update some tests in test_remote_mpi_worlds.cpp, as they assumed a specific scheduling behaviour.

@csegarragonz added the mpi (Related to the MPI implementation) and scheduler labels on Nov 25, 2021
@csegarragonz self-assigned this on Nov 25, 2021
tests/test/scheduler/test_remote_mpi_worlds.cpp
@@ -201,61 +202,14 @@ TEST_CASE_METHOD(RemoteMpiTestFixture,
thisWorld.destroy();
}

TEST_CASE_METHOD(RemoteMpiTestFixture, "Test barrier across hosts", "[mpi]")
@csegarragonz (Collaborator, Author) commented on Nov 26, 2021

With the new scheduling policy, we will always have at least two remote processes. Thus, we can't test a barrier across hosts without using a distributed setting (both remote ranks must call barrier to unlock).

I managed to port the other tests in this file.
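As a standalone illustration of that point (plain C++ with `std::barrier` rather than the Faabric MpiWorld API, so the names below are not from the test file), both participants of a two-rank barrier have to call it concurrently before either is released:

```cpp
// Minimal illustration: a barrier over two "remote" ranks only releases once
// both ranks have called it, so each rank needs its own thread of execution.
// A single-threaded local fake of the remote host cannot drive this.
#include <barrier>
#include <cstdio>
#include <thread>

int main()
{
    std::barrier worldBarrier(2); // two remote ranks in the world

    auto rankFn = [&](int rank) {
        std::printf("rank %d entering barrier\n", rank);
        worldBarrier.arrive_and_wait(); // blocks until the other rank arrives
        std::printf("rank %d released\n", rank);
    };

    // Calling rankFn(0) and then rankFn(1) from one thread would block
    // forever inside the first call; two concurrent callers are required.
    std::thread t0(rankFn, 0);
    std::thread t1(rankFn, 1);
    t0.join();
    t1.join();
}
```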

A reviewer (Collaborator) commented:

Ok nice, we have quite a lot of this kind of test where we're trying to fake a distributed setting locally, all of which date from before the distributed tests.

Is there a dist test that covers this now? I can't remember. If not, could we add one? Happy to discuss specifics of this offline as I'm not sure we have any Faabric dist tests that cover the MPI implementation.

In future it might be worth trying to port things to dist tests rather than keep the hacky local versions, but in this instance it seems to work ok.

@csegarragonz (Collaborator, Author) replied:

I am introducing distributed tests for MPI (inheriting from mpi-native in #186). The order in which we merge this PR and the other one does not really matter.


Two further review comments on tests/test/scheduler/test_remote_mpi_worlds.cpp were marked outdated and resolved.
@csegarragonz merged commit 657a506 into master on Nov 29, 2021
@csegarragonz deleted the mpi-topo-hint branch on Nov 29, 2021 at 19:13
Labels: mpi (Related to the MPI implementation), scheduler
Projects: none yet
Development: no linked issues
Participants: 2