Adding sysimage generation to mpi tests #48

amartinhuertas · 2021-10-21T03:30:16Z

Solves the medium-priority task in #39.

IMPORTANT NOTE: This PR MUST BE reviewed after #47

codecov-commenter · 2021-10-21T03:41:14Z

Codecov Report

Merging #48 (120e0a4) into release-0.2 (73de176) will not change coverage.
The diff coverage is n/a.

@@             Coverage Diff             @@
##           release-0.2     #48   +/-   ##
===========================================
  Coverage         0.00%   0.00%           
===========================================
  Files               33      33           
  Lines             3205    3205           
===========================================
  Misses            3205    3205

Impacted Files	Coverage Δ
src/Visualization.jl	`0.00% <ø> (ø)`

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

fverdugo

I am not sure about this. It introduces considerable complexity and does not have a positive impact on the test times. In fact, judging by the GitHub Actions runs, it even seems to lead to slower times.

Perhaps separating the sequential and mpi tests is enough for now.

amartinhuertas · 2021-10-21T05:16:40Z

> I am not sure about this. It introduces considerable complexity and does not have a positive impact on the test times. In fact, judging by the GitHub Actions runs, it even seems to lead to slower times.

Yes, for now it does not improve much. But if we want to run tests with other numbers of processors (apart from 1 and 4), this will be very helpful.

amartinhuertas · 2021-10-21T05:18:15Z

> I am not sure about this. It introduces considerable complexity and does not have a positive impact on the test times. In fact, judging by the GitHub Actions runs, it even seems to lead to slower times.

Besides, we will be testing the ability to precompile GridapDistributed.jl, which is a must on clusters.

fverdugo · 2021-10-21T05:24:21Z

> Besides, we will be testing the ability to precompile GridapDistributed.jl, which is a must on clusters.

Yes, but precompilation needs to be done in the final app, because we cannot anticipate which other packages the user is going to call.

With PackageCompiler, generating a system image is straightforward once you have the final app. I think it is not our job to provide a precompilation mechanism when we have PackageCompiler at our disposal.

In any case, perhaps it is a good idea to create a new repo, GridapBenchmarks or whatever, where we can add both serial and parallel benchmarks. And we definitely want precompilation scripts in that repo.

amartinhuertas · 2021-10-21T05:28:48Z

I think it makes a lot of sense to test the workflow that you are going to face when dealing with a cluster. The repo of apps is going to be abandoned and not tested frequently (we already have plenty of experience with similar repos).

fverdugo · 2021-10-21T05:38:12Z

> I think it makes a lot of sense to test the workflow that you are going to face when dealing with a cluster. The repo of apps is going to be abandoned and not tested frequently (we already have plenty of experience with similar repos).

This is a very good point! But as the sysimage is created now, it does not test the actual workflow that one needs to follow on a cluster. To check the workflow, you need an app, i.e., one or more "main" functions inside a package, and then you create a system image for those functions.

We have several options:

1. Move the "main" functions of the tests inside the src dir and then create a system image by calling them on a single MPI task.
2. Create a dummy "test app" in the test dir, move the "main" test functions to the src dir of the "test app", and then precompile the "test app".

I would definitely go for 2, even though it has some more boilerplate code, since it actually mimics what a user needs to do on a cluster.

What do you think?
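
Option 2 could be wired up roughly as follows. This is only a sketch under assumptions: the `test/TestApp` location, the `compile/warmup.jl` script name, and both helper functions are hypothetical and are not taken from this PR. The only library call is PackageCompiler's `create_sysimage`, with its `sysimage_path`, `project`, and `precompile_execution_file` keywords.

```julia
using PackageCompiler  # assumed to be installed in the environment running this script

# Hypothetical layout: the dummy app lives in `app_dir` and ships a warm-up
# script that calls its "main" functions once on a single MPI task.
function sysimage_settings(app_dir::AbstractString)
    return (
        project = String(app_dir),
        sysimage_path = joinpath(app_dir, "TestApp.so"),
        precompile_execution_file = joinpath(app_dir, "compile", "warmup.jl"),
    )
end

# Build a system image that contains TestApp plus the code paths
# exercised by the warm-up script, and return the path of the image.
function build_test_sysimage(app_dir::AbstractString)
    s = sysimage_settings(app_dir)
    create_sysimage([:TestApp];
                    sysimage_path = s.sysimage_path,
                    project = s.project,
                    precompile_execution_file = s.precompile_execution_file)
    return s.sysimage_path
end
```

A CI step could then call `build_test_sysimage(joinpath("test", "TestApp"))` once and reuse the resulting file for every MPI run.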

amartinhuertas · 2021-10-21T05:39:18Z

> Yes, but precompilation needs to be done in the final app, because we cannot anticipate which other packages the user is going to call.

I agree that the solution we currently have in this branch is not definitive, in the sense that it does not test exactly what will happen with the actual app, with all its dependencies, etc. But even with the current solution we may detect a precompilation failure right after a commit. This can give us useful early information that would otherwise be very hard to obtain if the issue only shows up when deploying the software on the cluster.

fverdugo · 2021-10-21T05:39:40Z

And BTW, the precompilation scripts should be in the "test app" (i.e., the final app) as usual.

amartinhuertas · 2021-10-21T05:45:00Z

> What do you think?

Ok. Let us go for 2. Can we use the sysimage of the test app to accelerate the mpi tests or would you avoid that?

amartinhuertas · 2021-10-22T00:00:56Z

@fverdugo ... all tests are now passing. PR ready to review/merge.

The CI script generates the TestApp.so sysimage and then uses it to run the MPI tests with 1 and 4 MPI tasks. Generating the image takes 21 min according to GitHub CI. On the other hand, the runtime reported for the parallel MPI tests in GitHub CI is misleading: https://github.com/gridap/GridapDistributed.jl/runs/3970159570?check_suite_focus=true I think it only measures the time spent in the runtests.jl script, not the time spent in the mpirun processes forked from it.
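
For reference, a driver in the spirit of runtests.jl could launch the MPI runs with the prebuilt image along these lines. This is a sketch, not the script in this PR: the default file names, the `mpiexec` launcher name, and both helper functions are hypothetical. `--sysimage` is Julia's command-line flag for loading a custom system image. Since Julia's `run` waits for the launched command to exit by default, a timer placed around a blocking `run` call would also cover the time spent in the MPI processes.

```julia
# Build the command line for one MPI run that loads a custom system image.
# All defaults are placeholders, not the paths used in this repository.
function mpi_test_cmd(np::Integer;
                      sysimage::AbstractString = "TestApp.so",
                      project::AbstractString = ".",
                      script::AbstractString = "mpi_tests_body.jl",
                      mpiexec::AbstractString = "mpiexec")
    return `$mpiexec -n $np julia --sysimage=$sysimage --project=$project $script`
end

# Run the tests for each task count and report the wall time of each launch.
function run_mpi_tests(task_counts = (1, 4); kwargs...)
    for np in task_counts
        cmd = mpi_test_cmd(np; kwargs...)
        # `run` blocks until the command exits, so the MPI processes are included.
        elapsed = @elapsed run(cmd)
        println("np = $np finished in $(round(elapsed; digits = 1)) s")
    end
end
```

Calling `run_mpi_tests()` would launch the 1-task and 4-task runs one after the other; passing `sysimage = "path/to/TestApp.so"` selects a different image.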

Generating sysimage in Github actions in order to reduce MPI tests time

eec734a

amartinhuertas changed the base branch from master to release-0.2 October 21, 2021 03:30

amartinhuertas requested a review from fverdugo October 21, 2021 03:31

amartinhuertas mentioned this pull request Oct 21, 2021

Misc pending tasks associated with refactoring in branch release-0.2 #39

fverdugo requested changes Oct 21, 2021

Merge branch 'release-0.2' of github.com:gridap/GridapDistributed.jl into adding_sysimage_generation_to_mpi_tests

d485703

amartinhuertas and others added 13 commits October 21, 2021 17:46

Generated Project.toml and Manifest.toml in TestApp

519ee36

More work in TestApp

33d0013

Removing WriteVTK from Project.toml TestAPP

d1d7e29

Preliminary version of TestApp (without image generation yet in Github Actions CI)

a911f23

Separated the body of runtests_np4.jl in a separate file with reuse in mind

3380ba7

Instantiating TestApp in ci.yml

fe3f36a

Fixing paths to --project in runtests.jl mpi

953b9a6

* Deactivating package build, do it manually in CI.yml

97c007e

* Addint automated image generation in CI.yml

Fixing typo in seq tests

cb416f3

Debugging image generation

953eef7

Update ci.yml

4fb646b

Merge branch 'release-0.2' of github.com:gridap/GridapDistributed.jl into adding_sysimage_generation_to_mpi_tests

e998a2b

Fixing TestApp Manifest.toml

120e0a4

fverdugo approved these changes Oct 22, 2021

fverdugo merged commit d7ccea9 into release-0.2 Oct 22, 2021

fverdugo deleted the adding_sysimage_generation_to_mpi_tests branch October 22, 2021 12:59
