Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Distributed Compilation as an option to DaCe Program #1555

Merged
merged 2 commits into from
Apr 16, 2024

Conversation

kotsaloscv
Copy link
Collaborator

@kotsaloscv kotsaloscv commented Apr 8, 2024

Option to activate/deactivate Distributed Compilation.

This small PR is based on the following comment (DAPP/DaCe Mattermost channel):
I have an unexpected behaviour in DaCe distributed compilation.
Currently, if you have an MPI program, distributed compilation is the default behaviour (as seen in this file). I was expecting that after the loading of the compiled sdfg every rank would do symbol specialization.
Although, this is not the case, i.e. every rank uses the compiled sdfg from rank 0, which specializes its symbols with the values corresponding to rank 0. Therefore, the compiled sdfg loaded by all the other ranks use a wrong sdfg (symbols are not specialized with the values of the correct rank).
To validate this behaviour, I have de-activated the distributed compilation and set dace.config.Config.set("cache", value="unique"). Indeed, this approach works without any issue.
Is there a way to change this unexpected behaviour, i.e. to have by default the distributed compilation but every rank to perform symbol specialization.
To give a bit more context, I am generating an sdfg that uses closures heavily, i.e. all the gt4py fields are defined externally to the sdfg (could that be an issue)?

@kotsaloscv kotsaloscv self-assigned this Apr 8, 2024
@kotsaloscv kotsaloscv requested a review from tbennun April 16, 2024 07:18
@tbennun tbennun added this pull request to the merge queue Apr 16, 2024
Merged via the queue into master with commit 78759b5 Apr 16, 2024
9 checks passed
@tbennun tbennun deleted the distributed_compilation branch April 16, 2024 15:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants