Returning and broadcasting _replica_thermodynamic_states #562

ijpulidos · 2022-03-29T01:14:52Z

Description

Fixes #449 by ensuring that _mix_replicas returns _replica_thermodynamic_states and that the result is broadcast to the MPI context, following the mpiplus broadcasting convention.
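A minimal sketch of why the return-and-broadcast step matters, assuming mixing runs on a single rank. It does not use MPI or mpiplus; ToySampler and run_iteration are hypothetical names, and each list entry stands in for the sampler object held by one MPI rank.

```python
import copy
import random


class ToySampler:
    """Hypothetical stand-in for the sampler object living on one MPI rank."""

    def __init__(self, n_replicas):
        # Replica i starts in thermodynamic state i.
        self._replica_thermodynamic_states = list(range(n_replicas))

    def _mix_replicas(self, rng):
        """Swap the states of two neighboring replicas and return the assignment."""
        states = self._replica_thermodynamic_states
        i = rng.randrange(len(states) - 1)
        states[i], states[i + 1] = states[i + 1], states[i]
        return states


def run_iteration(ranks, rng, broadcast):
    """Mix on rank 0 only; optionally copy the result to every other rank."""
    result = ranks[0]._mix_replicas(rng)
    if broadcast:
        for sampler in ranks[1:]:
            # Simulated broadcast: every rank receives rank 0's assignment.
            sampler._replica_thermodynamic_states = copy.copy(result)


rng = random.Random(0)

# Without the broadcast, only rank 0 sees the swap and the ranks disagree.
buggy = [ToySampler(4) for _ in range(3)]
run_iteration(buggy, rng, broadcast=False)

# With the broadcast, all ranks agree on the replica-to-state assignment.
fixed = [ToySampler(4) for _ in range(3)]
run_iteration(fixed, rng, broadcast=True)
```

In the buggy case the other ranks would go on to propagate replicas using a stale assignment, which is the kind of silent mismatch this PR addresses.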

Todos

Notable points that this PR has either accomplished or will accomplish.

Implement feature / fix bug
Add tests
Update documentation as needed
Update changelog

Status

Ready to go

jchodera

Indeed, this appears to have been the cause of the bug! Thanks for catching this.

At some point in the future, we may want to extend the mpiplus API to make run() more consistent. The inconsistency of

self._do_action_1()
self._result = self._do_action_2()

bugs me a bit.

If we could later figure out how to extend the API of mpiplus so we can just have

self._do_action_1()
self._do_action_2()
self._do_action_3()

that would be optimal.

Alternatively, we could refactor to do

self._result1 = self._do_action_1()
self._result2 = self._do_action_2()
self._result3 = self._do_action_3()

but whichever solution we choose, we would want to make sure the methods either consistently return new values without side effects or cleanly update and distribute state. Perhaps we can open an issue for that?
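A rough sketch of the second option, where every step returns a new value and run() does all the assignment. ToyRunner and its method names are hypothetical and only illustrate the pattern; they are not the sampler's actual methods.

```python
class ToyRunner:
    """Hypothetical runner whose steps return new values without side effects."""

    def __init__(self, states):
        self._states = tuple(states)
        self._energies = ()

    def _compute_energies(self):
        """Return new energies; does not modify self."""
        return tuple(float(s) ** 2 for s in self._states)

    def _mix(self):
        """Return a new state assignment; does not modify self."""
        return tuple(reversed(self._states))

    def run_iteration(self):
        # Every step follows the same "self._x = self._step()" shape, so a
        # missing assignment is easy to spot when reading the method.
        self._energies = self._compute_energies()
        self._states = self._mix()


runner = ToyRunner([0, 1, 2])
proposed = runner._mix()   # calling a step alone leaves the runner unchanged
runner.run_iteration()     # state only changes through the assignments in run
```

Because no step mutates the object, the only place state changes is the assignment in run_iteration, which is also the natural place to broadcast a result.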

jchodera

Can you be sure to update the revision history and mark this as an important bugfix?

We should cut a bugfix release after this.

jchodera · 2022-03-29T04:20:24Z

Alternatively, we could rework self._propagate_replicas() to assume that self._replica_thermodynamic_states is only correct on node 0 and to pass the appropriate state index along with each replica index. I imagine this is what was happening originally, though, and the non-obvious pattern here caused the propagation and mixing implementations to drift apart from each other.
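The alternative described above could look roughly like the following, assuming node 0 builds the task list and tasks are handed out round-robin. The helper names and the round-robin split are illustrative assumptions, not the existing implementation.

```python
def build_tasks(replica_thermodynamic_states):
    """On node 0: pair every replica index with its current state index."""
    return list(enumerate(replica_thermodynamic_states))


def tasks_for_rank(tasks, rank, n_ranks):
    """Hypothetical round-robin split of the tasks across ranks."""
    return tasks[rank::n_ranks]


def propagate(task):
    """A worker needs only the task itself, not its own copy of the state table."""
    replica_id, state_id = task
    return f"replica {replica_id} propagated at state {state_id}"


# Node 0 holds the only trusted assignment (replica index -> state index).
tasks = build_tasks([2, 0, 1, 3])
per_rank = [tasks_for_rank(tasks, rank, n_ranks=2) for rank in range(2)]
messages = [propagate(task) for rank_tasks in per_rank for task in rank_tasks]
```

The trade-off is that every consumer of the assignment has to follow the same convention, which is the non-obvious pattern the comment above warns about.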

zhang-ivy · 2022-03-29T15:49:06Z

Confirming that this PR fixes the mixing problem for 1) ala dipeptide in solvent t-repex and 2) apo barstar h-repex.
@ijpulidos: Have you double-checked whether the existing mixing tests are now passing? I think they may not have been failing as expected before, so we may want to add ala dipeptide in solvent t-repex as a test to catch this problem in the future.

mikemhenry · 2022-03-29T19:20:59Z

Looking at our scheduled CI, it looks like there may be an issue with the Windows OpenMM builds.

mikemhenry · 2022-03-29T19:22:34Z

Now that we have the changelog notes added, do we want to get this merged and a bugfix release cut? This would also get our real-time analysis logging onto conda-forge.

mikemhenry · 2022-03-30T05:01:46Z

@ijpulidos is this ready to merge? It looks good RE what we talked about at our meeting!

Returning and broadcasting _replica_thermodynamic_states

637feea

ijpulidos mentioned this pull request Mar 29, 2022

MPI bug when multiple GPUs are used per calculation #449

Closed

ijpulidos requested a review from jchodera March 29, 2022 01:21

jchodera approved these changes Mar 29, 2022

jchodera requested changes Mar 29, 2022

jchodera added the 🐛 bug label Mar 29, 2022

jchodera added this to the 0.21.3 milestone Mar 29, 2022

jchodera added the ⬆️ high-priority label Mar 29, 2022

ijpulidos added 2 commits March 29, 2022 13:39

Applying the fix to other samplers.

8acac36

Adding release notes changes.

1468d26

mikemhenry requested a review from jchodera March 29, 2022 19:21

Minor changes. Important comments/notes.

7f5f4de

mikemhenry approved these changes Mar 30, 2022

ijpulidos merged commit 51cad19 into main Mar 30, 2022

ijpulidos deleted the fix-mpi-replica-mix branch March 30, 2022 14:22

udlich mentioned this pull request Apr 19, 2022

openmmtools0.21.3 equilibration protocol forgets to broadcast _replica_thermodynamic_states #579

Closed

ijpulidos mentioned this pull request Jun 6, 2022

Release 0.21.4 - bugfix release #590

Merged
