Fix first participant method in compositional cplscheme #1307

uekerman · 2022-05-27T08:38:54Z

Main changes of this PR

I tried to reproduce the problem a user ran into when

coupling three solvers A, B, C
with two serial-explicit coupling schemes A<-->B and B<-->C
and prescribing the time window size for B and C by A.

Reported here: https://precice.discourse.group/t/variable-time-steps-without-sub-cycling-in-serial-explicit-coupling/1049/5

I indeed ran into a problem when CompositionalCouplingScheme tried to merge different time stepping methods and fixed it.

Afterwards, however, I ran into another problem. "first-participant" does not have great integration test coverage yet :/
I reproduced the problem with the integration test ReadWriteScalarDataFirstParticipant.

The following line is problematic for the first participant of a serial-explicit coupling scheme with the time window size set by "first-participant", because in that case the coupling scheme does not define a time window size.

precice/src/precice/impl/SolverInterfaceImpl.cpp

Line 1699 in 780d4f1

    
    double timeStepStart = _couplingScheme->getTimeWindowSize() - _couplingScheme->getThisTimeWindowRemainder();

We need a similar treatment as here:

precice/src/precice/impl/SolverInterfaceImpl.cpp

Lines 431 to 435 in 780d4f1

    
    if (_couplingScheme->hasTimeWindowSize()) {
      timeWindowSize = _couplingScheme->getTimeWindowSize();
    } else {
      timeWindowSize = computedTimestepLength;
    }

and then probably store the time window size in SolverInterfaceImpl.
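A minimal sketch of what such a treatment could look like for the time step start. `MockCouplingScheme` only mirrors the three accessors quoted above, and the helper `timeStepStart` with its `fallbackWindowSize` argument is a hypothetical name for illustration, not the code that ended up in preCICE:

```cpp
#include <cassert>

// Stand-in for the coupling scheme; it only mirrors the accessors quoted above.
struct MockCouplingScheme {
  bool   hasSize;   // false for the first participant with "first-participant"
  double size;      // only meaningful if hasSize is true
  double remainder; // remaining part of the current time window

  bool hasTimeWindowSize() const { return hasSize; }

  double getTimeWindowSize() const
  {
    assert(hasSize); // mimics the failure when no time window size is defined
    return size;
  }

  double getThisTimeWindowRemainder() const { return remainder; }
};

// Hypothetical helper: never asks the scheme for a time window size it does not have.
// fallbackWindowSize is assumed to be the size provided by the solver and stored by the caller.
double timeStepStart(const MockCouplingScheme &scheme, double fallbackWindowSize)
{
  const double timeWindowSize = scheme.hasTimeWindowSize()
                                    ? scheme.getTimeWindowSize()
                                    : fallbackWindowSize;
  return timeWindowSize - scheme.getThisTimeWindowRemainder();
}
```

With a scheme that defines a window size of 1.0 and a remainder of 0.25, the helper returns 0.75. With a scheme that defines no window size, it falls back to the stored value instead of tripping the assertion.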

@BenjaminRodenberg Could I hand over to you here?

Author's checklist

I checked that this actually solves the user's problem.
I added a changelog file with make changelog if there are user-observable changes since the last release.
I ran make format to ensure everything is formatted correctly.
I stuck to C++14 features.
I stuck to CMake version 3.16.3.
I squashed / am about to squash all commits that should be seen as one.

Reviewers' checklist

Does the changelog entry make sense? Is it formatted correctly?
Do you understand the code changes?

precice-bot · 2022-05-27T08:42:03Z

This pull request has been mentioned on preCICE Forum on Discourse. There might be relevant details there:

https://precice.discourse.group/t/variable-time-steps-without-sub-cycling-in-serial-explicit-coupling/1049/6

uekerman · 2022-06-07T07:36:24Z

src/cplscheme/BaseCouplingScheme.cpp

+bool BaseCouplingScheme::solverSetsTimeWindowSize() const
+{
+  PRECICE_ASSERT(hasTimeWindowSize());
+  return false;
+}


Isn't this function a copy of hasTimeWindowSize?
I agree that the name solverSetsTimeWindowSize() is much clearer.

> Isn't this function a copy of hasTimeWindowSize?

From the perspective of functionality: Yes. But I think it clearly improves readability to have two functions here. From the outside, it is not obvious that hasTimeWindowSize() == !solverSetsTimeWindowSize().

If we remove hasTimeWindowSize, we would end up in some situations with something like this:

    double BaseCouplingScheme::getThisTimeWindowRemainder() const
    {
      PRECICE_TRACE();
      double remainder = 0.0;
      if (!solverSetsTimeWindowSize()) {
        remainder = getNextTimestepMaxLength();
      }
      PRECICE_DEBUG("return {}", remainder);
      return remainder;
    }

instead of

    double BaseCouplingScheme::getThisTimeWindowRemainder() const
    {
      PRECICE_TRACE();
      double remainder = 0.0;
      if (hasTimeWindowSize()) {
        remainder = getNextTimestepMaxLength();
      }
      PRECICE_DEBUG("return {}", remainder);
      return remainder;
    }

So I would like to keep both functions for the sake of readability.

Mmh, I actually find the first version easier to read. It gives additional information on what happens when.
In the end, hasTimeWindowSize is hard to understand: what does "has a time window size" mean?

BenjaminRodenberg

I decided to remove the method solverSetsTimeWindowSize. I'm still not happy with how hasTimeWindowSize encodes two different pieces of information ((1) Is there a time window size available? (2) Does this participant set the time window size?), but I don't think that the solution with solverSetsTimeWindowSize really helps with respect to compositional coupling schemes. I would keep it as it is. If you think there is still a need for refactoring this part, we can put it into an issue, but the original bug should be fixed now and, with respect to refactoring, I'm running out of ideas and time here.

precice-bot · 2022-06-14T14:22:45Z

This pull request has been mentioned on preCICE Forum on Discourse. There might be relevant details there:

https://precice.discourse.group/t/variable-time-steps-without-sub-cycling-in-serial-explicit-coupling/1049/10

uekerman

Looks good to me

uekerman · 2022-06-15T08:35:24Z

Waiting to merge until we have resolved the issue on Discourse, but I am confident that we don't need to change anything here.

uekerman · 2022-06-24T09:34:48Z

We have another bug in preCICE, I just added an integration test to reproduce.

precice/src/cplscheme/SerialCouplingScheme.cpp

Lines 60 to 74 in cd1b3a4

    
    void SerialCouplingScheme::initializeImplementation()
    {
      // determine whether initial data needs to be communicated
      determineInitialSend(getSendData());
      determineInitialReceive(getReceiveData());
      // If the second participant initializes data, the first receive for the
      // second participant is done in initializeData() instead of initialize().
      if (not doesFirstStep() && not sendsInitializedData() && isCouplingOngoing()) {
        PRECICE_DEBUG("Receiving data");
        receiveAndSetTimeWindowSize();
        receiveData(getM2N(), getReceiveData());
        checkDataHasBeenReceived();
      }
    }

If dt-method = "first participant": The second participant always needs to receive the time window size in initialize already. This cannot wait till initializeData as initialize needs to return this value to the user already.
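The requirement stated above can be sketched as a small decision function. This is only an illustration under assumptions: `SchemeState`, its flag `usesFirstParticipantMethod` and the function `actionsInInitialize` are hypothetical names, and the returned strings merely name the calls from the quoted snippet. It does not reproduce any commit of this PR.

```cpp
#include <string>
#include <vector>

// Hypothetical snapshot of the state queried in initializeImplementation().
struct SchemeState {
  bool doesFirstStep;
  bool sendsInitializedData;
  bool couplingOngoing;
  bool usesFirstParticipantMethod; // assumed flag: dt is prescribed by the first participant
};

// Lists, in order, the calls the participant would make in initialize().
std::vector<std::string> actionsInInitialize(const SchemeState &s)
{
  std::vector<std::string> actions;
  if (s.doesFirstStep || !s.couplingOngoing) {
    return actions; // the first participant receives nothing here
  }
  // The time window size is needed in initialize() whenever the first participant
  // prescribes it, independent of whether the data receive is postponed.
  if (s.usesFirstParticipantMethod || !s.sendsInitializedData) {
    actions.push_back("receiveAndSetTimeWindowSize");
  }
  // The data receive is still postponed to initializeData() if this participant
  // sends initialized data.
  if (!s.sendsInitializedData) {
    actions.push_back("receiveData");
    actions.push_back("checkDataHasBeenReceived");
  }
  return actions;
}
```

For a second participant that sends initialized data and uses the first-participant method, the sketch returns only `receiveAndSetTimeWindowSize`. Without initialized data, it returns the same three calls as the quoted snippet.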

@BenjaminRodenberg handing back to you 😁

BenjaminRodenberg · 2022-06-28T09:30:35Z

tests/serial/time/explicit/serial-coupling/ReadWriteScalarDataFirstParticipantInitData.cpp

+
+BOOST_AUTO_TEST_SUITE(Integration)
+BOOST_AUTO_TEST_SUITE(Serial)
+BOOST_AUTO_TEST_SUITE(Time)


I don't think that this test (maybe also the other new tests) fits very well into Integration/Serial/Time. Maybe Integration/Serial/InitializeData ? Or should we create a dedicated test suite for the first-participant configuration?

In a way, that's an inherent problem / feature of integration tests; they always test multiple components simultaneously. Here it is "first-participant" + "data initialization". For me, "first-participant" is the dominating feature here. That's why I added it to time. It somehow fits nicely as it only applies to serial coupling.

BenjaminRodenberg · 2022-06-28T12:08:38Z

> We have another bug in preCICE, I just added an integration test to reproduce.
>
> precice/src/cplscheme/SerialCouplingScheme.cpp
>
> Lines 60 to 74 in cd1b3a4
>
>     void SerialCouplingScheme::initializeImplementation()
>     {
>       // determine whether initial data needs to be communicated
>       determineInitialSend(getSendData());
>       determineInitialReceive(getReceiveData());
>       // If the second participant initializes data, the first receive for the
>       // second participant is done in initializeData() instead of initialize().
>       if (not doesFirstStep() && not sendsInitializedData() && isCouplingOngoing()) {
>         PRECICE_DEBUG("Receiving data");
>         receiveAndSetTimeWindowSize();
>         receiveData(getM2N(), getReceiveData());
>         checkDataHasBeenReceived();
>       }
>     }
>
> If dt-method = "first participant": The second participant always needs to receive the time window size in initialize already. This cannot wait till initializeData as initialize needs to return this value to the user already.
>
> @BenjaminRodenberg handing back to you 😁

I tried to fix this, but we might have a deadlock here:

The first participant sends the timestep size at the end of advance (it does not know about the timestep size provided by the user before the user defines it via advance). But to reach advance, the first participant must call initializeData and receive initial data from the second participant here. Here comes the deadlock: The second participant does not reach initializeData to send initial data, because it is waiting for the timestep size in initialize.

#1196 might help us to resolve this deadlock:

* If we merge initialize and initializeData, we have more possibilities to move around communication.
* If initializeData is mandatory, initializeData could be used to return dt, not initialize.

But with our current API design and order of API calls I do not really see a solution. To me it looks like initializeData + first participant has to be forbidden.
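The circular wait described above can be reproduced with a toy model. The sketch below is not preCICE code: `Op`, `Program`, `runsToCompletion` and the four program builders are hypothetical, sends are assumed to be buffered, and receives are assumed to block until the matching message has arrived.

```cpp
#include <cstddef>
#include <set>
#include <string>
#include <vector>

// One communication step of a participant.
struct Op {
  bool        isSend; // true: buffered send, false: blocking receive
  std::string message;
};

using Program = std::vector<Op>;

// Returns true if both participants finish, false if both end up stuck in a receive.
bool runsToCompletion(const Program &first, const Program &second)
{
  const Program *programs[2] = {&first, &second};
  std::size_t    pc[2]       = {0, 0};   // next operation of each participant
  std::multiset<std::string> inbox[2];   // messages delivered to each participant
  bool progress = true;
  while (progress) {
    progress = false;
    for (int i = 0; i < 2; ++i) {
      const Program &program = *programs[i];
      if (pc[i] == program.size()) {
        continue; // this participant is done
      }
      const Op &op = program[pc[i]];
      if (op.isSend) {
        inbox[1 - i].insert(op.message);
        ++pc[i];
        progress = true;
      } else {
        auto it = inbox[i].find(op.message);
        if (it != inbox[i].end()) {
          inbox[i].erase(it);
          ++pc[i];
          progress = true;
        } // otherwise the participant stays blocked in this receive
      }
    }
  }
  return pc[0] == first.size() && pc[1] == second.size();
}

// First participant: initializeData() receives initial data, advance() sends dt.
Program firstWithInitialData()
{
  return {{false, "initial data"}, {true, "time window size"}};
}

// Second participant: initialize() waits for dt, initializeData() sends initial data.
Program secondWithInitialData()
{
  return {{false, "time window size"}, {true, "initial data"}};
}

// Same scheme without data initialization, for comparison.
Program firstWithoutInitialData()
{
  return {{true, "time window size"}, {true, "data"}};
}

Program secondWithoutInitialData()
{
  return {{false, "time window size"}, {false, "data"}};
}
```

In this model, `runsToCompletion(firstWithInitialData(), secondWithInitialData())` returns false because each participant waits for a message the other one would only send later, while the pair without data initialization returns true.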

uekerman · 2022-06-28T12:24:59Z

You're right 🙈

BenjaminRodenberg · 2022-06-28T13:58:23Z

I just pushed 62e3417. This is from my current point of view as close as we can get to a solution. The deadlock-problem still exists, therefore I had to modify the test.

Depending on how we resolve the deadlock situation, we can then keep 4431577 and 62e3417 here or cherry-pick them into a feature branch.

Fix first participant method in compositional cplscheme

932766e

github-actions bot assigned uekerman May 27, 2022

uekerman added the bug preCICE does not behave the way we want and we should look into it (and fix it if possible) label May 27, 2022

uekerman added this to the Version 2.x.x milestone May 27, 2022

uekerman added 2 commits May 27, 2022 10:44

Format test config

8d12af1

Add failing test for first-participant serial explicit cpl

e6865a2

fsimonis assigned BenjaminRodenberg Jun 1, 2022

BenjaminRodenberg added 5 commits June 2, 2022 15:42

Refactoring for first participant method.

2d2bb0d

Add dedicated function for checking whether coupling scheme uses first participant method.

ba85ce4

Add proper treatment of waveforms, if participant first method is used.

1bdcf69

Read at end of window for consistent treatement with other non-waveform cases.

15ecc76

Add test for first participant and implicit coupling.

69e5faa

uekerman commented Jun 7, 2022

src/cplscheme/CompositionalCouplingScheme.cpp (outdated, resolved)

uekerman commented Jun 7, 2022

src/cplscheme/SerialCouplingScheme.cpp (outdated, resolved)

BenjaminRodenberg and others added 3 commits June 9, 2022 08:49

Move debug statement.

1318de1

Update src/cplscheme/CompositionalCouplingScheme.cpp

77d733c

Co-authored-by: Benjamin Uekermann <benjamin.uekermann@gmail.com>

Revert "Update src/cplscheme/CompositionalCouplingScheme.cpp"

e0f9306

This reverts commit 77d733c.

BenjaminRodenberg reviewed Jun 13, 2022

tests/serial/three-solvers/ThreeSolversFirstParticipant.xml (resolved)

BenjaminRodenberg added 3 commits June 13, 2022 14:49

Add assertions and fix wrong assumptions.

b061aa6

Remove method solverSetsTimeWindowSize.

8ed5875

Remove dead code.

b572621

BenjaminRodenberg reviewed Jun 13, 2022

uekerman commented Jun 14, 2022

Add another failing test

4431577

BenjaminRodenberg reviewed Jun 28, 2022

Fix implementation, if initializeData is used

62e3417

* Fixes implementation
* But results in a deadlock
* Modify test correspondingly

Revert "Fix implementation, if initializeData is used"

a9ce3ec

This reverts commit 62e3417.

uekerman force-pushed the fix-three-solver-first-participant branch from 1a42930 to a9ce3ec June 30, 2022 10:44

uekerman added 2 commits June 30, 2022 12:51

Disable initData + frist-participant integration test

9acabc0

Fix formatting

e52f2bf

uekerman mentioned this pull request Jun 30, 2022

Fix and enable first-participant timestepping combined with communication of initial data #1347

Closed

uekerman added 3 commits June 30, 2022 17:47

Delete first-participant init-data tests

708b169

Add changelog [ci skip]

7857d13

Reomve test from tests.cmake

e2051a0

uekerman merged commit c800c66 into precice:develop Jul 1, 2022

uekerman mentioned this pull request Jul 1, 2022

Add failing test to reproduce first-participant + initData problem #1349

Merged

7 tasks

uekerman modified the milestones: Version 2.x.x, Version 2.5.0 Jul 27, 2022
