test: ensure no leader before restarting outdated follower #18203

deepthidevaki · 2024-05-03T07:05:49Z

Description

The test was flaky because we were looking for a leader among m1 and m2. But it should be m1 and m2Restarted. In the flaky case, m2Restarted was the leader. This was not the intention of the test, because node 2 must be outdated at this point and cannot become the leader. But in this case, the shutdowns and restarts were so fast that leader m1 didn't get a chance to step down. So when m2 was restarted, it immediately got the latest log entry and was able to become the leader immediately after the force configure is completed. To prevent this case, we now wait until there is no leader, before restarting node 2.

Related issues

closes #17670

lenaschoenburg

Nice fix 👍

The test was flaky because we were looking for a leader among m1 and m2. But it should be m1 and m2Restarted. In the flaky case, m2Restarted was the leader. This was not the intention of the test, because node 2 must be outdated at this point and cannot become the leader. But in this case, the shutdowns and restarts were so fast that leader m1 didn't get a chance to step down. So when m2 was restarted, it immediatly got the latest log entry and was able to become the leader immediately after the force configure is completed. To prevent this case, we now wait until there is no leader, before restarting node 2.

deepthidevaki · 2024-05-06T07:17:57Z

PR is blocked at step "SDK test summary" which doesn't exist in this branch. Rebasing on main.

backport-action · 2024-05-06T08:32:39Z

Successfully created backport PR for stable/8.5:

[Backport stable/8.5] test: ensure no leader before restarting outdated follower #18259

@deepthidevaki

…ed follower (#18259) # Description Backport of #18203 to `stable/8.5`. relates to #17670 original author: @deepthidevaki

deepthidevaki added the backport stable/8.5 Backport a pull request to stable/8.5 label May 3, 2024

github-actions bot added the component/zeebe Related to the Zeebe component/team label May 3, 2024

deepthidevaki requested a review from lenaschoenburg May 3, 2024 09:19

lenaschoenburg approved these changes May 6, 2024

View reviewed changes

lenaschoenburg enabled auto-merge May 6, 2024 07:12

deepthidevaki force-pushed the dd-17670-flaky-reconfigure branch from 7da955e to 86b6d8c Compare May 6, 2024 07:18

lenaschoenburg added this pull request to the merge queue May 6, 2024

Merged via the queue into main with commit d471acd May 6, 2024
39 checks passed

lenaschoenburg deleted the dd-17670-flaky-reconfigure branch May 6, 2024 08:32

backport-action mentioned this pull request May 6, 2024

[Backport stable/8.5] test: ensure no leader before restarting outdated follower #18259

Merged

github-merge-queue bot pushed a commit that referenced this pull request May 6, 2024

[Backport stable/8.5] test: ensure no leader before restarting outdat…

3e962f8

…ed follower (#18259) # Description Backport of #18203 to `stable/8.5`. relates to #17670 original author: @deepthidevaki

Zelldon added the version:8.5.1 Marks an issue as being completely or in parts released in 8.5.1 label May 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test: ensure no leader before restarting outdated follower #18203

test: ensure no leader before restarting outdated follower #18203

deepthidevaki commented May 3, 2024

lenaschoenburg left a comment

deepthidevaki commented May 6, 2024

backport-action commented May 6, 2024

test: ensure no leader before restarting outdated follower #18203

test: ensure no leader before restarting outdated follower #18203

Conversation

deepthidevaki commented May 3, 2024

Description

Related issues

lenaschoenburg left a comment

Choose a reason for hiding this comment

deepthidevaki commented May 6, 2024

backport-action commented May 6, 2024