-
Notifications
You must be signed in to change notification settings - Fork 558
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix flaky MultiPartitionDeploymentLifecycleTest #10065
Conversation
Test was flaky because it was asserted that the COMPLETE commands and COMPLETED events were written in order. This is not always as the case. The COMPLETE commands are a result of partition 2 and 3 sending the command to partition 1. There is no guarantee that partition 1 will first receive both commands before it starts processing them. As long as we assert that we receive the COMPLETE command of partition X before the COMPLETED event of partition X the test should be accurate without being flaky.
Test Results 847 files + 1 847 suites +1 1h 41m 19s ⏱️ + 2m 41s For more details on these failures, see this check. Results for commit 907b0e4. ± Comparison against base commit 403c25a. ♻️ This comment has been updated with latest results. |
@pihme Just pinging you to make sure you've seen this PR |
Thanks for the reminder. I wasn't aware of it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great improvement, but I think it can be improved further
...ava/io/camunda/zeebe/engine/processing/deployment/MultiPartitionDeploymentLifecycleTest.java
Outdated
Show resolved
Hide resolved
...ava/io/camunda/zeebe/engine/processing/deployment/MultiPartitionDeploymentLifecycleTest.java
Outdated
Show resolved
Hide resolved
...ava/io/camunda/zeebe/engine/processing/deployment/MultiPartitionDeploymentLifecycleTest.java
Outdated
Show resolved
Hide resolved
Oh, and maybe we want to backport this |
@pihme I have removed all the hardcoded indexes, please have another look! |
4d6beed
to
907b0e4
Compare
We can backport them. The test wasn't flaky in previous versions though, as it was introduced with the new inter partition command sending. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm 🎉
...ava/io/camunda/zeebe/engine/processing/deployment/MultiPartitionDeploymentLifecycleTest.java
Outdated
Show resolved
Hide resolved
907b0e4
to
369e5a1
Compare
bors merge |
Build succeeded: |
Backport failed for Please cherry-pick the changes locally. git fetch origin stable/1.3
git worktree add -d .worktree/backport-10065-to-stable/1.3 origin/stable/1.3
cd .worktree/backport-10065-to-stable/1.3
git checkout -b backport-10065-to-stable/1.3
ancref=$(git merge-base 396f1854c7da6ae6866897200af34b797379b85d 369e5a1edbcdcf08cca41b464faa7aa3602ce72b)
git cherry-pick -x $ancref..369e5a1edbcdcf08cca41b464faa7aa3602ce72b |
Successfully created backport PR #10091 for |
Description
Test was flaky because it was asserted that the COMPLETE commands and COMPLETED events were written in order.
This is not always as the case. The COMPLETE commands are a result of partition 2 and 3 sending the command to partition 1. There is no guarantee that partition 1 will first receive both commands before it starts processing them.
As long as we assert that we receive the COMPLETE command of partition X before the COMPLETED event of partition X the test should be accurate without being flaky.
Related issues
closes #9964
Definition of Done
Not all items need to be done depending on the issue and the pull request.
Code changes:
backport stable/1.3
) to the PR, in case that fails you need to create backports manually.Testing:
Documentation:
Please refer to our review guidelines.