-
Notifications
You must be signed in to change notification settings - Fork 526
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: fix epoch check panic when checkpoint #2005
Conversation
The reason why starting cluster using |
Codecov Report
@@ Coverage Diff @@
## main #2005 +/- ##
==========================================
- Coverage 70.79% 70.79% -0.01%
==========================================
Files 627 627
Lines 80775 80788 +13
==========================================
+ Hits 57188 57190 +2
- Misses 23587 23598 +11
Flags with carried forward coverage won't be shown. Click here to find out more.
📣 Codecov can now indicate which changes are the most critical in Pull Requests. Learn more |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we also remove the check and assert here?
Yes, NTFS. |
Emmm, we may have to keep this check. This check is applied to each node, we may have some nodes that do not contain any actors in the future but others do. |
What's changed and what's your intention?
After investigation the bug mentioned in #1995 and reported in slack group, this bug is caused by injecting a barrier immediately after a quick checkpoint when there's no actors exist in the cluster. This means two equal epoch might generated and fail the epoch check in this situation.
After this PR merged, we should introduce or develop a monotonic clock lib for epoch generation.
Checklist
Refer to a related PR or issue link (optional)
Resolve #1995