-
Notifications
You must be signed in to change notification settings - Fork 3.4k
HBASE-25334 TestRSGroupsFallback.testFallback is flaky #2728
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@Apache9 I'm sorry, could you test this patch? |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
@sunhelly Yes, that is exactly the cause, in HBASE-25282, i use the ServerCrashProcedure to judge whether the Region Server is in processing, Sorry to introduce such a problem. |
@yuqi1129 Please help to check if this patch can fix the problem, thanks. |
|
@sunhelly Sorry for late reply, your PR seems OK to me,LGTM |
|
Tried locally, TestRSGroupsFallback still failed. Could you two please confirm it again? And mind explaining a bit why HBASE-25282 could make this test fail? What's the root cause? Thanks. |
@Apache9 @yuqi1129 I looked through the patch of HBASE-25282 and found a little clue. I have commented at the PR, PR of HBASE-25282 . |
|
Oh, the method |
|
I'm so sorry, didn't merge the master branch, so the UT is always passing. @Apache9 |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
Apache9
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So this is the root cause? I suggest that we still use getProcedures but first filter out the finished procedures before checking whehter it is SCP.
| */ | ||
| public boolean areDeadServersInProgress() throws IOException { | ||
| return master.getProcedures().stream().anyMatch(p -> p instanceof ServerCrashProcedure); | ||
| return master.getMasterProcedureExecutor().getActiveProceduresNoCopy().stream() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The getActiveProceduresNoCopy is only expected to be called when initialization, where we can make sure there is no race.
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
Signed-off-by: Duo Zhang <zhangduo@apache.org> Change-Id: I2f78f6108cd4c3049b3df797d9f6aba4d81b5da8
…allback.testFallback is flaky" into branch-2 and branch-2.6 (#7472) #7476 (#7476) The change in HBASE-25334 (introduced by commit 32c4432 in branch-2 and branch-2.6) has diverged between the master (#2728) and branch-2 branches. This is a following up changes after HBASE-29720 and have the complete functional changes of HBASE-25334 that uses SCP as the source of dead servers in progress. Signed-off-by: Istvan Toth <stoty@apache.org> Co-authored-by: Kevin Geiszler <kevin.j.geiszler@gmail.com>
No description provided.