HDDS-5606. Intermittent failure in TestBlockDeletion#testContainerStatisticsAfterDelete#2528
Conversation
adoroszlai
left a comment
There was a problem hiding this comment.
Thanks @ChenSammi for working on this.
- Can you please explain the problem and the solution?
- Does this also address HDDS-5605?
- Do we have any data on repeated run success rate?
The problem is a container state change from DELETEING to DELETED check failure. The change will happen without questoin. The question is the timing. So the fix is the addition of a retry of the state check action.
This doesn't address HDDS-5605. HDDS-5605 is still under investigation.
I have run the CPI three times, two times succeed, the third fails with other UT failure, such as TestRootedOzoneFileSystemWithFSO. It seems there are several tests which fails randomly, and need to be fixed too. |
adoroszlai
left a comment
There was a problem hiding this comment.
Thanks @ChenSammi for the fix. It seems to work fine, passed 40x:
|
This only changes and the only failure is in Therefore I'm merging this without retriggering CI. |
|
Thanks @adoroszlai for the code review. |
https://issues.apache.org/jira/browse/HDDS-5606