HDDS-10086. Intermittent timeout in TestSafeMode#5945
adoroszlai merged 4 commits into apache:master on Jan 9, 2024
duongkame (Contributor) approved these changes on Jan 9, 2024 and left a comment:
Thanks for the change @adoroszlai. Looks reasonable to me.
adoroszlai (Contributor, Author) replied:
Thanks @duongkame for the review.
What changes were proposed in this pull request?
HDDS-8982 added a new assertion in TestSafeMode and set a timeout of 1 minute for the test case. A recent run hit the following problem.
The initial selectContainer correctly found none.
Pipeline creation then failed, since no datanodes were available.
However, one pipeline was found to be ALLOCATED, so the call waited for that pipeline to be opened.
The problem is that both timeouts are 60 seconds, so the test may be aborted just before getting the expected IOException.
This PR increases the test timeout to 2 minutes. At first I tried to reduce the pipeline report time to avoid the unnecessary wait. That fixed the original issue, but hit another intermittent timeout while shutting down datanodes (which is part of the original test, before the getContainer call).
https://issues.apache.org/jira/browse/HDDS-10086
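The race between the two timeouts can be illustrated with a small standalone sketch. This is not the Ozone code: waitForPipeline and runWithTestTimeout are hypothetical stand-ins, and the durations are scaled down to milliseconds. It only shows why the outer (test) timeout needs headroom over the inner wait for the expected IOException to be observed reliably.

```java
import java.io.IOException;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class TimeoutRaceSketch {

  // Hypothetical stand-in for a call that waits for a pipeline to open
  // and gives up with an IOException once its own (inner) timeout expires.
  static void waitForPipeline(long innerTimeoutMs) throws IOException {
    try {
      Thread.sleep(innerTimeoutMs);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IOException("interrupted while waiting", e);
    }
    throw new IOException("pipeline not opened within " + innerTimeoutMs + " ms");
  }

  // Hypothetical stand-in for a test runner enforcing an outer (test) timeout.
  static String runWithTestTimeout(long innerTimeoutMs, long testTimeoutMs)
      throws InterruptedException {
    ExecutorService executor = Executors.newSingleThreadExecutor();
    try {
      Future<Void> call = executor.submit(() -> {
        waitForPipeline(innerTimeoutMs);
        return null;
      });
      try {
        call.get(testTimeoutMs, TimeUnit.MILLISECONDS);
        return "NO_EXCEPTION";
      } catch (ExecutionException e) {
        // The inner wait finished first and reported its failure.
        return e.getCause() instanceof IOException
            ? "EXPECTED_IOEXCEPTION" : "OTHER_FAILURE";
      } catch (TimeoutException e) {
        // The outer timeout fired before the inner wait gave up.
        return "TEST_TIMED_OUT";
      }
    } finally {
      executor.shutdownNow();
    }
  }

  public static void main(String[] args) throws Exception {
    // Outer timeout with headroom: the expected IOException is observed.
    System.out.println(runWithTestTimeout(100, 2000));
    // Outer timeout shorter than the inner wait: the test is aborted first.
    System.out.println(runWithTestTimeout(2000, 100));
  }
}
```

When both values are equal, as in the 60 second case described above, either branch can win depending on scheduling, which matches the intermittent nature of the failure.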
How was this patch tested?
Passed in 10x20 runs:
https://github.com/adoroszlai/ozone/actions/runs/7447762180