Skip to content

HDDS-4760. Intermittent failure in ozone-ha acceptance test#1886

Merged
adoroszlai merged 2 commits intoapache:masterfrom
adoroszlai:HDDS-4760
Feb 6, 2021
Merged

HDDS-4760. Intermittent failure in ozone-ha acceptance test#1886
adoroszlai merged 2 commits intoapache:masterfrom
adoroszlai:HDDS-4760

Conversation

@adoroszlai
Copy link
Contributor

What changes were proposed in this pull request?

Wait for OM leader election (similar to SCM safemode exit) in acceptance tests, as even 15 retries is not enough in some cases (failover to other OM has no delay).

https://issues.apache.org/jira/browse/HDDS-4760

How was this patch tested?

https://github.com/adoroszlai/hadoop-ozone/actions/runs/531562574

2021-02-02T18:34:29.2792765Z Safe mode is off
2021-02-02T18:34:40.9042608Z Found OM leader for service omservice: om2 : LEADER (om2)
...
2021-02-02T19:14:59.0717583Z Safe mode is off
2021-02-02T19:15:03.7196532Z Found OM leader for service id1: om1 : LEADER (om1)

@adoroszlai adoroszlai self-assigned this Feb 2, 2021
Copy link
Contributor

@sodonnel sodonnel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Hopefully this will work better than a set number of retries.

@adoroszlai adoroszlai merged commit e712a1f into apache:master Feb 6, 2021
@adoroszlai adoroszlai deleted the HDDS-4760 branch February 6, 2021 07:47
@adoroszlai
Copy link
Contributor Author

Thanks @sodonnel for the review.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants