Skip to content

HDDS-15177. acceptance test race condition in generating keytab and service start.#10194

Merged
adoroszlai merged 1 commit intoapache:masterfrom
ss77892:HDDS-15177
May 8, 2026
Merged

HDDS-15177. acceptance test race condition in generating keytab and service start.#10194
adoroszlai merged 1 commit intoapache:masterfrom
ss77892:HDDS-15177

Conversation

@ss77892
Copy link
Copy Markdown
Contributor

@ss77892 ss77892 commented May 5, 2026

What changes were proposed in this pull request?

HDDS-15177. acceptance test race condition in generating keytab and service start.

Fix intermittent xcompat failures where the new-cluster Compose stack could start SCM and other Kerberos-backed services before init-kdc finished exporting keytabs and starting the KDC. Add the same KDC healthcheck used on the old cluster (verify scm.keytab exists and port 88 accepts connections) and depend on the KDC being healthy.

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-15177

How was this patch tested?

CI + local tests.

Copy link
Copy Markdown
Contributor

@adoroszlai adoroszlai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @ss77892 for the patch.

Comment thread hadoop-ozone/dist/src/main/compose/xcompat/new-cluster.yaml Outdated
Comment thread hadoop-ozone/dist/src/main/compose/xcompat/new-cluster.yaml Outdated
@adoroszlai
Copy link
Copy Markdown
Contributor

adoroszlai commented May 6, 2026

Example for failure: https://github.com/apache/ozone/actions/runs/25426543750/job/74600076227

I think other secure clusters need the same fix (only upgrade has it currently). xcompat-old is special only because it starts multiple clusters (one per Ozone version), increasing the probability of hitting the race condition.

@ss77892
Copy link
Copy Markdown
Contributor Author

ss77892 commented May 8, 2026

@adoroszlai Thank you for your review and suggestions. I've updated the PR.

Copy link
Copy Markdown
Contributor

@adoroszlai adoroszlai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @ss77892 for updating the patch.

@adoroszlai adoroszlai merged commit d33f994 into apache:master May 8, 2026
35 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants