Skip to content

Conversation

@scaliby
Copy link
Member

@scaliby scaliby commented Nov 12, 2025

Description

Existing error code check seems to be too narrow, as communication to the cluster can fail with broad amount of errors that can be fixed by removing the --dns-endpoint flag. I modified the logic so it always retries without --dns-endpoint when listing pods fails.

Issue

b/460038063

Testing

Tested manually and added unit tests.

@scaliby scaliby marked this pull request as draft November 12, 2025 16:30
@scaliby scaliby marked this pull request as ready for review November 12, 2025 16:58
@scaliby scaliby force-pushed the credentials-retrieval-fix branch from f9d815d to f1b0d48 Compare November 12, 2025 17:59
Copy link
Collaborator

@jamOne- jamOne- left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

left some nits. Thanks for the fix

@scaliby scaliby requested a review from jamOne- November 13, 2025 08:23
@scaliby scaliby force-pushed the credentials-retrieval-fix branch from 82cfce5 to dfe6c2a Compare November 13, 2025 08:42
@scaliby scaliby enabled auto-merge (squash) November 13, 2025 08:42
@scaliby scaliby merged commit 844261f into AI-Hypercomputer:main Nov 13, 2025
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants