-
Notifications
You must be signed in to change notification settings - Fork 5.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Autoscaler] Check if SSH is available every 5 sec, not 10 #14484
Conversation
@@ -37,6 +37,7 @@ | |||
os.path.dirname(os.path.abspath(__file__)), "kubernetes/kubectl-rsync.sh") | |||
MAX_HOME_RETRIES = 3 | |||
HOME_RETRY_DELAY_S = 5 | |||
SSH_RETRY_DELAY_S = 5 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we define this in https://github.com/ray-project/ray/blob/master/python/ray/autoscaler/_private/constants.py
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
@ijrsvt @AmeerHajAli can you help merge this? |
@yiranwang52 Can you fix lint first?
Running |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
Just to understand:
- this makes the ssh interval a parameter (default to 5)
- the
NODE_START_WAIT_S
is renamed toAUTOSCALER_NODE_START_WAIT_S
.
Correct. |
@yiranwang52 , LINT: |
Why are these changes needed?
Shorten the wait interval between SSH check, so the machine become ready sooner.
Related issue number
Checks
scripts/format.sh
to lint the changes in this PR.