New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Retry socket connections #266
Comments
This is probably the cause of the error in
The test still passes even though one of the workers failed to connect. Also, when we run |
A common pattern for a Ray process is to connect to some socket(s). In these cases, we often try once and exit violently if the initial connection is unsuccessful. However, these failures to connect may be transient (for instance, if the
listen
buffer on the server-side is full duringphoton_connect
). In these cases, we should retry the connection, with some backoff.The text was updated successfully, but these errors were encountered: