[Core] Better handling of redis errors #33744
Comments
For 2.4, we should audit all Redis calls and retry on REDIS_REPLY_ERROR.
Closing as a duplicate of #32541.
Are they the same?
@iycheng these issues are not the same (but somewhat related). This one is about how the GCS handles errors and status codes in all requests to Redis. The other is about surfacing a better error to the user when we fail to connect on startup.
The printed out error message is
@iycheng thanks for the leader election fix! I think we should still leave this issue open and prioritize improving how we handle reply codes. Without any form of retries or graceful handling of error codes, the system is unstable. What do you think?
What happened + What you expected to happen
When we use Ray with an external Redis, Redis sometimes returns REDIS_REPLY_ERROR and we do not handle it properly.
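To make the expected behavior concrete, here is a minimal sketch of retrying on REDIS_REPLY_ERROR with exponential backoff. It is not Ray's GCS code (which is C++); `Reply`, `RedisReplyError`, and `call_with_retry` are hypothetical names used only for illustration, and the retry count and delays are assumptions. The numeric reply type codes are the ones hiredis defines.

```python
import time
from dataclasses import dataclass
from typing import Callable

# Reply type codes as defined by hiredis.
REDIS_REPLY_STRING = 1
REDIS_REPLY_ERROR = 6


@dataclass
class Reply:
    """Hypothetical stand-in for a hiredis redisReply: a type code plus payload."""
    type: int
    value: str


class RedisReplyError(RuntimeError):
    """Hypothetical error raised once all retries are exhausted."""


def call_with_retry(
    send: Callable[[], Reply],
    max_attempts: int = 3,          # assumed default, not taken from Ray
    base_delay_s: float = 0.1,      # assumed default, not taken from Ray
    sleep: Callable[[float], None] = time.sleep,
) -> Reply:
    """Call send() until it returns a non-error reply or attempts run out."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error = ""
    for attempt in range(max_attempts):
        reply = send()
        if reply.type != REDIS_REPLY_ERROR:
            return reply
        last_error = reply.value
        # Back off before the next attempt; skip the sleep after the last one.
        if attempt + 1 < max_attempts:
            sleep(base_delay_s * 2 ** attempt)
    raise RedisReplyError(
        f"Redis returned an error after {max_attempts} attempts: {last_error}"
    )


if __name__ == "__main__":
    # Simulate one error reply followed by a successful reply.
    replies = iter([
        Reply(REDIS_REPLY_ERROR, "LOADING"),
        Reply(REDIS_REPLY_STRING, "ok"),
    ])
    print(call_with_retry(lambda: next(replies), sleep=lambda _: None).value)
```

Note that this sketch retries every error reply. Whether a given error is actually safe to retry likely depends on the command and the error text, so a real audit would need to decide that per call site.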
Versions / Dependencies
master
Reproduction script
#33733
Issue Severity
None