Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Increase oracle client timeout from 5 to 60 minutes. #991

Merged
merged 1 commit into from Mar 4, 2024

Conversation

onponomarev
Copy link
Contributor

The timeout is so high to prevent a rare race condition from happening. We need clients to wait till chief oracle server starts. This normally takes a few minutes, but sometimes might take longer.
See #990 for more details. Initially we didn't have any timeout. It was introduced to avoid tuner jobs hanging forever if chief oracle stops responding.
See #957.

The timeout is so high to prevent a rare race condition from happening.
We need clients to wait till chief oracle server starts. This normally takes
a few minutes, but sometimes might take longer.
See keras-team#990 for more details.
Initially we didn't have any timeout. It was introduced to avoid tuner jobs
hanging forever if chief oracle stops responding.
See keras-team#957.
Copy link
Collaborator

@haifeng-jin haifeng-jin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR!

Copy link

codecov bot commented Mar 4, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.50%. Comparing base (7430e6c) to head (c21e192).

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #991      +/-   ##
==========================================
- Coverage   98.37%   97.50%   -0.88%     
==========================================
  Files          50       50              
  Lines        3085     3085              
==========================================
- Hits         3035     3008      -27     
- Misses         50       77      +27     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@haifeng-jin haifeng-jin changed the base branch from master to temp_fix March 4, 2024 18:21
@haifeng-jin haifeng-jin merged commit d3c23b9 into keras-team:temp_fix Mar 4, 2024
5 of 10 checks passed
@haifeng-jin
Copy link
Collaborator

A few more changes needed. I will take over.
Merging it into temp_fix branch to do the fixes.
I will then merge temp_fix into master and do the release.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants