Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Index Management/ML] Fix broken retries on inference id creation #186961

Merged
merged 7 commits into from
Jul 2, 2024

Conversation

sphilipse
Copy link
Member

@sphilipse sphilipse commented Jun 26, 2024

Summary

This fixes the inference endpoint creation API being called multiple times on error. The call will often time out because downloading/deploying the model takes longer than the Kibana request timeout limit. Setting the timeout limit higher would still be fragile, so ignoring the timeout error makes more sense.

This PR also contains a few small language fixes and variable renames for clarity.

@sphilipse sphilipse added release_note:skip Skip the PR/issue when compiling release notes Team:Search v8.15.0 labels Jun 26, 2024
@sphilipse sphilipse requested review from a team as code owners June 26, 2024 09:57
@yuliacech yuliacech self-requested a review June 27, 2024 08:52
Copy link
Contributor

@yuliacech yuliacech left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hey @sphilipse, I've tested locally and the timeout error seems to break the logic to save mappings with semantic text: on a new deployment, the request fails and it's not possible to refresh.
Screenshot 2024-06-27 at 11 13 36

@yuliacech
Copy link
Contributor

Adding a screen recording as well

Screen.Recording.2024-06-27.at.11.22.45.mov

Copy link
Contributor

@darnautov darnautov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ML changes LGTM

@sphilipse
Copy link
Member Author

Thanks @yuliacech, looks like the timeout error check function doesn't work for this specific API. Replaced that and this should work now :)

@yuliacech yuliacech self-requested a review July 1, 2024 09:13
@kibana-ci
Copy link
Collaborator

💚 Build Succeeded

Metrics [docs]

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id before after diff
indexManagement 665.2KB 665.2KB +21.0B

History

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

Copy link
Contributor

@yuliacech yuliacech left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for fixing the timeout issue, @sphilipse! Re-tested locally, latest changes LGTM 👍

@sphilipse sphilipse merged commit 69c0858 into elastic:main Jul 2, 2024
18 checks passed
@kibanamachine kibanamachine added the backport:skip This commit does not require backporting label Jul 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport:skip This commit does not require backporting release_note:skip Skip the PR/issue when compiling release notes Team:Search v8.15.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants