Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ML] Check for missing delete tasks and recreate during nightly maintenance #42840

Closed
droberts195 opened this issue Jun 4, 2019 · 2 comments
Closed
Assignees
Labels
>enhancement :ml Machine learning

Comments

@droberts195
Copy link
Contributor

When an ML job is deleted a task is created to delete all the things that need deleting. However, this is not a persistent task, only a node-specific task. If the node running that task is shut down before the deletion completes then the job is left in a partially deleted state.

One option to fix this would be to make the deletion task a persistent task. However, this is a huge amount of development work.

A simpler alternative could be to have the nightly maintenance action check for jobs in the deleting state that don't have a corresponding deletion task, and start a new deletion task for them.

@droberts195 droberts195 added >enhancement :ml Machine learning labels Jun 4, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/ml-core

@benwtrent
Copy link
Member

A simpler alternative could be to have the nightly maintenance action check for jobs in the deleting state that don't have a corresponding deletion task, and start a new deletion task for them.

I am all for that.

A persistent task seems like overkill.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
>enhancement :ml Machine learning
Projects
None yet
Development

No branches or pull requests

4 participants