Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ILM: revise and enhance error handling in the ILM loop #52485

Closed
andreidan opened this issue Feb 18, 2020 · 2 comments
Closed

ILM: revise and enhance error handling in the ILM loop #52485

andreidan opened this issue Feb 18, 2020 · 2 comments
Labels
:Data Management/ILM+SLM Index and Snapshot lifecycle management >refactoring Team:Data Management Meta label for data/management team

Comments

@andreidan
Copy link
Contributor

andreidan commented Feb 18, 2020

We iteratively trigger the ILM policies by parsing the ILM cluster state for every index that has a policy attached (as indicated by the index.lifecycle.name setting).
We addressed a scenario where the trigger of polices was suspended when a policy for an index encountered an error while trying to trigger in #49128
but there might be an opportunity to widen the scope of the try/catch block we added to include the initial cluster state parsing.

As a side note, it'd be interesting to try and devise a testing framework that'd wreak havoc during the policy execution (skip some policy steps, wipe out some/all of the cluster state we hold for the ilm execution state for example) and make sure ILM continues running for the indexes that have a healthy state.

@andreidan andreidan added >refactoring :Data Management/ILM+SLM Index and Snapshot lifecycle management labels Feb 18, 2020
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-core-features (:Core/Features/ILM+SLM)

@rjernst rjernst added the Team:Data Management Meta label for data/management team label May 4, 2020
@dakrone
Copy link
Member

dakrone commented May 17, 2024

This has been open for quite a while, and we haven't made much progress on this due to focus in other areas. For now I'm going to close this as something we aren't planning on implementing. We can re-open it later if needed.

We have, however, made some progress in separate issues around this, so I don't think this is necessary any more.

@dakrone dakrone closed this as completed May 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Data Management/ILM+SLM Index and Snapshot lifecycle management >refactoring Team:Data Management Meta label for data/management team
Projects
None yet
Development

No branches or pull requests

4 participants