Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dynamic Index Cache Sizing #86640

Open
pickypg opened this issue May 10, 2022 · 3 comments
Open

Dynamic Index Cache Sizing #86640

pickypg opened this issue May 10, 2022 · 3 comments
Labels
>enhancement :Search/Search Search-related issues that do not fall into other categories

Comments

@pickypg
Copy link
Member

pickypg commented May 10, 2022

Description

Index-level caching is one of many areas that can be helpfully tuned to improve the performance of Elasticsearch. Ironically, disabling / reducing a cache size is often a solution for improving the performance.

Elasticsearch has tracked hit/miss stats of the query cache, but it does nothing with the data other than inform an interested user about the effectiveness of their index's caching.

It would be interesting if the node would be able to tune the index query cache heuristically based on actual usage (age, frequency, and value). This could likely be combined with more automatic, global tweaks that occur based on the node's data tier.

This could ultimately be used to give GBs of heap back to an individual ES node instead of burning it on cache storage that's not effectively used (or useful).

@pickypg pickypg added >enhancement needs:triage Requires assignment of a team area label labels May 10, 2022
@arteam arteam added the :Search/Search Search-related issues that do not fall into other categories label May 12, 2022
@elasticmachine elasticmachine added the Team:Search Meta label for search team label May 12, 2022
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-search (Team:Search)

@arteam arteam removed Team:Search Meta label for search team needs:triage Requires assignment of a team area label labels May 12, 2022
@jpountz
Copy link
Contributor

jpountz commented Jun 7, 2022

I agree with the general observation that Elasticsearch is likely often burning GBs of heap on the filter cache for little benefit. We should more proactively give back heap to the JVM when possible.

In my opinion this is a general Lucene issue, let's move it to Lucene's JIRA?

@pickypg
Copy link
Member Author

pickypg commented Jun 7, 2022

https://issues.apache.org/jira/browse/LUCENE-10602

The aspect about tuning it by data tier may still be worthwhile for ES to pick up, such as removing it from the frozen tier.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
>enhancement :Search/Search Search-related issues that do not fall into other categories
Projects
None yet
Development

No branches or pull requests

4 participants