Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Return more information in RefreshStats #49363

Closed
romseygeek opened this issue Nov 20, 2019 · 6 comments
Closed

Return more information in RefreshStats #49363

romseygeek opened this issue Nov 20, 2019 · 6 comments
Labels
:Distributed/Engine Anything around managing Lucene and the Translog in an open shard. >enhancement :Search/Search Search-related issues that do not fall into other categories Team:Distributed Meta label for distributed team Team:Search Meta label for search team

Comments

@romseygeek
Copy link
Contributor

A common cause of performance issues in elasticsearch is long-lasting or too-frequent refreshes (eg indexing multiple individual docs with refresh=true rather than using bulk loads, or enabling eager_global_orginals on high cardinality fields). Refresh stats currently give us the total number of refreshes and the total time they took, but this isn't particularly useful when trying to track down these refresh issues.

One possible enhancement would be to keep track of the number and length of refreshes in the previous 60 mins, and return that; we could also track the tail percentile refresh lengths and warn if a refresh takes longer than a certain amount of time.

@romseygeek romseygeek added >enhancement :Search/Search Search-related issues that do not fall into other categories labels Nov 20, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-search (:Search/Search)

@jimczi
Copy link
Contributor

jimczi commented Nov 22, 2019

If eager_global_ordinals is the culprit of these slow refreshes this should show up in the warmer stats. For high cardinality field the cost of this option should not change upon refreshes so the load time should be reflected in the stats. Maybe we could add dedicated entries for each warmer or add the loading time in the fielddata stats to ease debugging ?

@rjernst rjernst added the Team:Search Meta label for search team label May 4, 2020
@javanna javanna added the :Distributed/Engine Anything around managing Lucene and the Translog in an open shard. label May 3, 2023
@elasticsearchmachine elasticsearchmachine added the Team:Distributed Meta label for distributed team label May 3, 2023
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-search (Team:Search)

@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-distributed (Team:Distributed)

@javanna
Copy link
Member

javanna commented May 3, 2023

Given the recent discussions around refresh , I tagged the Distributed team to double check what they think about this issue that we opened a while ago but did not follow up on. Is it still something we'd like to address?

@javanna
Copy link
Member

javanna commented Jun 17, 2024

We have no plans to work on this issue, closing.

@javanna javanna closed this as not planned Won't fix, can't repro, duplicate, stale Jun 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed/Engine Anything around managing Lucene and the Translog in an open shard. >enhancement :Search/Search Search-related issues that do not fall into other categories Team:Distributed Meta label for distributed team Team:Search Meta label for search team
Projects
None yet
Development

No branches or pull requests

6 participants