
[Performance] Batch shard requests lying in the queue #4763

Open
itiyama opened this issue Oct 13, 2022 · 0 comments
Labels
discuss (issues intended to help drive brainstorming and decision making); distributed framework; enhancement (enhancement or improvement to existing feature or request); Indexing (indexing, bulk indexing, and anything related to indexing)

Comments

itiyama commented Oct 13, 2022

An optimal bulk size is a function of memory, shard count, thread count, number of coordinating nodes, and so on. Many of these factors change over time for a given customer, and customers may not revisit their bulk size settings once they are in place. For example, a cluster with 10 nodes and 10 shards may have an optimal bulk size of 2000, which, assuming documents are spread evenly, gives a shard-level bulk size of 200. Once the customer scales to 100 nodes and 100 shards, a bulk size of 2000 is no longer optimal: the coordination overhead becomes large, and the shard-level bulk size drops to just 20, which means more fsync calls. One could argue that customers can tune the bulk size themselves, but a larger bulk size means that larger requests wait in the coordinator queue, which increases memory overhead.
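
To make the arithmetic above concrete, and to illustrate what batching queued shard requests could look like, here is a minimal Python sketch. It is not OpenSearch code: `per_shard_bulk_size`, `ShardRequest`, `coalesce_queued_requests`, and the `max_docs_per_batch` cap are all hypothetical names introduced for illustration, and the sketch assumes documents are spread evenly across shards.

```python
from dataclasses import dataclass, field


def per_shard_bulk_size(bulk_size: int, shard_count: int) -> int:
    """Average documents per shard-level request, assuming an even spread."""
    return bulk_size // shard_count


@dataclass
class ShardRequest:
    """Hypothetical stand-in for a shard-level bulk request waiting in a queue."""
    shard_id: int
    docs: list = field(default_factory=list)


def coalesce_queued_requests(queue, max_docs_per_batch):
    """Merge queued requests that target the same shard.

    Each merged batch holds at most max_docs_per_batch documents. Batches are
    returned in the order in which they were opened.
    """
    open_batches = {}  # shard_id -> batch currently being filled
    merged = []
    for request in queue:
        for doc in request.docs:
            batch = open_batches.get(request.shard_id)
            if batch is None or len(batch.docs) >= max_docs_per_batch:
                # Start a new batch for this shard once the current one is full.
                batch = ShardRequest(request.shard_id)
                open_batches[request.shard_id] = batch
                merged.append(batch)
            batch.docs.append(doc)
    return merged


# The example from the issue: the same 2000-document bulk on two cluster sizes.
small_cluster = per_shard_bulk_size(2000, 10)
large_cluster = per_shard_bulk_size(2000, 100)

# Three small queued requests, two of them for shard 0.
queued = [
    ShardRequest(0, list(range(20))),
    ShardRequest(1, list(range(20))),
    ShardRequest(0, list(range(20, 40))),
]
batches = coalesce_queued_requests(queued, max_docs_per_batch=200)
```

In this sketch the two queued requests for shard 0 are merged into one 40-document batch, so the shard would perform one write (and potentially one fsync) instead of two. The cap bounds how large a merged batch can grow; how such a cap should be chosen is exactly the kind of question this issue is meant to discuss.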

@itiyama itiyama added enhancement Enhancement or improvement to existing feature or request untriaged labels Oct 13, 2022
@anasalkouz anasalkouz added discuss Issues intended to help drive brainstorming and decision making and removed untriaged labels Oct 18, 2022
@adnapibar adnapibar added the Indexing Indexing, Bulk Indexing and anything related to indexing label Nov 17, 2022

4 participants