Slice query for Point in time readers (PIT) #65740

jimczi · 2020-12-02T13:19:01Z

Slice queries that run under a point-in-time (PIT) reader should filter documents by their internal Lucene document ID. If all slices use the same PIT, filtering on Lucene document IDs should be much more efficient than the current TermsSliceQuery, which uses the _id field.
We could also deprecate slices in scrolls if they prove more efficient inside a PIT, but that can be done in a follow-up.
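
To illustrate why a shared PIT makes document-ID slicing safe, here is a toy sketch. It assumes a single shard reader whose document IDs form a fixed range `[0, max_doc)`; the function name `doc_id_slice` and the contiguous-range scheme are hypothetical and are not taken from the Elasticsearch code base.

```python
def doc_id_slice(max_doc: int, slice_id: int, max_slices: int) -> range:
    """Return the doc IDs assigned to one slice (illustrative scheme only).

    Assumes doc IDs are the integers 0..max_doc-1 of one frozen reader,
    which is what a shared point-in-time reader would provide.
    """
    if not 0 <= slice_id < max_slices:
        raise ValueError("slice_id must be in [0, max_slices)")
    # Integer arithmetic keeps the boundaries consistent between slices:
    # the end of slice i is exactly the start of slice i + 1.
    start = max_doc * slice_id // max_slices
    end = max_doc * (slice_id + 1) // max_slices
    return range(start, end)


# Every doc ID lands in exactly one slice as long as all slices
# see the same max_doc, i.e. the same reader.
slices = [doc_id_slice(10, i, 3) for i in range(3)]
covered = sorted(doc for s in slices for doc in s)
```

The check is pure arithmetic on the ID, so no terms dictionary has to be walked. If the slices saw different readers, the same ID could refer to different documents and the partitions would no longer be reliable.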

elasticmachine · 2020-12-02T13:19:04Z

Pinging @elastic/es-search (Team:Search)

This PR adds support for using the `slice` option in point-in-time searches. By default, the slice query splits documents based on their Lucene ID. This strategy is more efficient than the one used for scrolls, which is based on the `_id` field and must iterate through the whole terms dictionary. When slicing a search, the same point-in-time ID must be used across slices to guarantee the partitions don't overlap or miss documents. Closes #65740.
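
As a rough sketch of what this looks like from the client side, the snippet below builds one search request body per slice, all sharing a single point-in-time ID. The helper name `build_slice_bodies`, the placeholder PIT ID, and the `match_all` query are assumptions for illustration; only the `slice`, `pit`, and `query` body keys follow the documented search API.

```python
import json


def build_slice_bodies(pit_id: str, max_slices: int, keep_alive: str = "1m"):
    """Build one search body per slice, reusing the same PIT ID in each.

    Hypothetical helper: it only constructs dictionaries and sends nothing.
    """
    bodies = []
    for slice_id in range(max_slices):
        bodies.append({
            # Which partition of the documents this request should return.
            "slice": {"id": slice_id, "max": max_slices},
            # Placeholder query; any query can be combined with a slice.
            "query": {"match_all": {}},
            # The same PIT ID in every body keeps the partitions consistent.
            "pit": {"id": pit_id, "keep_alive": keep_alive},
        })
    return bodies


# "example-pit-id" stands in for the ID returned when opening a PIT.
bodies = build_slice_bodies("example-pit-id", 2)
print(json.dumps(bodies[0], indent=2))
```

Each body would be sent as a separate search request, typically in parallel, with the results of the slices combined by the caller.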

jimczi added the >enhancement and :Search/Search labels Dec 2, 2020

elasticmachine added the Team:Search label Dec 2, 2020

jakelandis mentioned this issue Dec 2, 2020

Very large scroll search (i.e. reindex) can drastically slow down when slices > shards #65788

Closed

matriv self-assigned this Dec 7, 2020

jimczi unassigned matriv Mar 16, 2021

jtibshirani self-assigned this Jun 10, 2021

jtibshirani mentioned this issue Jun 23, 2021

Support search slicing with point-in-time #74457

Merged

jtibshirani closed this as completed in #74457 Jul 8, 2021
