Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize disjunction counts. #12415

Merged
merged 15 commits into from
Aug 11, 2023
Merged

Commits on Jul 5, 2023

  1. Optimize disjunction counts.

    This introduces `LeafCollector#collect(DocIdStream)` to enable collectors to
    collect batches of doc IDs at once. `BooleanScorer` takes advantage of this by
    creating a `DocIdStream` whose `count()` method counts the number of bits that
    are set in the bit set of matches in the current window, instead of naively
    iterating over all matches.
    
    On wikimedium10m, this yields a ~20% speedup when counting hits for the `title
    OR 12` query (2.9M hits).
    
    Relates apache#12358
    jpountz committed Jul 5, 2023
    Configuration menu
    Copy the full SHA
    26cb6a6 View commit details
    Browse the repository at this point in the history
  2. iter

    jpountz committed Jul 5, 2023
    Configuration menu
    Copy the full SHA
    57da108 View commit details
    Browse the repository at this point in the history

Commits on Jul 7, 2023

  1. Configuration menu
    Copy the full SHA
    5102463 View commit details
    Browse the repository at this point in the history
  2. iter

    jpountz committed Jul 7, 2023
    Configuration menu
    Copy the full SHA
    2820fa1 View commit details
    Browse the repository at this point in the history
  3. simplify

    jpountz committed Jul 7, 2023
    Configuration menu
    Copy the full SHA
    f3fc1e6 View commit details
    Browse the repository at this point in the history
  4. Add javadocs

    jpountz committed Jul 7, 2023
    Configuration menu
    Copy the full SHA
    f9ec645 View commit details
    Browse the repository at this point in the history

Commits on Jul 8, 2023

  1. tidy

    jpountz committed Jul 8, 2023
    Configuration menu
    Copy the full SHA
    2025ba6 View commit details
    Browse the repository at this point in the history

Commits on Jul 28, 2023

  1. More docs.

    jpountz committed Jul 28, 2023
    Configuration menu
    Copy the full SHA
    74c378b View commit details
    Browse the repository at this point in the history

Commits on Aug 4, 2023

  1. Configuration menu
    Copy the full SHA
    d871c21 View commit details
    Browse the repository at this point in the history
  2. iter

    jpountz committed Aug 4, 2023
    Configuration menu
    Copy the full SHA
    5b79367 View commit details
    Browse the repository at this point in the history

Commits on Aug 5, 2023

  1. Configuration menu
    Copy the full SHA
    1497813 View commit details
    Browse the repository at this point in the history

Commits on Aug 11, 2023

  1. Configuration menu
    Copy the full SHA
    4d9a7ed View commit details
    Browse the repository at this point in the history
  2. CHANGES

    jpountz committed Aug 11, 2023
    Configuration menu
    Copy the full SHA
    5c237c9 View commit details
    Browse the repository at this point in the history
  3. s/min/base/

    jpountz committed Aug 11, 2023
    Configuration menu
    Copy the full SHA
    ba4d5e3 View commit details
    Browse the repository at this point in the history
  4. Handle corner case.

    jpountz committed Aug 11, 2023
    Configuration menu
    Copy the full SHA
    aaf7e89 View commit details
    Browse the repository at this point in the history