
Make data predicate evaluation column-major #2730

Merged

Conversation

dominiklohmann
Member

@dominiklohmann dominiklohmann commented Nov 24, 2022

This is a major performance optimization for expression evaluation that essentially fills in one of the last TODOs from #2440: Making data predicate evaluation itself column-major, i.e., moving as much as possible of the evaluation out of the hot loop.

Here's how the new expression evaluation algorithm works, roughly, with (3b) and (4) being the new and improved steps:

  1. Normalize the selection bitmap from the dense index result to the length of the batch plus offset.
  2. Determine whether the expression is empty, a connective of some sort, or a predicate. For connectives, resolve them recursively and combine the resulting bitmaps accordingly.
  3. Evaluate predicates:
    a) If it's a meta extractor, operate on the batch metadata. In case of a match, the selection bitmap directly becomes the result.
    b) If it's a data predicate, access the desired array, and lift the resolved types for both sides of the predicate into a compile-time context for the column evaluator.
  4. The column evaluator has specializations based on the three-tuple of lhs type, relational operator, and rhs view. The generic fallback case iterates over all fields per the selection bitmap, evaluating each cell with the cell evaluator, which can be specialized per relational operator.

We've seen some pretty great results. When running a query with multiple substring searches (for which we have neither sparse nor dense index support) against a database of 6M events, the full scan of the database is now over 2x faster than before.

Concretely, we measured the following numbers using Hyperfine with three warmup runs and ten measured runs, comparing this commit to its merge-base with master, `de86af3fca`.

New:

Time (mean ± σ):     11.038 s ±  0.360 s    [User: 28.600 s, System: 4.430 s]
Range (min … max):   10.808 s … 11.569 s    10 runs

Old:

Time (mean ± σ):     22.805 s ±  0.616 s    [User: 45.060 s, System: 4.826 s]
Range (min … max):   22.042 s … 23.568 s    10 runs
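For reference, a Hyperfine invocation along these lines produces output in the format above. The exact query and binary paths are not part of the PR, so the commands below are placeholders:

```shell
# Hypothetical invocation; substitute the actual binaries and query.
hyperfine --warmup 3 --runs 10 \
  './vast-new export json "<query with substring searches>"' \
  './vast-old export json "<query with substring searches>"'
```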

This was a Hackathon group effort with @patszt, @dispanser, and @dominiklohmann.

@dominiklohmann dominiklohmann added the performance Improvements or regressions of performance label Nov 24, 2022
dominiklohmann added a commit that referenced this pull request Nov 24, 2022
I noticed this when working on #2730; I doubt this makes a big
difference overall, but we still shouldn't do unnecessary work to infer a schema
when we already have it.
@dominiklohmann dominiklohmann requested a review from a team November 24, 2022 11:53
@dominiklohmann dominiklohmann added the blocked Blocked by an (external) issue label Nov 24, 2022
Member

@mavam mavam left a comment


Looks good at a high level. I only left some minor questions/comments. If CI passes, this would work for me.

@dominiklohmann dominiklohmann force-pushed the story/sc-35030/columnar-data-predicate-evaluation branch from c584a14 to dd4d0b5 Compare November 25, 2022 17:17
@dominiklohmann dominiklohmann added rc and removed blocked Blocked by an (external) issue labels Dec 6, 2022
dispanser and others added 7 commits December 6, 2022 18:29
Co-authored-by: Patryk Sztyglic <patryk.sztyglic@gmail.com>
Co-authored-by: Thomas Peiselt <pi@kulturguerilla.org>
Co-authored-by: Dominik Lohmann <mail@dominiklohmann.de>
Co-authored-by: Matthias Vallentin <matthias@vallentin.net>
@dominiklohmann dominiklohmann force-pushed the story/sc-35030/columnar-data-predicate-evaluation branch from f4c5bd4 to 98add3e Compare December 6, 2022 17:30
@dominiklohmann dominiklohmann merged commit 8972422 into master Dec 6, 2022
@dominiklohmann dominiklohmann deleted the story/sc-35030/columnar-data-predicate-evaluation branch December 6, 2022 19:05