faster metric scans #7920

richardstartin · 2021-12-17T12:52:17Z

This change is motivated by a profile where a user performed a summation over a raw column in a group by query, and a significant amount of time was spent in bounds checks.

This change adds fillValues methods to the ForwardIndexReader interface, which can avoid bounds checks and perform vectorized copies when the range of docIds is contiguous. With the current APIs there is otherwise no way to avoid bounds checks, because the compiler cannot infer that the docIds array is monotonic.
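To illustrate the difference, here is a minimal sketch of the two access patterns, assuming a fixed-width long column stored back to back in a ByteBuffer. The class and method names (ContiguousFillSketch, readOneByOne, readContiguous) are hypothetical and are not the signatures added by this PR.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.util.Arrays;

// Hypothetical sketch; not the ForwardIndexReader API.
class ContiguousFillSketch {

  // One absolute, bounds-checked buffer read per docId. Nothing tells the
  // compiler that docIds is sorted, so each access is checked separately.
  static void readOneByOne(ByteBuffer rawData, int[] docIds, int length, long[] values) {
    for (int i = 0; i < length; i++) {
      values[i] = rawData.getLong(docIds[i] * Long.BYTES);
    }
  }

  // When the caller knows the docIds are firstDocId, firstDocId + 1, ...,
  // the whole range can be copied with a single bulk get.
  static void readContiguous(ByteBuffer rawData, int firstDocId, int length, long[] values) {
    // duplicate() does not carry over the byte order, so set it explicitly.
    ByteBuffer view = rawData.duplicate().order(rawData.order());
    view.position(firstDocId * Long.BYTES);
    view.asLongBuffer().get(values, 0, length);
  }

  public static void main(String[] args) {
    ByteBuffer rawData = ByteBuffer.allocate(8 * Long.BYTES).order(ByteOrder.LITTLE_ENDIAN);
    for (int docId = 0; docId < 8; docId++) {
      rawData.putLong(docId * Long.BYTES, 100L + docId);
    }
    long[] slow = new long[4];
    long[] fast = new long[4];
    readOneByOne(rawData, new int[] {2, 3, 4, 5}, 4, slow);
    readContiguous(rawData, 2, 4, fast);
    System.out.println(Arrays.equals(slow, fast));
  }
}
```

Both methods produce the same values for a contiguous range; the bulk variant just gives the runtime one range check and a copy it is free to vectorize.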

With

    "noDictionaryColumns": [
      "clicks"
    ]

This speeds up select platform, sum(clicks) from complexWebsite group by platform on a 7.5M row segment by ~25%: 60ms down to 45ms.

Summation is not the best case for the change because it requires conversion from long to double; accumulating the sum as a long would amplify the effect.
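A small sketch of the two accumulation styles, for illustration only. SumSketch and its methods are hypothetical names, and the long variant assumes the total fits in a long (it wraps silently on overflow, which the double variant does not).

```java
// Hypothetical sketch; not Pinot's aggregation code.
class SumSketch {

  // Every element is widened from long to double before the add.
  static double sumAsDouble(long[] values, int length) {
    double sum = 0.0;
    for (int i = 0; i < length; i++) {
      sum += values[i];
    }
    return sum;
  }

  // Integer adds only; a single conversion can happen after the loop if needed.
  static long sumAsLong(long[] values, int length) {
    long sum = 0L;
    for (int i = 0; i < length; i++) {
      sum += values[i];
    }
    return sum;
  }

  public static void main(String[] args) {
    long[] clicks = {1L, 2L, 3L, 4L};
    System.out.println(sumAsDouble(clicks, clicks.length));
    System.out.println(sumAsLong(clicks, clicks.length));
  }
}
```

The two are not interchangeable in general: the long sum stays exact up to overflow, while the double sum loses precision above 2^53.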

codecov-commenter · 2021-12-17T13:21:50Z

Codecov Report

Merging #7920 (3a96ce8) into master (428e3f2) will decrease coverage by 0.11%.
The diff coverage is 49.42%.

@@             Coverage Diff              @@
##             master    #7920      +/-   ##
============================================
- Coverage     71.37%   71.25%   -0.12%     
- Complexity     4193     4210      +17     
============================================
  Files          1595     1594       -1     
  Lines         82514    82613      +99     
  Branches      12304    12316      +12     
============================================
- Hits          58895    58870      -25     
- Misses        19643    19768     +125     
+ Partials       3976     3975       -1

Flag	Coverage Δ
integration1	`28.92% <2.29%> (-0.21%)`	⬇️
integration2	`27.48% <2.29%> (-0.09%)`	⬇️
unittests1	`68.20% <49.42%> (-0.03%)`	⬇️
unittests2	`14.29% <0.00%> (-0.01%)`	⬇️

Impacted Files	Coverage Δ
...readers/forward/BaseChunkSVForwardIndexReader.java	`46.15% <18.36%> (-46.95%)`	⬇️
...t/segment/spi/index/reader/ForwardIndexReader.java	`73.86% <88.88%> (+67.61%)`	⬆️
...java/org/apache/pinot/core/common/DataFetcher.java	`85.18% <100.00%> (-0.53%)`	⬇️
...data/manager/realtime/DefaultSegmentCommitter.java	`0.00% <0.00%> (-80.00%)`	⬇️
...a/manager/realtime/RealtimeSegmentDataManager.java	`50.00% <0.00%> (-25.00%)`	⬇️
...er/api/resources/LLCSegmentCompletionHandlers.java	`43.56% <0.00%> (-18.82%)`	⬇️
.../common/request/context/predicate/EqPredicate.java	`66.66% <0.00%> (-13.34%)`	⬇️
...data/manager/realtime/SegmentCommitterFactory.java	`88.23% <0.00%> (-11.77%)`	⬇️
...ache/pinot/core/operator/docidsets/OrDocIdSet.java	`86.36% <0.00%> (-11.37%)`	⬇️
...altime/ServerSegmentCompletionProtocolHandler.java	`51.42% <0.00%> (-6.67%)`	⬇️
... and 21 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

richardstartin · 2021-12-19T21:44:23Z

I added a benchmark to show where the difference comes from, but the benchmark also hints at the minimum cost of query vectorization: an array copy. This cost could be eliminated for reductions.

Benchmark                                                (_blockSize)  (_numBlocks)  Mode  Cnt   Score   Error  Units
BenchmarkFixedByteSVForwardIndexReader.readDoubles              10000          1000  avgt    5  37.284 ± 0.919  ms/op <- bad, long to double conversion
BenchmarkFixedByteSVForwardIndexReader.readDoublesBatch         10000          1000  avgt    5  15.674 ± 0.245  ms/op <- no bounds checks
BenchmarkFixedByteSVForwardIndexReader.readLongs                10000          1000  avgt    5  35.244 ± 1.947  ms/op <- bad, but no type conversion
BenchmarkFixedByteSVForwardIndexReader.readLongsBatch           10000          1000  avgt    5  10.777 ± 0.163  ms/op <- best case, vectorized copy, no type conversion
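
To make the remark about reductions concrete, here is a rough sketch of summing a contiguous docId range with and without the intermediate array copy. It assumes a fixed-width long column in a ByteBuffer. FusedReductionSketch and its methods are hypothetical and are not part of this PR or of the benchmark above, and whether the in-place loop is actually faster depends on what the JIT does with the buffer accesses.

```java
import java.nio.ByteBuffer;
import java.nio.LongBuffer;

// Hypothetical sketch; not part of the PR.
class FusedReductionSketch {

  // Copy the range into a scratch array, then reduce over the array.
  static long sumViaCopy(ByteBuffer rawData, int firstDocId, int length, long[] scratch) {
    viewFrom(rawData, firstDocId).get(scratch, 0, length);
    long sum = 0L;
    for (int i = 0; i < length; i++) {
      sum += scratch[i];
    }
    return sum;
  }

  // Reduce straight from the buffer view, skipping the scratch array.
  static long sumInPlace(ByteBuffer rawData, int firstDocId, int length) {
    LongBuffer view = viewFrom(rawData, firstDocId);
    long sum = 0L;
    for (int i = 0; i < length; i++) {
      sum += view.get(i); // index is relative to firstDocId
    }
    return sum;
  }

  // A LongBuffer whose index 0 corresponds to firstDocId.
  private static LongBuffer viewFrom(ByteBuffer rawData, int firstDocId) {
    ByteBuffer dup = rawData.duplicate().order(rawData.order());
    dup.position(firstDocId * Long.BYTES);
    return dup.asLongBuffer();
  }

  public static void main(String[] args) {
    ByteBuffer rawData = ByteBuffer.allocate(8 * Long.BYTES);
    for (int docId = 0; docId < 8; docId++) {
      rawData.putLong(docId * Long.BYTES, docId + 1L);
    }
    System.out.println(sumViaCopy(rawData, 2, 4, new long[4]));
    System.out.println(sumInPlace(rawData, 2, 4));
  }
}
```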

klsince · 2021-12-20T18:03:09Z

...he/pinot/segment/local/segment/index/readers/forward/FixedByteChunkSVForwardIndexReader.java

@@ -91,4 +264,8 @@ public double getDouble(int docId, ChunkReaderContext context) {
      return _rawData.getDouble(docId * Double.BYTES);
    }
  }
+
+  private boolean isContiguousRange(int[] docIds, int length) {


klsince: nit: it'd be helpful to comment a bit on when the range can be contiguous and when not.

richardstartin: Why? That doesn't seem like a responsibility of this class, does it?

klsince: I think I lack some context to understand this new check. It doesn't necessarily need to be a comment in the code, but I would appreciate it if you could shed some light in the conversation here. Just looking at this check method, I assume it requires the docIds to be unique and sorted in ascending order. So when does that happen, and when not? This may help me understand when the optimization can kick in.

richardstartin: OK, got it. However, I think that kind of info would be better off in a readme about how the query and storage layers interact. I can do that (because understanding this area of the code has been hard for me), but not in this PR. I don't think the comment belongs here, and it would be superfluous after reading a decent readme.
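
For readers following this thread, a sketch of what such a check could look like under the precondition discussed above (docIds unique and sorted ascending). The method body is an assumption made for illustration; the diff hunk quoted here shows only the signature, not the implementation.

```java
// Hypothetical sketch of a contiguity check; the body is assumed, not quoted from the PR.
class ContiguityCheckSketch {

  // If docIds[0..length) is strictly ascending, the range is contiguous exactly
  // when the last id is (length - 1) above the first. The check is O(1) and
  // does not itself verify the ordering.
  static boolean isContiguousRange(int[] docIds, int length) {
    return length > 0 && docIds[length - 1] - docIds[0] == length - 1;
  }

  public static void main(String[] args) {
    // A block with no gaps, e.g. every document in a range matched.
    System.out.println(isContiguousRange(new int[] {10, 11, 12, 13}, 4));
    // A block with gaps, e.g. a filter skipped some documents.
    System.out.println(isContiguousRange(new int[] {10, 12, 13, 17}, 4));
    // Precondition violated (not sorted): the check cannot detect this.
    System.out.println(isContiguousRange(new int[] {10, 12, 11, 13}, 4));
  }
}
```

The last call shows why the precondition matters: an unsorted block with the right endpoints passes the check, so the caller has to guarantee ascending, duplicate-free docIds.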

richardstartin · 2021-12-20T20:32:08Z

@siddharthteotia to review before merging.

siddharthteotia · 2021-12-20T21:56:30Z

@siddharthteotia to review before merging.

Yes, I will make sure to go through this by EOD. Thank you for waiting

richardstartin · 2021-12-22T19:06:28Z

This feels like a fairly low risk change for a decent improvement to me. Are there any blockers here?

richardstartin force-pushed the faster-metric-scans branch 3 times, most recently from a999be7 to e71d582 Compare December 17, 2021 16:52

richardstartin marked this pull request as ready for review December 17, 2021 17:57

richardstartin changed the title ~~[WIP] faster metric scans~~ faster metric scans Dec 17, 2021

kishoreg reviewed Dec 19, 2021

richardstartin force-pushed the faster-metric-scans branch from d20f123 to 9368ab2 Compare December 19, 2021 15:09

klsince reviewed Dec 20, 2021

Jackie-Jiang approved these changes Dec 21, 2021

richardstartin mentioned this pull request Dec 21, 2021

Power of 2 fixed size chunks #7934

Merged

richardstartin force-pushed the faster-metric-scans branch from a988aa7 to b3a7772 Compare December 22, 2021 18:05

siddharthteotia approved these changes Dec 23, 2021

Jackie-Jiang reviewed Dec 24, 2021

richardstartin added 5 commits December 24, 2021 01:23

b7b2f4f add fillValues to ForwardIndexReader to accelerate contiguous scans on fixed width metric columns

f029c3d review comments

48b97a5 add benchmark

8b0d5f3 move string handling to default implementations

3a96ce8 move methods to support more readers

richardstartin force-pushed the faster-metric-scans branch from b3a7772 to 3a96ce8 Compare December 24, 2021 01:35

mayankshriv merged commit 0df8492 into apache:master Jan 5, 2022

richardstartin mentioned this pull request Feb 5, 2022

Aggregation delay conversion to double #8139

Merged
