persist: "packed" encodings for `Interval` and `Time` #27336
Conversation
src/repr/benches/packed.rs
Outdated
const INTERVAL: Interval = Interval::new(1, 1, 0);
group.bench_function("encode", |b| {
    b.iter(|| {
        let packed = PackedInterval::from(INTERVAL);
I might be totally off, but don't you need to `black_box` `INTERVAL`, too? Otherwise the optimizer might just constant-fold the expression?
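To illustrate the reviewer's point, here is a minimal sketch using `std::hint::black_box` (criterion re-exports an equivalent helper). The `pack` function below is a hypothetical stand-in for `PackedInterval::from`, not the actual implementation:

```rust
use std::hint::black_box;

// Hypothetical stand-in for `PackedInterval::from`: packs three fields
// into a fixed-size big-endian byte array.
fn pack(months: i32, days: i32, micros: i64) -> [u8; 16] {
    let mut buf = [0u8; 16];
    buf[0..4].copy_from_slice(&months.to_be_bytes());
    buf[4..8].copy_from_slice(&days.to_be_bytes());
    buf[8..16].copy_from_slice(&micros.to_be_bytes());
    buf
}

fn main() {
    const INTERVAL: (i32, i32, i64) = (1, 1, 0);
    // Without black_box, the optimizer sees that the inputs are
    // compile-time constants and can fold the entire `pack` call away,
    // so the benchmark loop would measure nothing.
    let (months, days, micros) = black_box(INTERVAL);
    let packed = pack(months, days, micros);
    assert_eq!(&packed[0..4], &1i32.to_be_bytes());
}
```

Passing the input through `black_box` hides its value from the optimizer while still being a no-op at runtime.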
That totally makes sense, the benchmarks also seemed too fast and the numbers look more reasonable now, thanks!
📈!
/// Interprets a slice of bytes as a [`PackedNaiveTime`].
///
/// Returns an error if the size of the slice is incorrect.
pub fn from_bytes(slice: &[u8]) -> Result<Self, String> {
This could be implemented a bit more directly via `try_into`: https://play.rust-lang.org/?version=stable&mode=debug&edition=2021&gist=b3c429d443727dbe68bc28817f9aea9d
I wasn't sure how I felt about this leaving data only partially validated, but I think it's fine to panic at read time. Hard to imagine we'd ever do anything else in production...
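A sketch of what the `try_into` version might look like; the field layout and widths here are illustrative assumptions, not the actual `PackedNaiveTime` definition:

```rust
// Illustrative stand-in for PackedNaiveTime: seconds plus subsecond nanos.
#[derive(Debug, PartialEq)]
pub struct PackedNaiveTime {
    secs: u32,
    nanos: u32,
}

impl PackedNaiveTime {
    /// Interprets a slice of bytes as a `PackedNaiveTime`.
    ///
    /// Returns an error if the size of the slice is incorrect.
    pub fn from_bytes(slice: &[u8]) -> Result<Self, String> {
        // `try_into` converts `&[u8]` into `[u8; 8]`, failing when the
        // length is wrong, which replaces a manual length check.
        let buf: [u8; 8] = slice
            .try_into()
            .map_err(|_| format!("expected 8 bytes, got {}", slice.len()))?;
        let secs = u32::from_be_bytes(buf[0..4].try_into().expect("length checked"));
        let nanos = u32::from_be_bytes(buf[4..8].try_into().expect("length checked"));
        Ok(PackedNaiveTime { secs, nanos })
    }
}

fn main() {
    let t = PackedNaiveTime::from_bytes(&[0, 0, 0, 10, 0, 0, 0, 1]).unwrap();
    assert_eq!(t, PackedNaiveTime { secs: 10, nanos: 1 });
    assert!(PackedNaiveTime::from_bytes(&[0u8; 3]).is_err());
}
```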
Ahhh nice! Updated to use `try_into`.
…e` and `MzAclType` (#27360)

This PR does a few things:

1. Introduces a `ColumnarCodec` trait that defines how to encode a type `T` so it can be durably persisted in an `arrow::array::FixedSizeBinaryArray`.
2. Refactors the impls of `PackedInterval` and `PackedNaiveTime` (introduced in #27336) to use `ColumnarCodec`.
3. Adds `PackedAclItem` and `PackedMzAclItem`, which implement `ColumnarCodec`.
4. Refactors existing benchmarks and adds more to measure the throughput of encoding and decoding for all of the "packed" types vs the existing `ProtoDatum` types.

#### Benchmarks

The existing benchmarks have been reworked a bit to include encoding into an existing buffer and reading from a slice. Also included are benchmarks for the same workflow using the existing protobuf types, since the motivation for this work is to improve throughput compared to protobuf.

type | encode | decode
-------------------|-------------------|-------------------
interval/packed | ~1,172 mil/second | ~1,000 mil/second
interval/proto | ~239.3 mil/second | ~120.4 mil/second
time/packed | ~722.1 mil/second | ~1,038 mil/second
time/proto | ~336.9 mil/second | ~187.3 mil/second
acl_item/packed | ~1,050 mil/second | ~1,033 mil/second
acl_item/proto | ~117.7 mil/second | ~56.5 mil/second
mz_acl_item/packed | ~385.6 mil/second | ~590.0 mil/second
mz_acl_item/proto | ~58.9 mil/second | ~28.5 mil/second

In general, the "packed" representations have 5x to 20x higher throughput than their protobuf alternatives. FWIW, I believe the packed representations are so much faster because protobuf encodes integers with LEB128 and encodes fields one at a time, whereas the packed representations use a bit more memory but encode integers at their true size and encode an entire type "all at once".

### Motivation

Progress towards https://github.com/MaterializeInc/materialize/issues/24830

### Checklist

- [ ] This PR has adequate test coverage / QA involvement has been duly considered. ([trigger-ci for additional test/nightly runs](https://trigger-ci.dev.materialize.com/))
- [ ] This PR has an associated up-to-date [design doc](https://github.com/MaterializeInc/materialize/blob/main/doc/developer/design/README.md), is a design doc ([template](https://github.com/MaterializeInc/materialize/blob/main/doc/developer/design/00000000_template.md)), or is sufficiently small to not require a design.
- [ ] If this PR evolves [an existing `$T ⇔ Proto$T` mapping](https://github.com/MaterializeInc/materialize/blob/main/doc/developer/command-and-response-binary-encoding.md) (possibly in a backwards-incompatible way), then it is tagged with a `T-proto` label.
- [ ] If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label ([example](MaterializeInc/cloud#5021)).
- [x] This PR includes the following [user-facing behavior changes](https://github.com/MaterializeInc/materialize/blob/main/doc/developer/guide-changes.md#what-changes-require-a-release-note):
  - N/a
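To make the LEB128-vs-fixed-width distinction concrete, here is a hedged sketch contrasting a protobuf-style varint encoder with a single fixed-width write. This is generic varint logic for illustration, not Materialize's actual code:

```rust
// Protobuf-style LEB128 varint: 7 value bits per byte, the high bit
// marks continuation. Variable-length output, one byte per loop turn.
fn encode_leb128(mut v: u64, out: &mut Vec<u8>) {
    loop {
        let byte = (v & 0x7f) as u8;
        v >>= 7;
        if v == 0 {
            out.push(byte);
            break;
        }
        out.push(byte | 0x80); // continuation bit set
    }
}

// Fixed-width encoding: one unconditional 8-byte copy, branch-free and
// friendly to autovectorization, at the cost of a few extra bytes.
fn encode_fixed(v: u64, out: &mut Vec<u8>) {
    out.extend_from_slice(&v.to_be_bytes());
}

fn main() {
    let (mut leb, mut fixed) = (Vec::new(), Vec::new());
    encode_leb128(300, &mut leb);
    encode_fixed(300, &mut fixed);
    assert_eq!(leb, vec![0xac, 0x02]); // 300 fits in two varint bytes
    assert_eq!(fixed.len(), 8); // always the integer's true size
}
```

The per-byte loop and data-dependent length are what make the varint path hard to pipeline, which is consistent with the throughput gap in the table above.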
This PR implements "packed" encodings for `Interval` and `Time` that will be used for writing structured data in Persist. The packed encodings are designed to be as fast as possible and have the same sort order as the original types. They're based on discussion from #26175. I added benchmarks that measure throughput and locally I get the following results:
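The sort-order property can be sketched like this. The sign-bit flip is a standard trick for order-preserving byte encodings and is an illustrative assumption here, not the exact `PackedInterval` layout:

```rust
// Flipping the sign bit of a signed integer before writing it
// big-endian makes lexicographic byte order match numeric order:
// negative values sort below positive ones.
fn pack_i32(v: i32) -> [u8; 4] {
    ((v as u32) ^ 0x8000_0000).to_be_bytes()
}

fn main() {
    let vals = [-5i32, -1, 0, 1, 42];
    let packed: Vec<[u8; 4]> = vals.iter().map(|&v| pack_i32(v)).collect();
    // Byte-wise comparison agrees with integer comparison.
    for w in packed.windows(2) {
        assert!(w[0] < w[1]);
    }
}
```

Because comparisons reduce to a plain `memcmp` over the packed bytes, sorting never needs to decode the values.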
I believe the implementations can be optimized further with SIMD, but right now that requires `inline_asm!` or the Nightly compiler.

Note: The code placement feels a bit weird since these impls should only be used by Persist, so I was thinking of putting the impl behind a trait like `mz_persist_types::ColumnarCodec`, but was curious what other folks thought.

Motivation
Progress towards https://github.com/MaterializeInc/database-issues/issues/7411
Checklist
- If this PR evolves an existing `$T ⇔ Proto$T` mapping (possibly in a backwards-incompatible way), then it is tagged with a `T-proto` label.