
storage/mvlog: introduce batch collection to be used in new log impl #17358

Merged
merged 3 commits into redpanda-data:dev on Apr 3, 2024

Conversation

andrwng (Contributor) commented Mar 24, 2024

I don't have the full read path working end-to-end yet, but these bits haven't been changing much so I figured I'd push it to get some early feedback.

This PR introduces some foundational types, and a building block for the read path to be used in an upcoming implementation of a storage log that handles concurrency with MVCC (hence the added namespace mvlog).

This log's basic unit of data will be an "entry", which may be a record batch or another kind of marker (e.g. a term marker or truncation marker). Ultimately, though, this log will need to implement the model::record_batch_reader::impl interface, and to that end, this PR introduces a batch_collector abstraction that will serve as a building block for the reader.

The batch collector takes inspiration from the existing parser and consumer implementations in the storage layer, but focuses exclusively on logical checks and invariants of the data. The reader implementation will own this collector and use it to collect record batches across multiple segments, with the collector surfacing lifecycle signals (e.g. done, full) to the higher-level reader.

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v23.3.x
  • v23.2.x

Release Notes

  • none

}
}

TEST(BatchCollectorTest, TestDataTooHigh) {
Member:

🔥

Adds an error type to be used in the new multi-version log.
Adds an enum to be used along the read path.
Adds an abstraction that will be used on the read path to collect record
batches. This abstraction encapsulates some of the behavior that exists
in storage::log_segment_batch_reader and storage::skipping_consumer in
that its role is to determine whether it should collect a given record
as a part of a read, and then collect it.

Some later changes will introduce a new entry abstraction that will wrap
the entire record batch header and body to be fed into this collector.

I considered implementing the existing batch_consumer interface and
encapsulating some bits of the segment batch reader into a new reader
class, but felt like the batch_consumer overcomplicated the business
logic of batch collection.
@andrwng andrwng marked this pull request as ready for review March 27, 2024 20:47
@andrwng andrwng requested a review from dotnwat March 27, 2024 20:47
@@ -0,0 +1,13 @@
enable_clang_tidy()

Seastar::seastar
v::base
v::bytes
v::storage
Member:

something i did when i was working on io:: that i think was nice was to avoid all but the bare essential dependencies for as long as possible. do we need v::storage?

Contributor Author:

We discussed this a bit offline. We'll only end up bringing a couple of classes out of storage. In the near term, that'll be some serialization utils and the log_reader_config. I'll consider either moving the generic stuff into its own module or copying over just what I need into this module, though likely as a follow-up.

}

private:
static constexpr size_t default_max_buffer_size = 32_KiB;
Member:

surprised that this doesn't need to be defined up above where it's used. oh well, i guess if it compiles!

const size_t target_max_buffer_size_;

// The last offset seen by this collector.
model::offset last_offset_;
Member:

what does the initial value of offset::min mean? that no offset has yet been seen sort of like std::optional<offset>? i guess min() also works out for the comparison to the added batch offset without any extra checks.

Contributor Author:

Right, exactly. This is a good callout -- it's probably worth switching over to optional<> rather than sentinel values.

Member:

with some planning, it is often really nice to do something like have the constructor take the first batch and then there is no special initial state.

// TODO: add ghost batch building here.

cur_buffer_size_ += batch_hdr.size_bytes;
batch_hdr.ctx.term = cur_term_;
Member:

got it so the term isn't stored in the batches on disk, we're going to mix that in as we go along from some other source?

Contributor Author:

Right, at least in my local branch, the term is going to be stored as a part of the entry body envelope for record batches. For now I'm just reusing the disk serialization for record batches we have today in segment appender, rather than using the envelope, since we already have an envelope to wrap the record batch guts.


cur_buffer_size_ += batch_hdr.size_bytes;
batch_hdr.ctx.term = cur_term_;
batches_.emplace_back(model::record_batch(
Member:

doesn't emplace_back let you pass in the ctor parameters directly instead of using the move constructor of record_batch?

Contributor Author:

Oops! Yes. Will follow up

Comment on lines +47 to +50
result<reader_outcome, errc> set_term(model::term_id new_term) noexcept;

// Releases the batches to the caller.
ss::circular_buffer<model::record_batch> release_batches() noexcept {
Member:

did you consider simplifying the interface by making batch_collector non-reusable? then release_batches could effectively be r-value qualified, and the term would be passed into the constructor and avoid resetting cur_buffer_size?

Contributor Author:

I did, but I ended up preferring the conceptual simplicity of one batch collector per log reader. Because a log reader may span multiple segments, and because each segment will need to add to a collector, I found it easier to reason about lifecycle of the collector by tying its lifetime to the log reader.

@andrwng andrwng merged commit cf0e547 into redpanda-data:dev Apr 3, 2024
18 checks passed