VSR: Use previous `checkpoint_id` in prepares' headers #1501

sentientwaffle · 2024-01-30T22:10:18Z

Background

Right now, a replica will not prepare (or even queue) requests whose op would extend beyond their next checkpoint's trigger op.

This is problematic for performance (the "checkpoint latency spike"), since the replica enters the new checkpoint with nothing to commit.
And the "latency spike" is farther exacerbated by the fact that clients will back off retrying their requests.

Originally we didn't start preparing past the checkpoint to guard against overwriting WAL entries before they will definitely not be needed again. But thanks to vsr_checkpoint_interval, that reason does not apply.

However, prepare headers include a checkpoint_id (motivation here). That checkpoint id isn't available until the checkpoint trigger prepare commits.

Fix

The first constants.pipeline_prepare_queue_max prepares that follow a checkpoint trigger op will now contain the previous checkpoint's id, since that is available both before and after that checkpoint trigger commits.

~~These are called the "border" prepares – they may be prepared during either checkpoint. (This part is not implemented in this commit, though!)~~

All prepares within a checkpoint include the checkpoint_id of the previous checkpoint. The transition occurs at the checkpoint op.

# Background Right now, a replica will not prepare (or even queue) requests whose op would extend beyond their next checkpoint's trigger op. This is problematic for performance (the "checkpoint latency spike"), since the replica enters the new checkpoint with nothing to commit. And the "latency spike" is farther exacerbated by the fact that clients will back off retrying their requests. Originally we didn't start preparing past the checkpoint to guard against overwriting WAL entries before they will definitely not be needed again. But thanks to `vsr_checkpoint_interval`, that reason does not apply. However, prepare headers include a `checkpoint_id` ([motivation here](https://github.com/tigerbeetle/tigerbeetle/blob/30cfbfa2eca94b8cd0b1d2a8ce41c7e7720128f0/src/vsr/message_header.zig#L531-L539)). That checkpoint id isn't available until the checkpoint trigger prepare commits. # Fix The first `constants.pipeline_prepare_queue_max` prepares that follow a checkpoint trigger op will now contain the _previous_ checkpoint's id, since that is available both before and after that checkpoint trigger commits. These are called the "border" prepares – they may be prepared during either checkpoint. (This part if not implemented in this commit, though!)

matklad · 2024-01-31T12:34:07Z

Joran floated an interesting idea during our 1-1 today: what if we don't do border prepares, and instead include checkpoint id of the previous checkpoint for all prepares, so that we don't have a special case?

I think this not only removes special case, but simplifies the mental model overall somewhat: prepares before the first trigger are not special. All prepares in a single checkpoint have the same checkpoint id (that of the previous checkpoint).

sentientwaffle · 2024-01-31T19:27:16Z

src/vsr/superblock.zig

+            // NOTE: Within the vsr_state.checkpoint assignment below, do not read from vsr_state
+            // directly. A miscompilation bug (as of Zig 0.11.0) causes fields to receive the
+            // incorrect values.


(Not sure if this is technically a 'miscompilation' or just underspecification, but it is surprising and worth a note either way.)

matklad · 2024-02-01T16:25:50Z

src/vsr/replica.zig

        /// Returns checkpoint id associated with the op.
        ///
-        /// Normally, this is just the id of the checkpoint the op builds on top. However, ops
+        /// Normally, this is just the id of the op's previous checkpoint. However, ops
        /// between a checkpoint and its trigger can't know checkpoint's id yet, and instead use


We don't have trigger special case anymore.

matklad · 2024-02-01T16:29:14Z

src/vsr/replica.zig

@@ -5013,35 +5014,41 @@ pub fn ReplicaType(
            return vsr.Checkpoint.trigger_for_checkpoint(self.op_checkpoint_next()).?;
        }

+        /// Returns the highest op that this replica can safely prepare to its WAL.
+        fn op_checkpoint_next_border(self: *const Self) u64 {


Now that we dont' have "border prepares", the "border" name doesn't seem really great, though I can't suggest a better alternative...

Maybe:

trigger --- the op when the checkpoints starts

due --- the op by which the checkpoint should have completed

Or maybe even not introduce any name here?

self.op_checkpoint_next_trigger() + constants.pipeline_prepare_queue_max

actually reads great to me: trigger, plus whatever we can pipeline over the trigger.

In the next PR in the sequence (which actually enables "prepare beyond the checkpoint") op_checkpoint_next_border is used very frequently, so imo having it in a helper function rather than inlining self.op_checkpoint_next_trigger() + constants.pipeline_prepare_queue_max is quite useful.

It is used in ways that "trigger" is often used right now -- "the highest op that we will accept in our WAL". So I don't think "trigger" or "due" make sense. Maybe "limit" or "ceiling"?

🤔 yeah, due has inverted semantics here -- checkpoint is indeed due to be done by the time we accept that op into our WAL, but, in code, we don't block until checkpoint is ready, we rather drop prepares.

Can't say that limit or ceiling is much better than the border. Hm, maybe

op_checkpoint_pipeline_limit

? I think "pipeline" is key here. Maybe even just

op_pipeline_limit

? The fact that it depends on the checkpoint is sort of an implementation detail I think?

Though, no strong opinion either way, we can live with border_prepare as well .

👍 I'm going to leave it as "border" at least for now. The fact that it is a term that is not used elsewhere in the code (so can't be confused with anything else) makes up for the fact that it is not self-explaining.

sentientwaffle added 2 commits January 30, 2024 13:59

VSR: Refactor into border_for_checkpoint(), op_checkpoint_next_border()

19f069c

sentientwaffle force-pushed the dj-vsr-prepare-beyond-checkpoint-border-prepares branch from 2c5e3f4 to 19f069c Compare January 30, 2024 22:11

sentientwaffle marked this pull request as ready for review January 30, 2024 22:22

sentientwaffle enabled auto-merge January 30, 2024 22:22

sentientwaffle assigned matklad Jan 30, 2024

sentientwaffle added 2 commits January 31, 2024 11:20

VSR: Rename previous_checkpoint_id to parent_checkpoint_id

8ceb03b

VSR: Add CheckpointState.grandparent_checkpoint_id

db66b81

sentientwaffle commented Jan 31, 2024

View reviewed changes

VSR: Change prepare.header.checkpoint_id at checkpoint boundary

d73b9ee

sentientwaffle force-pushed the dj-vsr-prepare-beyond-checkpoint-border-prepares branch from 31f7ba8 to d73b9ee Compare January 31, 2024 20:59

sentientwaffle changed the title ~~VSR: Use previous checkpoint_id in border prepares' headers~~ VSR: Use previous checkpoint_id in prepares' headers Feb 1, 2024

matklad reviewed Feb 1, 2024

View reviewed changes

VSR: Border prepare code review

0d750fc

matklad approved these changes Feb 1, 2024

View reviewed changes

sentientwaffle added this pull request to the merge queue Feb 1, 2024

Merged via the queue into main with commit 09634c7 Feb 1, 2024
25 checks passed

sentientwaffle deleted the dj-vsr-prepare-beyond-checkpoint-border-prepares branch February 1, 2024 16:58

sentientwaffle mentioned this pull request Feb 1, 2024

VSR: Prepare beyond checkpoint #1508

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VSR: Use previous `checkpoint_id` in prepares' headers #1501

VSR: Use previous `checkpoint_id` in prepares' headers #1501

sentientwaffle commented Jan 30, 2024 •

edited

matklad commented Jan 31, 2024 •

edited

sentientwaffle Jan 31, 2024

matklad Feb 1, 2024

matklad Feb 1, 2024

sentientwaffle Feb 1, 2024

matklad Feb 1, 2024

sentientwaffle Feb 1, 2024

VSR: Use previous checkpoint_id in prepares' headers #1501

VSR: Use previous checkpoint_id in prepares' headers #1501

Conversation

sentientwaffle commented Jan 30, 2024 • edited

Background

Fix

matklad commented Jan 31, 2024 • edited

sentientwaffle Jan 31, 2024

Choose a reason for hiding this comment

matklad Feb 1, 2024

Choose a reason for hiding this comment

matklad Feb 1, 2024

Choose a reason for hiding this comment

sentientwaffle Feb 1, 2024

Choose a reason for hiding this comment

matklad Feb 1, 2024

Choose a reason for hiding this comment

sentientwaffle Feb 1, 2024

Choose a reason for hiding this comment

VSR: Use previous `checkpoint_id` in prepares' headers #1501

VSR: Use previous `checkpoint_id` in prepares' headers #1501

sentientwaffle commented Jan 30, 2024 •

edited

matklad commented Jan 31, 2024 •

edited