vsr: fix races in client_replies #1515

matklad · 2024-02-05T12:26:13Z

it might be the case that writing bitset is not set, but there's a write for the slot in the queue: replies can wait on read to complete.
when a faulty read completes, it might clobber faulty bit, unset by a a write which was scheduled after the read.

The specific series of events here:

Replica receives a RequestReply and starts a reply read
The read completes with a failure, replica sets the faulty bit
Replica receives RequestReply starts a reply read
Replica receives Reply and starts a reply write - the write unsets faulty bit - the write doesn't start, because there's a read executing
The read completes, setting the faulty bit again
Replica receives RequestReply
- It doesn't start reply read, because there's an in-progress write that can resolve a read.
- But the faulty bit is set, tripping up an assertion.

The root issue here is the race between a read and a write for the same reply. Remove the race by explicitly handling the interleaving:

When submitting a read, resolve it immediately if there's a pending write (this was already handled by read_reply_sync)
When submitting a write, resolve any pending reads for the same reply.
Remove the code to block the write while the read is in-progress, as this is no longer possible.

Note that it is still possible that a read and a write for the same slot race, if they target different replies. In this case, there won't be clobbering, as, when the read completes, we double-check freshness by consulting client_sessions.

SEED: 2517747396662708227
Closes: #1511

matklad · 2024-02-05T12:28:48Z

Alternative fix is available at #1513, though I like this version more.

sentientwaffle · 2024-02-05T20:57:11Z

src/vsr/client_replies.zig

@@ -243,6 +243,15 @@ pub fn ClientRepliesType(comptime Storage: type) type {
                client_replies.write_reply_next();
            }

+            if (callback == null) {


Could rewrite this into const callback = read.callback orelse { ... to avoid callback.? later.

This needs to be a bit more complicated as the read's released here already, but, on balance, yeah, this seems better.

sentientwaffle · 2024-02-05T21:10:41Z

src/vsr/client_replies.zig

-                while (reads.next()) |read| {
-                    if (read.slot.index == write.slot.index) return;
-                }
-


Maybe in write_reply_callback we could iterate the reads and assert that none of them are to the slot that we just wrote?

There actually can be races after the state sync.

TIL: in our testing storage, we forbid write-write races, but allow write-read races. Which ... seems fine? At least in this case.

- it might be the case that `writing` bitset is not set, but there's a write for the slot in the queue: replies can wait on read to complete. - when a faulty read completes, it might clobber faulty bit, unset by a a write which was scheduled after the read. The specific series of events here: 1. Replica receives a RequestReply and starts a reply read 2. The read completes with a failure, replica sets the faulty bit 3. Replica receives RequestReply starts a reply read 4. Replica receives Reply and starts a reply write - the write unsets faulty bit - the write doesn't start, because there's a read executing 4. The read completes, setting the faulty bit _again_ 5. Replica receives RequestReply - It _doesn't_ start reply read, because there's an in-progress write that can resolve a read. - But the faulty bit is set, tripping up an assertion. The root issue here is the race between a read and a write for the same reply. Remove the race by explicitly handling the interleaving: * When submitting a read, resolve it immediately if there's a pending write (this was already handled by `read_reply_sync`) * When submitting a write, resolve any pending reads for the same reply. * Remove the code to block the write while the read is in-progress, as this is no longer possible. Note that it is still possible that a read and a write for the same slot race, if they target different replies. In this case, there won't be clobbering, as, when the read completes, we double-check freshness by consulting `client_sessions`. SEED: 2517747396662708227 Closes: #1511

matklad force-pushed the matklad/raced2 branch from 5bdd222 to 5f1a140 Compare February 5, 2024 12:28

matklad force-pushed the matklad/raced2 branch from 5f1a140 to 0ba9044 Compare February 5, 2024 12:31

matklad assigned sentientwaffle Feb 5, 2024

sentientwaffle previously approved these changes Feb 5, 2024

View reviewed changes

matklad dismissed sentientwaffle’s stale review via f464c4e February 5, 2024 22:30

matklad force-pushed the matklad/raced2 branch 2 times, most recently from f464c4e to 9db2919 Compare February 5, 2024 22:33

matklad force-pushed the matklad/raced2 branch from 9db2919 to 4db067f Compare February 5, 2024 22:36

matklad enabled auto-merge February 5, 2024 22:49

sentientwaffle approved these changes Feb 5, 2024

View reviewed changes

matklad added this pull request to the merge queue Feb 5, 2024

Merged via the queue into main with commit d1ce66c Feb 5, 2024
25 checks passed

matklad deleted the matklad/raced2 branch February 5, 2024 23:01

matklad mentioned this pull request Feb 6, 2024

vsr: fix races in client_replies #1513

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vsr: fix races in client_replies #1515

vsr: fix races in client_replies #1515

matklad commented Feb 5, 2024

matklad commented Feb 5, 2024

sentientwaffle Feb 5, 2024

matklad Feb 5, 2024

sentientwaffle Feb 5, 2024

matklad Feb 5, 2024

vsr: fix races in client_replies #1515

vsr: fix races in client_replies #1515

Conversation

matklad commented Feb 5, 2024

matklad commented Feb 5, 2024

sentientwaffle Feb 5, 2024

Choose a reason for hiding this comment

matklad Feb 5, 2024

Choose a reason for hiding this comment

sentientwaffle Feb 5, 2024

Choose a reason for hiding this comment

matklad Feb 5, 2024

Choose a reason for hiding this comment