storage: Splits get write locks on the entire span #14992

bdarnell · 2017-04-17T18:18:44Z

This was meant to go in as a part of #14833.
De-flakes TestUnsplittableRange.

tamird · 2017-04-17T18:20:38Z

Reviewed 1 of 1 files at r1.
Review status: all files reviewed at latest revision, all discussions resolved, some commit checks pending.

tamird · 2017-04-17T18:20:47Z

Might be good to test more directly, though.

Review status: all files reviewed at latest revision, all discussions resolved, some commit checks pending.

a-robinson

I can confirm that this removes the flake.

This was meant to go in as a part of cockroachdb#14833. De-flakes TestUnsplittableRange. Fixes cockroachdb#14881

bdarnell · 2017-04-17T19:01:26Z

Added a unit test.

tamird · 2017-04-17T19:04:57Z

I was thinking we'd test it at a higher level, specifically inducing the troublesome race. I'm OK with this, too.

Reviewed 1 of 1 files at r2.
Review status: all files reviewed at latest revision, all discussions resolved, some commit checks pending.

bdarnell · 2017-04-17T19:16:12Z

A higher-level test for this would be tricky to write, because we don't have any hooks in place to detect when the Put has been blocked by the command queue.

nvanbenschoten · 2017-12-09T17:26:51Z

@bdarnell I was thinking about this in relation to #16075 (comment) and it's pretty unfortunate that splits need to declare write locks on their left and right-hand sides. Naively I would expect that a read lock would be sufficient to block all writes on the range that may concurrently mess with the stats, but I assume we're running into issues with #14342 (writes in the future affecting the stats). If that is, in fact, the issue, we might consider adjusting the perceived timestamp of the split in the eyes of the CommandQueue so that even a read lock will block all concurrent writes while still permitting concurrent reads.

bdarnell · 2017-12-11T01:52:27Z

You mean putting the split in the command queue as a read at the maximum possible timestamp, but still using its real timestamp for the timestamp cache when the operation completes? I think that would work, although it feels hacky. In that case, reads to the LHS would be uninterrupted, and reads to the RHS would be allowed except for the ones in flight as the split occurs (since they would no longer be addressed to the correct range).
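
To make the idea concrete, here is a minimal sketch of a timestamp-aware conflict check. It is not the actual CommandQueue code: the `Timestamp`, `Access`, and `Conflicts` names are hypothetical, and the rule that a write above a read's timestamp does not conflict with that read is an assumption based on the description of #14342 in this thread. It shows why a read lock at the split's real timestamp lets future writes through, while a read at the maximum timestamp blocks every write and still admits concurrent reads.

```go
package main

import (
	"fmt"
	"math"
)

// Timestamp is a simplified stand-in for an MVCC timestamp (hypothetical type).
type Timestamp int64

// MaxTimestamp models "the maximum possible timestamp" from the discussion.
const MaxTimestamp Timestamp = math.MaxInt64

// Access is one declared access over some span in a toy command queue.
// Spans are omitted: both accesses are assumed to overlap.
type Access struct {
	ReadOnly bool
	TS       Timestamp
}

// Conflicts reports whether two overlapping accesses must be serialized.
// Assumed rule: reads never conflict with reads, writes always conflict
// with writes, and a read conflicts with a write only if the write is at
// or below the read's timestamp.
func Conflicts(a, b Access) bool {
	if a.ReadOnly && b.ReadOnly {
		return false
	}
	if !a.ReadOnly && !b.ReadOnly {
		return true
	}
	read, write := a, b
	if !a.ReadOnly {
		read, write = b, a
	}
	return write.TS <= read.TS
}

func main() {
	futureWrite := Access{ReadOnly: false, TS: 20}
	otherRead := Access{ReadOnly: true, TS: 5}

	// Split declared as a read at its real timestamp: the future write
	// is not blocked and could change the stats mid-split.
	splitAtRealTS := Access{ReadOnly: true, TS: 10}
	fmt.Println(Conflicts(splitAtRealTS, futureWrite)) // false

	// Split declared as a read at the maximum timestamp: every write
	// is blocked, but concurrent reads still proceed.
	splitAtMaxTS := Access{ReadOnly: true, TS: MaxTimestamp}
	fmt.Println(Conflicts(splitAtMaxTS, futureWrite)) // true
	fmt.Println(Conflicts(splitAtMaxTS, otherRead))   // false
}
```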

nvanbenschoten · 2017-12-11T14:34:51Z

> I think that would work, although it feels hacky.

In one sense, I agree, but it also seems logically consistent to say that the operation on the LHS is a read that needs to depend on all writes and therefore operates at the maximum timestamp. Do we have any notion of this to deal with inline values?

I may also be blowing this out of proportion, but it seems like blocking all reads to a range during splits is serious enough to warrant some special logic.

> and reads to the RHS would be allowed except for the ones in flight as the split occurs (since they would no longer be addressed to the correct range)

How is that handled now? Are reads to the RHS which are waiting in the cmdQueue during the split all rejected with a RangeKeyMismatchError after the split finishes and they're allowed to execute? In that case, we'd probably need to keep the declared keys for the RHS as writes.

bdarnell · 2017-12-11T19:25:57Z

> In one sense, I agree, but it also seems logically consistent to say that the operation on the LHS is a read that needs to depend on all writes and therefore operates at the maximum timestamp.

Reads always depend on all writes in their past; it's weird for a read to depend on writes in its future.

> Do we have any notion of this to deal with inline values?

Currently, we apply the command's timestamp to all of the keys it touches, even if they have inline values.

> How is that handled now? Are reads to the RHS which are waiting in the cmdQueue during the split all rejected with a RangeKeyMismatchError after the split finishes and they're allowed to execute? In that case, we'd probably need to keep the declared keys for the RHS as writes.

Yes, exactly. We might be able to allow the RHS reads through as long as everything is properly synchronized, so that there is a smaller window of disruption. I don't think that's safe today, though.
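
The RHS behavior described above can be sketched with a toy range descriptor. This is only an illustration under stated assumptions: `rangeDesc`, `split`, `read`, and `errRangeKeyMismatch` are hypothetical names, not the storage package's types, and the bounds check at evaluation time stands in for whatever produces the real RangeKeyMismatchError. A read that was queued against the original range and only runs after the split finds its key outside the shrunken LHS and is rejected, so the client has to retry against the new RHS range.

```go
package main

import (
	"errors"
	"fmt"
)

// errRangeKeyMismatch is a stand-in for RangeKeyMismatchError (hypothetical).
var errRangeKeyMismatch = errors.New("range key mismatch")

// rangeDesc is a toy range covering the half-open key span [start, end).
type rangeDesc struct {
	start, end string
}

// contains reports whether key falls inside the range's current bounds.
func (r rangeDesc) contains(key string) bool {
	return key >= r.start && key < r.end
}

// split shrinks r to [start, splitKey) and returns the new RHS range
// covering [splitKey, end). The caller must pass a key inside the range.
func (r *rangeDesc) split(splitKey string) rangeDesc {
	rhs := rangeDesc{start: splitKey, end: r.end}
	r.end = splitKey
	return rhs
}

// read checks the bounds when the command finally executes, which is
// after any split it was waiting behind has completed.
func (r rangeDesc) read(key string) error {
	if !r.contains(key) {
		return errRangeKeyMismatch
	}
	return nil
}

func main() {
	lhs := rangeDesc{start: "a", end: "z"}

	// A read for key "m" is addressed to this range and waits in the
	// queue behind the split.
	queuedKey := "m"

	rhs := lhs.split("k")

	// Once the split finishes, the queued read runs against the LHS,
	// which no longer contains "m".
	fmt.Println(lhs.read(queuedKey)) // range key mismatch
	// Retried against the RHS, the same read succeeds.
	fmt.Println(rhs.read(queuedKey)) // <nil>
	// Reads for keys still on the LHS are unaffected.
	fmt.Println(lhs.read("c")) // <nil>
}
```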

nvanbenschoten · 2018-11-19T23:04:58Z

@bdarnell In the process of reworking the CommandQueue, I noticed that the logic from #14342 is disabled when a command enters the CommandQueue with an empty timestamp. This means that we could simply pass an empty timestamp and avoid the "maximum possible timestamp" hack we were discussing above. Once we do that, we can declare a read span for the entire range and allow concurrent reads to go through (although the locking on the RHS might be tricky). Think it's worth pursuing this?
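
A rough sketch of the empty-timestamp variant follows. As before, this is not the real CommandQueue: `cmd`, `isEmpty`, and `mustWait` are hypothetical names, and treating the zero value as "empty" is an assumption made for illustration. The point is that when either side carries no timestamp, the timestamp-based exemption is skipped, so a read span declared by the split conflicts with every write regardless of the write's timestamp, while reads continue to pass.

```go
package main

import "fmt"

// ts is a simplified timestamp; the zero value means "empty" (assumption).
type ts int64

func (t ts) isEmpty() bool { return t == 0 }

// cmd is one command's declared access to an overlapping span.
type cmd struct {
	name     string
	readOnly bool
	at       ts
}

// mustWait reports whether two overlapping commands have to be ordered.
// The timestamp-based exemption only applies when both commands carry a
// timestamp; an empty timestamp means "not an MVCC operation".
func mustWait(a, b cmd) bool {
	if a.readOnly && b.readOnly {
		return false // reads never block each other
	}
	if !a.readOnly && !b.readOnly {
		return true // writes always block each other
	}
	if a.at.isEmpty() || b.at.isEmpty() {
		return true // exemption disabled: read and write always conflict
	}
	read, write := a, b
	if !a.readOnly {
		read, write = b, a
	}
	// Exemption: a write above the read's timestamp cannot affect it.
	return write.at <= read.at
}

func main() {
	split := cmd{name: "split", readOnly: true} // empty timestamp
	put := cmd{name: "put", readOnly: false, at: 1000}
	get := cmd{name: "get", readOnly: true, at: 5}

	fmt.Println(mustWait(split, put)) // true: all writes wait for the split
	fmt.Println(mustWait(split, get)) // false: reads go through
}
```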

bdarnell · 2018-11-20T03:16:19Z

That sounds plausible. The question is just whether that behavior is something we want to commit to in the command queue - when, aside from this trick, would we ever have both timestamped and untimestamped access to the same keys?

nvanbenschoten · 2018-11-20T15:42:17Z

> The question is just whether that behavior is something we want to commit to in the command queue

It's actually something we're already committed to and I think it makes sense. We already do exactly this for locally-scoped keys. Providing an empty timestamp to the command queue is basically a way of saying "don't treat this as an MVCC operation".

> would we ever have both timestamped and untimestamped access to the same keys?

Currently, I don't believe so. It doesn't seem like a big leap to me though.

bdarnell requested review from spencerkimball, petermattis and a-robinson April 17, 2017 18:18

a-robinson approved these changes Apr 17, 2017

storage: Splits get write locks on the entire span

d138490

This was meant to go in as a part of cockroachdb#14833. De-flakes TestUnsplittableRange. Fixes cockroachdb#14881

bdarnell force-pushed the split-write branch from 81b7abf to d138490 Compare April 17, 2017 18:57

bdarnell merged commit 0ae6353 into cockroachdb:master Apr 17, 2017

bdarnell deleted the split-write branch April 17, 2017 19:16

a-robinson mentioned this pull request May 1, 2017

teamcity: failed tests on master: test (proposer evaluated kv)/TestUnsplittableRange #14960

Closed

nvanbenschoten mentioned this pull request Feb 4, 2018

kv: speed up the split trigger, don't hold latches while computing stats #22348

Closed

bdarnell mentioned this pull request Nov 20, 2018

storage: 4.5 seconds after a cluster creation is a bad time to run your transactions #32495

Closed

nvanbenschoten mentioned this pull request Nov 25, 2018

storage: acquire read latches instead of write latches during range splits #32583

Closed
