Consider replacing vector clocks with naive timestamp-based conflict resolution #2784

timmaxw · 2014-07-31T01:30:12Z

In #2663, we decided that the new ReQL administrative API should handle vector clock conflicts as follows: Reading from a conflicted value produces an error. Writing to a conflicted value resolves the conflict.

However, I'm not sure that this is actually the best solution. The problem is that we have no good way to return a document that is partially in error. Imagine this situation: A user accidentally causes a vector clock conflict on a table's database field. Now they can't access the table's metadata at all; they get an error telling them to "overwrite the document". But maybe they've forgotten what they had the config set to. Even though the valid config is still stored on the server, the user can't access it; they have to reconstruct it. The user experience is similarly bad if they do a range scan over the rethinkdb.table_config artificial table. The table with the metadata conflict will not appear; instead, they will get a message saying that there was an error.

We should consider replacing vector clocks with a structure consisting of (timestamp, uuid, value), where timestamp is a time_t. The semilattice join is defined by comparing first by timestamp, then by UUID. When the user writes to a field, the server will set timestamp to the larger of the server's current time() and the old timestamp plus one, and it will set uuid to its machine ID. (Or peer ID. Or a newly generated UUID. It doesn't matter.)
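To make the proposal concrete, here is a minimal Python sketch of the structure, the join, and the write rule. It is not the RethinkDB implementation. The names `Versioned`, `join`, and `write` are hypothetical, and the `now` parameter exists only so the example can be run with a fixed clock.

```python
import time
from dataclasses import dataclass
from typing import Any, Optional
from uuid import UUID


@dataclass(frozen=True)
class Versioned:
    """One metadata field stored as (timestamp, uuid, value). Hypothetical name."""
    timestamp: int  # whole seconds since the epoch, like time_t
    uuid: UUID      # tie-breaker, e.g. the ID of the server that wrote the value
    value: Any


def join(a: Versioned, b: Versioned) -> Versioned:
    """Semilattice join: compare by timestamp first, then by UUID."""
    return a if (a.timestamp, a.uuid) >= (b.timestamp, b.uuid) else b


def write(old: Versioned, new_value: Any, server_id: UUID,
          now: Optional[int] = None) -> Versioned:
    """Write rule: the new timestamp is the larger of the server's current
    time and the old timestamp plus one, so a write always supersedes the
    value it replaces even if the local clock is behind."""
    if now is None:
        now = int(time.time())
    return Versioned(max(now, old.timestamp + 1), server_id, new_value)
```

Because the new timestamp is always strictly greater than the old one, `join(old, write(old, ...))` returns the newly written value regardless of what the server's clock says.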

Normally, this works transparently, just like vector clocks. If the user issues two updates almost simultaneously, one will be chosen arbitrarily. If the user writes the same field on both sides of a netsplit, whichever write happens later (by wall-clock time) will be chosen, as long as the servers' clocks are sane. If the servers' clocks are insane, then everything still works properly, except that if the user writes the same field on both sides of a netsplit the winner will be arbitrary. The user never sees a conflict; the system always picks a value for the field.
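The netsplit cases can be illustrated with a small Python sketch. It assumes entries are plain `(timestamp, uuid, value)` tuples and uses made-up server IDs and timestamps. The point is only that the join gives the same answer on both sides no matter which order the entries arrive in.

```python
from uuid import UUID

# Hypothetical server IDs, fixed so the example is deterministic.
SERVER_A = UUID(int=1)
SERVER_B = UUID(int=2)


def join(a, b):
    """Keep the entry with the larger (timestamp, uuid) key."""
    return max(a, b, key=lambda entry: (entry[0], entry[1]))


# Sane clocks: B writes five seconds after A, so B's write wins on both sides.
a_write = (1000, SERVER_A, "db_from_a")
b_write = (1005, SERVER_B, "db_from_b")
sane_winner = join(a_write, b_write)

# Insane clocks: B's clock runs an hour slow, so its later write loses.
# The result no longer reflects wall-clock order, but both sides still agree.
b_skewed = (1005 - 3600, SERVER_B, "db_from_b")
skewed_winner = join(a_write, b_skewed)

# Identical timestamps: the UUID breaks the tie.
b_tied = (1000, SERVER_B, "db_from_b")
tie_winner = join(a_write, b_tied)
```

In all three cases the servers converge on a single value once the netsplit heals, which is why the user never sees a conflict.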

This issue is probably not important.

timmaxw · 2014-07-31T01:40:40Z

Another benefit is that we wouldn't have to think about conflict states in all the code that reads from the vector clocks.

mlucy · 2014-07-31T01:43:59Z

> However, I'm not sure that this is actually the best solution. The problem is that we have no good way to return a document that is partially in error.

We could introduce a pseudotype for vector clock conflicts. (I thought that's what the plan was.) You can read this pseudotype to get the values in conflict, but if you try to use it in a ReQL expression it produces an error.
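As a rough illustration of the idea (not ReQL's actual pseudotype machinery), a Python analogue might look like the sketch below. `Conflict` and `ConflictError` are hypothetical names, and only a few operators are covered.

```python
class ConflictError(Exception):
    """Raised when a conflicted value is used inside an expression."""


class Conflict:
    """Hypothetical stand-in for a vector clock conflict pseudotype."""

    def __init__(self, values):
        # The competing values stay readable so the user can inspect them.
        self.values = list(values)

    def _refuse(self, *_args):
        raise ConflictError(
            "value is in conflict; candidates: %r" % (self.values,))

    # Any attempt to compute with the conflicted value produces an error.
    __add__ = __radd__ = __lt__ = __gt__ = __getitem__ = _refuse


# The rest of the document stays readable; only the conflicted field errors.
row = {"name": "users", "db": Conflict(["db_a", "db_b"])}
```

Reading `row["db"].values` returns both candidates, while an expression such as `row["db"] + "x"` raises `ConflictError`.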

timmaxw · 2014-07-31T01:59:37Z

That would work. But I think that the users' lives would be much simpler if we got rid of vector clocks completely.

timmaxw · 2014-07-31T02:02:30Z

Well, "much simpler" is the wrong phrase. A typical user will rarely encounter vector clock conflicts. But I think they will be quite inconvenient when they do occur; the user will probably see them as a hassle. This will be especially true with the new ReQL admin API, because we'll encourage people to use scripts to configure their cluster. Vector clock conflicts matter when two humans make different changes at the same time, and human admins rarely create one. With automated configuration tools, though, the risk of a coincidental conflict is higher, and the probability that the conflict is actually worth bothering the user about is lower.

timmaxw · 2014-08-01T00:09:38Z

Another problem with vector clocks is that there's no good way to handle a vector clock conflict on the name field of a table when we are using the table's name as the primary key for rethinkdb.table_config.

timmaxw · 2014-08-01T23:41:47Z

On further thought, the proposal to have the system raise an error if the user tries to read a vector clock conflict is hard to implement. Currently, we implement writes as a function that maps the old value to the new value; we would have to distinguish between writes that use the old value and writes that don't. It seems to me that naive timestamp-based resolution is significantly easier to implement than either of the vector-clock solutions.

timmaxw · 2014-08-16T00:42:11Z

This proposal has been approved. We'll implement it as part of the ReQL admin changes.

mlucy · 2014-08-16T01:36:48Z

I'm a little bit scared of this, but I guess it's probably fine.

coffeemug · 2014-08-18T18:45:05Z

I think it's fine for a few reasons:

- People take advantage of this functionality (changing values on two sides of a netsplit) extremely rarely, if ever, and it's usually by accident.
- If a metadata conflict does happen, empirically people are very surprised and have no idea how to fix it. They're also frustrated that they have to deal with the issue.
- It's temporary. Eventually we'll replace this with a consensus algorithm, so conflicts can't happen at all.

I think timestamp-based resolution would result in a dramatically better user experience when the edge cases do happen. It's probably not ideal for large deployments, but for the moment there are much bigger issues in those scenarios anyway. By the time we fix those, we'll probably also add a consensus algorithm so this problem will go away.

timmaxw · 2014-08-21T05:46:37Z

Implementation is in CR 1994.

timmaxw · 2014-08-25T20:44:50Z

Merged into reql_admin in 284e341.

timmaxw added this to the reql-admin milestone Jul 31, 2014

timmaxw mentioned this issue Aug 1, 2014

Figure out how to handle non-atomicity in semilattice updates through artificial tables #2792

Closed

timmaxw mentioned this issue Aug 11, 2014

Table name collision resolution #2860

Closed

timmaxw closed this as completed Aug 25, 2014

danielmewes modified the milestones: reql-admin, 1.16 Jan 2, 2015
