
currentTimeUUID creates duplicates when called at the same point in time #6208

Closed
haaawk opened this issue Apr 16, 2020 · 11 comments
Labels
area/internals

Comments

@haaawk
Contributor

haaawk commented Apr 16, 2020

The issue was reported on StackOverflow here

Internally, currentTimeUUID calls get_time_UUID, which builds a UUID from the current time and a clock_seq_and_node constant that is generated once per shard.

    static UUID get_time_UUID()
    {
        auto uuid = UUID(instance->create_time_safe(), clock_seq_and_node);
        assert(uuid.is_timestamp());
        return uuid;
    }
    const thread_local int64_t UUID_gen::clock_seq_and_node = make_clock_seq_and_node();
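
To make the failure mode concrete, here is a minimal, self-contained sketch (a simplified illustration, not Scylla code) of why a UUID built only from a millisecond timestamp plus a per-shard constant can collide when two coordinators hit the same millisecond with the same node value:

    // Hypothetical illustration, not Scylla code: if two coordinators happen to
    // share the same clock_seq_and_node value and compute a UUID in the same
    // millisecond, the resulting timeuuids are byte-for-byte identical.
    #include <cassert>
    #include <cstdint>
    #include <utility>

    using toy_uuid = std::pair<int64_t, int64_t>; // {timestamp msb, clock_seq_and_node lsb}

    toy_uuid make_time_uuid(int64_t timestamp_msb, int64_t clock_seq_and_node) {
        return {timestamp_msb, clock_seq_and_node};
    }

    int main() {
        const int64_t same_millis = 0x1ebc0de000000000; // same wall-clock millisecond on two nodes
        const int64_t same_node = 1;                    // node field derived only from the shard id
        assert(make_time_uuid(same_millis, same_node) == make_time_uuid(same_millis, same_node));
    }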
@haaawk
Contributor Author

haaawk commented Apr 16, 2020

This issue will affect CDC as well, because CDC generates TimeUUIDs for the clustering keys in the CDC log.

@haaawk haaawk self-assigned this Apr 16, 2020
@haaawk haaawk added this to the 4.1 milestone Apr 16, 2020
@haaawk
Contributor Author

haaawk commented Apr 16, 2020

Fix is here -> #6209

@tgrabiec
Contributor

It's not clear to me what the supposed bug is about. Please clarify in the description.

@haaawk
Contributor Author

haaawk commented Apr 16, 2020

https://stackoverflow.com/questions/61223391/how-to-guarantee-monotonically-increasing-timeuuid-when-selecting-from-scylla/61243270

The user used currentTimeUUID and got 20-40 duplicates per partition.

@tgrabiec
Contributor

I am confused as to what he means by that.

He says:

I tried using currentTimeUUID function and it seems to work (monotonically increasing within the same partition key)

which suggests that currentTimeUUID() doesn't generate duplicates.

Also, createdAt is a clustering key, so queries cannot return duplicates in that column.

@slivne slivne modified the milestones: 4.1, 4.2 May 31, 2020
@haaawk haaawk removed the area/cdc label Jun 1, 2020
@slivne slivne added the area/internals label Jun 1, 2020
@slivne slivne modified the milestones: 4.2, 4.x Jun 1, 2020
@nyh
Contributor

nyh commented Jul 20, 2020

I think you are misunderstanding the problem (or I am misunderstanding you...). The function you quoted,

    static UUID get_time_UUID()
    {
        auto uuid = UUID(instance->create_time_safe(), clock_seq_and_node);
        assert(uuid.is_timestamp());
        return uuid;
    }

doesn't need clock_seq_and_node to change to avoid returning the same id twice... It uses create_time_safe() which ensures that the same uuid is not generated even if called in the same millisecond on the same shard (unfortunately we use millisecond resolution, which sucks for other reasons, but wouldn't break uniqueness). This is the reason for the "safe()" in its name.

So I don't think that the C++ function get_time_UUID() actually has the problem that you think it has.

Another possibility is that our code doesn't call it but calls one of its variants which takes a time argument? Those variants indeed appear broken, because they don't check for ties. Those broken variants should be outright deleted - do you know why they even exist?

I propose that, before you do anything on this issue, you try to write a test which reproduces it (a unit test of the C++ functions, or a dtest or pytest to see the end-to-end problem in CQL), and then you can confirm you understand what the real problem is.
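
For reference, one common way a per-shard "safe" time source of the kind described above avoids handing out the same tick twice is to remember the last value it returned and bump it on a tie. The following is a sketch of that general technique, assumed for illustration; it is not a quote of create_time_safe():

    // General "safe timestamp" idea (assumption for illustration, not the actual
    // create_time_safe() implementation): never return the same tick twice on a
    // shard, even if the wall clock has not advanced since the previous call.
    #include <algorithm>
    #include <chrono>
    #include <cstdint>

    class safe_timestamp_source {
        int64_t _last = 0; // last value handed out on this shard/thread
    public:
        int64_t next() {
            int64_t now = std::chrono::duration_cast<std::chrono::nanoseconds>(
                std::chrono::system_clock::now().time_since_epoch()).count() / 100; // 100ns ticks
            _last = std::max(now, _last + 1); // on a tie (or clock step back), advance by one tick
            return _last;
        }
    };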

@nyh
Contributor

nyh commented Nov 23, 2020

@kostja observed, if I understand correctly, that the error causing the problem reported above isn't in the details mentioned above. Rather, it is in make_node() in utils/UUID_gen.cc.
This function's result should be different on different nodes - e.g., based on a MAC address - but we take the same number on all nodes (just the shard number), and the result is that two nodes that calculate a timestamp at around the same time produce the same UUID.
The code even has a FIXME that this needs to be fixed :-(

@kostja
Contributor

kostja commented Nov 27, 2020

Apart from filling in the host id, the patch should make sure the timeuuid compare respects the host id.

@nyh
Contributor

nyh commented Nov 29, 2020

Apart from filling in the host id, the patch should make sure the timeuuid compare respects the host id.

Do you suspect that compare_visitor (the timeuuid_type_impl& case) in types.cc ignores some parts of the timeuuid? Or by "make sure" do you just mean we need to check and verify that this is the case?

@kostja
Contributor

kostja commented Nov 29, 2020

The logic of the compare function confused me. Apart from being very inefficient, it falls back to a signed compare of the entire UUID if timeuuid_compare_bytes() returns 0. I didn't notice that at first.
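
The comparison order being discussed can be sketched as follows; this is an illustration of the intended ordering (timestamp first, then the remaining bytes, including the node/host id, as a tiebreaker), not the code in types.cc:

    // Illustrative sketch, not the types.cc implementation: compare by the
    // embedded timestamp first, then let the clock-seq-and-node half (which
    // includes the host id) break ties, so equal timestamps from different
    // nodes still order deterministically and distinctly.
    #include <cstdint>

    struct timeuuid_view {
        uint64_t timestamp;      // 60-bit time value extracted from the UUID
        uint64_t clock_and_node; // clock sequence + node/host id half of the UUID
    };

    int compare_timeuuid(const timeuuid_view& a, const timeuuid_view& b) {
        if (a.timestamp != b.timestamp) {
            return a.timestamp < b.timestamp ? -1 : 1;
        }
        if (a.clock_and_node != b.clock_and_node) {
            return a.clock_and_node < b.clock_and_node ? -1 : 1;
        }
        return 0;
    }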

nyh pushed a commit that referenced this issue Jan 20, 2021
Before this patch, the UUID generation code was not creating
sufficiently unique IDs: the 6-byte node identifier was mostly
empty, i.e. it only contained the shard id. This could lead to
collisions between queries executed concurrently at different
coordinators and, since timeuuid is used as the key in list append
and prepend operations, to lost updates.

To generate a unique node id, the patch uses a combination of the
hardware MAC address (or a random number if no hardware address is
available) and the current shard id.

The shard id is mixed into the higher bits of the MAC to reduce the
chance of a NIC collision within the same network.

With sufficiently unique timeuuids as list cell keys, such updates
are no longer lost, but a multi-value update can still be "merged"
with another multi-value update.

E.g. if node A executes SET l = l + [4, 5] and node B executes
SET l = l + [6, 7], the list value could be any of [4, 5, 6, 7],
[4, 6, 5, 7], [6, 4, 5, 7] and so on.

At least we are now less likely to get any value lost.

Fixes #6208.

@todo: initialize UUID subsystem explicitly in main()
and switch to using seastar::engine().net().network_interfaces()

test: unit (dev)
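
A rough sketch of the node-id construction the commit message describes; the function name and exact bit layout here are assumptions for illustration, not the actual patch:

    // Illustration of "shard id mixed into higher bits of MAC" (names and bit
    // positions are assumed, not taken from the Scylla code): pack the 48-bit
    // MAC (or a random 48-bit value when no NIC is available) and fold the
    // shard id into its higher bits.
    #include <array>
    #include <cstdint>

    uint64_t make_node_id(const std::array<uint8_t, 6>& mac, uint16_t shard_id) {
        uint64_t node = 0;
        for (auto b : mac) {
            node = (node << 8) | b;                    // pack the 48-bit MAC
        }
        node ^= static_cast<uint64_t>(shard_id) << 40; // fold the shard id into the high bits
        return node & 0xFFFFFFFFFFFFULL;               // keep 48 bits for the UUID node field
    }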
syuu1228 pushed a commit to syuu1228/scylla that referenced this issue Jan 25, 2021
@DoronArazii DoronArazii modified the milestones: 5.x, 5.0 Jul 14, 2022
@avikivity
Member

Fix present on all active branches, not backporting.
