
Proper handling of incompatible zedtokens #1723

Open
wants to merge 3 commits into main from incompatible-zedtokens
Conversation

josephschorr
Member

@josephschorr josephschorr commented Jan 29, 2024

NOTE: ZedTokens are a bit longer now as a result of this change, but should still be well within the previously defined 1024-character limit

Fixes #1541

@github-actions github-actions bot added area/api v1 Affects the v1 API area/CLI Affects the command line area/datastore Affects the storage system area/tooling Affects the dev or user toolchain (e.g. tests, ci, build tools) labels Jan 29, 2024
@josephschorr josephschorr force-pushed the incompatible-zedtokens branch 3 times, most recently from d3b259f to 64dda15 on January 29, 2024 22:30
@josephschorr josephschorr marked this pull request as ready for review January 29, 2024 22:38
@josephschorr josephschorr requested review from vroldanbet and a team as code owners January 29, 2024 22:38
@josephschorr
Member Author

Rebased

@josephschorr josephschorr force-pushed the incompatible-zedtokens branch 2 times, most recently from 52895fe to 9bb170d on March 13, 2024 21:43
@josephschorr
Member Author

Rebased

@josephschorr josephschorr force-pushed the incompatible-zedtokens branch 2 times, most recently from ba65181 to 160e514 on April 22, 2024 20:20
@josephschorr
Member Author

Updated

Contributor

@vroldanbet vroldanbet left a comment

Some early feedback; I'll continue tomorrow. Please describe in the PR body what problems you are trying to solve and the design choices you made to arrive at this solution. It helps folks reviewing the PR to have the right context 🙏🏻

}

cds.uniqueID.Store(&uniqueID)
Contributor

I don't think this matters too much here because the value converges, but this can race. Ideally we do a CAS operation.

I'd prefer to see this implemented with sync.Once instead of an atomic pointer. The ID does not change for the lifetime of the server.

Member Author

sync.Once won't work because if we get an error, we need to rerun this code the next time it's called.

Contributor

@vroldanbet vroldanbet Jul 10, 2024

Fair. Can we at least use a CAS operation to store it? If it fails, it probably means some other goroutine already set it.

Another point in favor of the atomic value is that it does not rely on a mutex. sync.Once relies on one while it's not yet initialized, but once it is, the overhead of the mutex disappears.

Member Author

Why even do that, though? The ID is guaranteed by the datastore to be the same, so there's no need to compare.

Comment on lines +546 to +549
// UniqueID returns a unique identifier for the datastore. This identifier
// must be stable across restarts of the datastore if the datastore is
// persistent.
UniqueID(context.Context) (string, error)
Contributor

Did you consider alternatives to using the UniqueID as a stable identifier for zedtokens?

I assume (it's not present in the PR body) that the goal is making sure zedtokens from one datastore type are not used in another datastore type. But what happens if you evolve the zedtoken implementation from one version to another for the same datastore and want to force existing tokens to be dropped? The uniqueID wouldn't help, would it?
Shouldn't we store a zedtoken versioning parameter like datastoreType+zedtokenVersion?

Member Author

@josephschorr josephschorr Jul 9, 2024

Yes, but then it doesn't meet the goal, which is to prevent zedtokens from being reused not just across datastore types but across instances of the datastore as well.

@josephschorr
Member Author

> Some early feedback; I'll continue tomorrow. Please describe in the PR body what problems you are trying to solve and the design choices you made to arrive at this solution. It helps folks reviewing the PR to have the right context 🙏🏻

I added the fixes; it was on the commit but not the PR

Contributor

@vroldanbet vroldanbet left a comment

We are transferring UUIDs in every API call for no reason other than to prevent an eventual migration within the same datastore type. That's extremely wasteful.

It's still within the zedtoken spec limit, but it is a very high price to pay for an incredibly rare event (a legitimate one, don't get me wrong). We are storing UUIDs in the datastore as strings, which is the least compact way of storing them and transferring them over the wire. And we are forcing everybody to see their data transfer increase significantly (and latency!) to prevent a scenario that the system will not be subjected to in, I'll dare to say, practically 100% of its lifespan.

We need to rethink this. I get that the zedtoken is a stateless token and that it needs to self-contain this information, but I think we can get 99% of the way there by encoding the datastore type as a small attribute in the token. I understand migrating between different instances of the same datastore type can be problematic, but no one asked for this, and we could find alternative ways to handle it (e.g. a flag that forces SpiceDB to ignore the requested consistency and fall back to full_consistency all the time, or minimize_latency if the customer can take the hit of ignoring the new-enemy problem during the migration). I'd argue that by adding such a flag we could avoid having to add the datastore type entirely.

@@ -55,27 +56,47 @@ func RevisionFromContext(ctx context.Context) (datastore.Revision, *v1.ZedToken,
	handle := c.(*revisionHandle)
	rev := handle.revision
	if rev != nil {
-		return rev, zedtoken.MustNewFromRevision(rev), nil
+		ds := datastoremw.FromContext(ctx)
Contributor

This middleware already had a dependency on the datastore middleware, but we never declared it via the middleware framework. Please update the middleware chain to state that this middleware has such a dependency.

Member Author

Are you asking me to add a comment stating that, or to add something somewhere? (sorry for forgetting)

	// Calculate a revision as we see fit
	databaseRev, err := ds.OptimizedRevision(ctx)
	if err != nil {
		return datastore.NoRevision, false, err
	}

	if requested != nil {
-		requestedRev, err := zedtoken.DecodeRevision(requested, ds)
+		requestedRev, status, err := zedtoken.DecodeRevision(requested, ds)
Contributor

It does not feel great to return a new status return value when this is something that could be surfaced via an error condition the call site can check for. I understand the benefit of the added return value, though: it forces the call site to make a conscious choice about whether to handle it.

Member Author

It's explicitly not an error, though; it only becomes an error if that is the setting on the CLI. I also like that callers must handle it, as you said.

if status == zedtoken.StatusMismatchedDatastoreID {
switch option {
case TreatMismatchingTokensAsFullConsistency:
log.Warn().Str("zedtoken", requested.Token).Msg("ZedToken specified references an older datastore and SpiceDB is configured to treat this as a full consistency request")
Contributor

I don't think it's older, just a different instance.

Suggested change
log.Warn().Str("zedtoken", requested.Token).Msg("ZedToken specified references an older datastore and SpiceDB is configured to treat this as a full consistency request")
log.Warn().Str("zedtoken", requested.Token).Msg("ZedToken specified references a different datastore instance and SpiceDB is configured to treat this as a full consistency request")

return headRev, false, nil

case TreatMismatchingTokensAsMinLatency:
log.Warn().Str("zedtoken", requested.Token).Msg("ZedToken specified references an older datastore and SpiceDB is configured to treat this as a min latency request")
Contributor

Suggested change
log.Warn().Str("zedtoken", requested.Token).Msg("ZedToken specified references an older datastore and SpiceDB is configured to treat this as a min latency request")
log.Warn().Str("zedtoken", requested.Token).Msg("ZedToken specified references a different datastore instance and SpiceDB is configured to treat this as a min latency request")

return databaseRev, false, nil

case TreatMismatchingTokensAsError:
log.Error().Str("zedtoken", requested.Token).Msg("ZedToken specified references an older datastore and SpiceDB is configured to raise an error in this scenario")
Contributor

I don't think we need to log in this case, since it will be surfaced via the gRPC error anyway.

Suggested change
log.Error().Str("zedtoken", requested.Token).Msg("ZedToken specified references an older datastore and SpiceDB is configured to raise an error in this scenario")

if err != nil {
return errInvalidZedToken
}

if status == zedtoken.StatusMismatchedDatastoreID {
log.Error().Str("zedtoken", consistency.GetAtExactSnapshot().Token).Msg("ZedToken specified references an older datastore but at-exact-snapshot was requested")
Contributor

I don't think we need to log this; it will surface via the gRPC error in the logs.

Suggested change
log.Error().Str("zedtoken", consistency.GetAtExactSnapshot().Token).Msg("ZedToken specified references an older datastore but at-exact-snapshot was requested")

Member Author

Changed to a warning

Comment on lines 79 to 94
TreatMismatchingTokensAsFullConsistency MismatchingTokenOption = iota

TreatMismatchingTokensAsMinLatency

TreatMismatchingTokensAsError
Contributor

please document

return &v1.WriteRelationshipsResponse{
WrittenAt: zedtoken.MustNewFromRevision(revision),
Contributor

Hmm, I'm realizing we use MustNewFromRevision everywhere, even though it panics in the case of a malformed proto encoding. I guess it worked because the middleware prevented any incorrect revision from making it through.

Member Author

Yeah, we could move away from the Must approach, but it shouldn't fail in "normal" operation

@@ -97,9 +97,14 @@ func (ss *schemaServer) ReadSchema(ctx context.Context, _ *v1.ReadSchemaRequest)
DispatchCount: dispatchCount,
})

zedToken, err := zedtoken.NewFromRevision(ctx, headRevision, ds)
Contributor

At this point it feels like NewFromRevision should be a method on the Datastore interface, but I get the convenience of not relying on the datastore, especially for tests.

Member Author

Yes and no; it uses the datastore's unique ID, but it's also not strictly tied to the datastore, so I prefer this approach.

if err != nil {
return status.Errorf(codes.InvalidArgument, "failed to decode start revision: %s", err)
}

if tokenStatus == zedtoken.StatusMismatchedDatastoreID {
Contributor

I think context is missing on why we are choosing to fail here. I would recommend adding a comment to clarify, since the implications are not immediately clear.

I'm trying to think through what type of scenario we are protecting against by failing here, but also what potential use cases we could be hampering by doing so.

  • Clearly a zedtoken for a different datastore type should fail, as those can fail in unexpected ways, even though we could potentially find ways to make some of them interoperate.
  • Same datastore type, but different instances: we can't guarantee that different instances observe the same transactions at the same timestamp.

In the case of Watch, reusing a zedtoken across different instances could lead to missed events.

The only scenario I can think of that could be impacted by this is dual-writing to a new SpiceDB instance and doing an eventual cut-over (e.g. a deep schema refactor). API calls with at_least_as_fresh can recover by momentarily switching to minimize_latency until the tokens are refreshed (although this could take very long, as it depends on resource changes! the system could be running in minimize for a long time). A cut-over to the Watch API will simply not work, regardless of the changes in this PR, as it can only work if both instances represent the same state of the world.

Long story short, I think this is the right call, and I wanted to think out loud and leave a trace of the why.

Member Author

Added a comment

@josephschorr
Member Author

> We are transferring UUIDs in every API call for no reason other than to prevent an eventual migration within the same datastore type. That's extremely wasteful.

Is it really? We could reduce the size of the ID to just a small prefix if we wish to save some bytes.

> It's still within the zedtoken spec limit, but it is a very high price to pay for an incredibly rare event (a legitimate one, don't get me wrong). We are storing UUIDs in the datastore as strings, which is the least compact way of storing them and transferring them over the wire. And we are forcing everybody to see their data transfer increase significantly (and latency!) to prevent a scenario that the system will not be subjected to in, I'll dare to say, practically 100% of its lifespan.

Except users have already been subject to it: this change was spurred by two different reports, one where a user moved between datastore types, and another where they moved between datastores of the same type (accidentally).

> We need to rethink this. I get that the zedtoken is a stateless token and that it needs to self-contain this information, but I think we can get 99% of the way there by encoding the datastore type as a small attribute in the token. I understand migrating between different instances of the same datastore type can be problematic, but no one asked for this,

It has happened before, sadly.

> and we could find alternative ways to handle it (e.g. a flag that forces SpiceDB to ignore the requested consistency and fall back to full_consistency all the time, or minimize_latency if the customer can take the hit of ignoring the new-enemy problem during the migration). I'd argue that by adding such a flag we could avoid having to add the datastore type entirely.

This PR does add that flag, but we also need a means of ensuring that zedtokens are not used across instances of a datastore. The fact that we allow it now is, technically speaking, a small but real risk.

@josephschorr
Member Author

Updated to store only an 8-byte prefix of the datastore ID. Since datastores are rarely replaced, this should be fine for detecting changes.

an older datastore is used

All ZedTokens are now minted with the datastore's unique ID included
in the ZedToken and that ID is checked when the ZedToken is decoded.

In scenarios where the datastore ID does not match, either an error is
raised (watch, at_exact_snapshot) or configurable behavior is used
(at_least_as_fresh)

Fixes authzed#1541
…token

Results in smaller tokens, but given that datastore IDs are randomly generated, this should still minimize the chances of a conflict
Development

Successfully merging this pull request may close these issues.

Improvement: Prevent ZedToken's from being used cross-datastore