Incremental migrations #891

jakedt · 2022-10-07T19:28:48Z

Migration instructions:

spicedb migrate add-xid-columns
Rolling update spicedb with flag --datastore-migration-phase write-both-read-old
spicedb migrate add-xid-constraints - performs a backfill and may take a long time
Rolling update spicedb with flag --datastore-migration-phase write-both-read-new
spicedb migrate drop-id-constraints
Rolling update spicedb without a --datastore-migration-phase flag
spicedb migrate head

williamdclt

If I understand the migration instructions correctly, we'd need to run the migrations manually rather than automatically? Would that mean disabling the k8s post-upgrade hook temporarily?

Suggestion: instead of starting SpiceDB with different flags, should that be different spicedb versions? ie something like:

Upgrade to v1.14.0
- migrations ran by post-upgrade hook (migration 9 and 10 but without the backfilling in 10)
- 1.14.0 always does "write both read old"
Manually run spicedb run-job backfill-xid
- run-job is something I made up. Effectively a migration to be ran manually.
Upgrade to v1.15.0
- Migration 11 ran by post-upgrade hook
Upgrade to v1.16.0
- 1.16.0 always does "write both read new"
- Check that the v1.15.0 migrations have been ran, refuse to start otherwise
Upgrade to v1.17.0
- Run migrations 12/13
- 1.16.0 always does "write new read new" ("old" doesn't exist anymore)

Maybe it can be consolidated to 3-phase rather than 4, I haven't overly thought about it

williamdclt · 2022-10-10T09:36:23Z

internal/datastore/postgres/migrations/zz_migration.0010_backfill_xid_add_indices.go

+	"github.com/rs/zerolog/log"
+)
+
+const batchSize = 1000


That sounds very low :) Need to trade-off between execution speed and large table locks though, there's no right answer I don't think

I'm just going to make it a command line parameter to the migrate command, defaulting to 1000.

williamdclt · 2022-10-10T09:43:16Z

internal/datastore/postgres/migrations/zz_migration.0010_backfill_xid_add_indices.go

+		SET xid = id::text::xid8, snapshot = CONCAT(id, ':', id, ':')::pg_snapshot
+		WHERE id IN (
+			SELECT id FROM relation_tuple_transaction
+			WHERE snapshot IS NULL


This WHERE clause (and the 2 other ones) might get very slow and resource-intensive as there's no index on snapshot.

You could create an index on snapshot/created_xid for the purpose of the migration, then clean it up at the end of the migration (or in a follow-up migration)?

williamdclt · 2022-10-10T09:49:46Z

internal/datastore/postgres/migrations/zz_migration.0010_backfill_xid_add_indices.go

+
+var addXIDIndices = []string{
+	// Replace the indices that are inherent from having a primary key constraint
+	"CREATE UNIQUE INDEX CONCURRENTLY ix_rttx_oldpk ON relation_tuple_transaction (id)",


If I'm understanding correctly this migration isn't ran in a transaction. There's going to be issues if this migration is interrupted while creating indices: it won't be possible to re-run it as PG will complain about indices already existing.

Could just add a IF NOT EXISTS here

jakedt · 2022-10-11T14:28:18Z

@williamdclt Updated, PTAL

…y named pkey

josephschorr · 2022-10-11T20:46:54Z

pkg/migrate/context.go

+
+// MigrationVariable contains constants that can be used as context keys that might
+// be relevant in a number of different migration scenarios.
+type MigrationVariable int


do we want to make these ints vs strings? I have concerns there might be accidental overlap with something we don't inject

Not possible, the equality operator the context uses considers the type:
https://go.dev/play/p/16BfmcC0d9X

josephschorr

LGTM

jakedt requested a review from josephschorr October 7, 2022 19:28

jakedt requested a review from a team as a code owner October 7, 2022 19:28

github-actions bot added area/CLI Affects the command line area/datastore Affects the storage system area/tooling Affects the dev or user toolchain (e.g. tests, ci, build tools) labels Oct 7, 2022

jakedt force-pushed the incremental-migrations branch 3 times, most recently from c4e28fa to 1588cdc Compare October 7, 2022 21:55

williamdclt reviewed Oct 10, 2022

View reviewed changes

jakedt force-pushed the incremental-migrations branch from beddbac to 9aff3ba Compare October 11, 2022 14:27

jakedt added 6 commits October 11, 2022 16:43

datastore/postgres: change the migrations to use incremental backfill

4aa4c70

datastore/postgres: add a migration phase flag

b521828

datastore/postgres: add code to support various migration phases

2e7d7f1

datastore/postgres: test all phases of migration

e9bcd68

datastore/postgres: provide alternate paths for finding the implicitl…

78610c3

…y named pkey

datastore/postgres: pass migration batch size from the command line

ca8c50a

jakedt force-pushed the incremental-migrations branch from 9aff3ba to ca8c50a Compare October 11, 2022 20:45

josephschorr reviewed Oct 11, 2022

View reviewed changes

josephschorr approved these changes Oct 11, 2022

View reviewed changes

jakedt merged commit 53c75a2 into main Oct 11, 2022

jakedt deleted the incremental-migrations branch October 11, 2022 20:59

github-actions bot locked and limited conversation to collaborators Oct 11, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incremental migrations #891

Incremental migrations #891

jakedt commented Oct 7, 2022 •

edited

Loading

williamdclt left a comment

williamdclt Oct 10, 2022

jakedt Oct 11, 2022

williamdclt Oct 10, 2022

williamdclt Oct 10, 2022

jakedt commented Oct 11, 2022

josephschorr Oct 11, 2022

jakedt Oct 11, 2022

josephschorr left a comment

Incremental migrations #891

Incremental migrations #891

Conversation

jakedt commented Oct 7, 2022 • edited Loading

williamdclt left a comment

Choose a reason for hiding this comment

williamdclt Oct 10, 2022

Choose a reason for hiding this comment

jakedt Oct 11, 2022

Choose a reason for hiding this comment

williamdclt Oct 10, 2022

Choose a reason for hiding this comment

williamdclt Oct 10, 2022

Choose a reason for hiding this comment

jakedt commented Oct 11, 2022

josephschorr Oct 11, 2022

Choose a reason for hiding this comment

jakedt Oct 11, 2022

Choose a reason for hiding this comment

josephschorr left a comment

Choose a reason for hiding this comment

jakedt commented Oct 7, 2022 •

edited

Loading