Metric v2 migration #42613
Conversation
Include commented out code for removing cards not consuming queries.
LGTM, but probably worth thinking about the couple of questions.
```yaml
- column:
    name: dataset_query_metrics_v2_migration_backup
    remarks: The copy of dataset_query before the metrics v2 migration
    type: ${text.type}
```
I wonder if this should be a separate table [report-card-id, old-query]
Having this hang around on report_card could be rather annoying long-term.
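For illustration, the shape such a table might take, just mirroring the [report-card-id, old-query] pair above (the table name and constraints are hypothetical, not from the PR):

```sql
-- Hypothetical backup table; one row per rewritten card.
CREATE TABLE metrics_v2_migration_backup (
    report_card_id INTEGER NOT NULL PRIMARY KEY REFERENCES report_card (id),
    old_query      TEXT    NOT NULL
);
```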
I was considering this too. Do you know of prior art? I assumed we would drop that field a few releases later. We would have to do that with a separate table too, I guess. I wonder if such an extra column can mess up serdes (although the tests don't fail).
Yeah, my concern was serdes as well. Just from the cleanliness of the serialized cards (in a git-backed serdes system), I think it would probably be better as a separate table if it's not too much more cumbersome.
I think creating a separate table is ultimately going to be more code and more noise; I think we should just have SerDes ignore the backup column entirely. It's only there in case we broke things anyway.
Populating the backup column with values can and should be done in Liquibase; look at migration 85 for an example. Only do stuff in Clojure migrations if it can't be done in SQL.
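A minimal sketch of what such a Liquibase changeset could look like, in the same YAML style as the column definition above. The changeset id and author are placeholders, not the PR's actual migration numbers:

```yaml
# Sketch only: id/author are placeholders; a real changeset would follow
# the numbering scheme of migration 85 mentioned above.
- changeSet:
    id: vXX.0000-00-00T00:00:00
    author: example
    changes:
      - sql:
          sql: >-
            UPDATE report_card
            SET dataset_query_metrics_v2_migration_backup = dataset_query
    rollback:
      - sql:
          sql: >-
            UPDATE report_card
            SET dataset_query = dataset_query_metrics_v2_migration_backup
            WHERE dataset_query_metrics_v2_migration_backup IS NOT NULL
```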
@camsaul, the custom migration populates this field only if it rewrites dataset_query. Are you suggesting always setting this field, adding a condition approximating what's in the custom migration, or something else?
See my other comment; I think it's probably fine to just always set this field, since it's just copying pointers and we're already doing a full table scan anyway.
Wouldn't that imply that on rollback we lose changes that we could keep? I ranked the importance of this higher than speed.
I think you have to put all of your

```clojure
(:id card)
(-> rewritten
    (assoc :dataset_query_metrics_v2_migration_backup (:dataset_query card)
           :updated_at (Instant/now))))))))
```
I think it's usually better to use current_timestamp or now() (or now(3) for MySQL/MariaDB to get millisecond precision, although that's probably not important in this case), so you don't have to worry about weirdness happening here with timezone conversion if this is a datetime or timestamp [without time zone] column. But I guess it doesn't really matter that much.
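For comparison, a sketch of the database-side alternative being suggested; the column names follow the PR, but the WHERE clause is illustrative only:

```sql
-- Letting the database supply the timestamp sidesteps client-side timezone
-- conversion entirely (use now(3) on MySQL/MariaDB for millisecond precision).
UPDATE report_card
SET updated_at = current_timestamp
WHERE dataset_query_metrics_v2_migration_backup IS NOT NULL;
```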
Yes, in fact, I'm not 100% sure that updating this column is what everybody would wish.
I've removed this occurrence altogether, but there is another one. Is this important? As long as setting the timestamp works, I think we are fine.
```clojure
(t2/update! :report_card
            (:id card)
            (-> rewritten
                (assoc :dataset_query_metrics_v2_migration_backup (:dataset_query card)
```
I think it's probably easier to just do

```sql
UPDATE report_card
SET dataset_query_metrics_v2_migration_backup = dataset_query
```

or

```sql
UPDATE report_card
SET dataset_query_metrics_v2_migration_backup = dataset_query
WHERE dataset_query LIKE '%["metric",%'
```

in a Liquibase migration to back up all the Card queries in one go. Then you have a discrete step that is super easy to reverse, because all you need to do is

```sql
UPDATE report_card
SET dataset_query = dataset_query_metrics_v2_migration_backup
WHERE dataset_query_metrics_v2_migration_backup IS NOT NULL
```

Those should both be reasonably quick (other than the full table scan), since the strings should be stored separately anyway, so it's just copying pointers rather than entire strings.

Not sure if the `WHERE dataset_query LIKE '%["metric",%'` condition would actually make it faster: on the one hand you're avoiding a lot of updates, but on the other hand now you have to fetch and scan every string. It's simpler to just copy the value for every Card.
I think knowing which cards actually got rewritten (and might have to be restored) is more important than speed, but I can be convinced otherwise.
test/metabase/models/card_test.clj
```clojure
:made_public_by_id
;; we don't expect a description for this column because it should never change
;; once created by the migration
:dataset_query_metrics_v2_migration_backup nil} col)
```
Same suggestion: just change the assertion from `=` to `=?` instead of adding this.
Can you be a bit more precise about what you mean? This is an explicit set of fields we want to exclude from testing.
Other than relatively small style suggestions, my main concern here is that the migration is doing things really inefficiently. You're doing things like

```clojure
(doseq [card (select :report_card :collection_id 1)]
  (t2/delete! :report_card :id (:id card)))
```

which is pretty slow, when you could have just done

```clojure
(t2/delete! :report_card :collection_id 1)
```

instead. This can be pretty important since it affects startup time, and we're dealing with potentially hundreds of things that need migration; one DML query versus 100 is a pretty big performance difference at startup time.
But it would have been even better to make most of these things plain SQL/Liquibase migrations in the first place anyway. It's way easier to write efficient and unbuggy SQL queries than it is to write Clojure migrations.
Creating the migrations collection, granting it perms, and populating the backup column for dataset_query can all be done in Liquibase migrations easily, as can reversing those migrations.
The only thing that really needs to be done as a Clojure migration is the part where you migrate queries for Cards that referenced v1 Metrics.
Migrating V1 Metrics to Cards could be done in pure SQL if you wanted to. Something like this would probably do the trick:

```sql
INSERT INTO report_card
  (name, type, creator_id, migrated_from_v1_metric_id, dataset_query, database_id, table_id, collection_id)
SELECT
  concat(metric.name, ' (Migrated from metric ', cast(metric.id AS text), ')'),
  'metric',
  metric.creator_id,
  metric.id,
  -- Maybe using native JSON manipulation functions here would be easier
  concat(
    '{"database":',
    cast(t.db_id AS text),
    ',"type":"query","query":',
    concat(
      trim(trailing '}' FROM metric.definition),
      ',"source-table":',
      metric.table_id,
      '}'
    ),
    '}'
  ),
  t.db_id,
  metric.table_id,
  (SELECT id FROM collection WHERE slug = 'migrated_metrics_v1')
FROM metric
LEFT JOIN metabase_table t
  ON metric.table_id = t.id;
```
You could add a new column called migrated_from_v1_metric_id and then build a map of migrated_from_v1_metric_id => id to migrate queries in Clojure land.
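A rough sketch of building that map with Toucan 2 (the `t2/select` column syntax and kv-arg condition are assumed from Toucan 2 usage elsewhere in the thread; `migrated_from_v1_metric_id` is the proposed column, not yet in the schema):

```clojure
;; Build {v1-metric-id -> new-card-id} from the cards inserted by the SQL above.
(def metric-id->card-id
  (into {}
        (map (juxt :migrated_from_v1_metric_id :id))
        (t2/select [:report_card :id :migrated_from_v1_metric_id]
                   :migrated_from_v1_metric_id [:not= nil])))
```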
For the Clojure migrations, select-reducible + reduce is going to be a lot more efficient than select, which fetches everything into memory at once and then iterates over it with doseq; even more so if you only fetch the columns you actually need.
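A hedged sketch of the pattern being suggested, assuming Toucan 2's `reducible-select`; `rewrite-metrics` is a hypothetical stand-in for the PR's actual query-rewriting logic:

```clojure
;; Stream cards through a reducible query instead of realizing the whole
;; table in memory; only the columns we need are fetched.
(reduce
 (fn [migrated card]
   (if-let [rewritten (rewrite-metrics (:dataset_query card))] ; hypothetical helper
     (do (t2/update! :report_card (:id card)
                     {:dataset_query                             rewritten
                      :dataset_query_metrics_v2_migration_backup (:dataset_query card)})
         (inc migrated))
     migrated))
 0
 (t2/reducible-select [:report_card :id :dataset_query]))
```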
Ok, I was thinking about doing the Metric => Card migration in SQL a little more, and the only real tricky part here is populating dataset_query:

```sql
jsonb_set(
  jsonb_set(
    jsonb_set('{"type":"query"}'::jsonb,
              '{database}', to_jsonb(t.db_id)),
    '{query}', metric.definition::jsonb),
  '{query, source-table}', to_jsonb(metric.table_id)
)::text
```

This seems a lot more readable to me; the only downside is you'd have to do different versions for MySQL and H2, but maybe you'd have to do that anyway. Take a look at

Ok, so I guess I misspoke a bit: the version of H2 we're using doesn't have enough JSON functions to make this work with native JSON manipulation in H2. So I guess you'd have to do
@camsaul, I have the feeling that either we go all in and do the whole migration in SQL, or we keep the question rewriting part in Clojure, in which case it makes little sense to do the v2 metric card creation in SQL, because that's orders of magnitude less work. I expect the updates to the question cards to dominate the whole process. I'm not super keen on moving the whole thing to SQL, because reproducing all the tests I did with the Clojure-land rewrite would be quite a large amount of work. Unless you think it's crucial to do as much as possible in standard Liquibase, I would prefer to keep the data-manipulating parts of the migration together.
fixes #42186
fixes #42187
fixes #42188
fixes #42189
fixes #42190
fixes #42191