
Sample content #40753

Merged: calherries merged 49 commits into master from cal-sample-dashboard on Apr 12, 2024
Conversation

@calherries (Contributor) commented Mar 28, 2024

Epic: #40066

Follows #40907, which was a prerequisite for this change.

To verify:

  • drop the app DB (and if using Postgres or MySQL, re-create it)
  • start up Metabase

During setup, we insert sample content from resources/sample-content.edn into the app DB as a migration.
The sample content includes the "Examples" collection, which contains the "E-commerce insights" dashboard and its dependencies.

This approach avoids having to maintain any code to generate the collection and its contents, because it's just data. It also ensures the content is forward compatible to the same extent that any dashboard, card, or collection is forward compatible, because we have to keep those entities forward compatible anyway.

The content comes from this collection: https://stats.metabase.com/collection/1449-example-dashboard-prototype
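For context, here is a minimal sketch of what loading an EDN resource during a migration could look like. This is not the actual migration code from this PR; the `insert-entities!` helper and the shape of the EDN file are assumptions for illustration only.

```clojure
(ns hypothetical.sample-content
  (:require [clojure.edn :as edn]
            [clojure.java.io :as io]))

(defn- read-sample-content
  "Read the bundled EDN file from the classpath."
  []
  (-> (io/resource "sample-content.edn")
      slurp
      edn/read-string))

(defn load-sample-content!
  "Hypothetical migration step: insert each group of entities
  (collections, dashboards, cards, ...) from the EDN file into
  the app DB. `insert-entities!` stands in for whatever insert
  helper the real migration uses, and the EDN file is assumed to
  be a map of model name -> rows."
  [insert-entities!]
  (doseq [[model rows] (read-sample-content)]
    (insert-entities! model rows)))
```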

Testing strategy

The testing strategy for this is controversial and worth questioning. There are two things you should know:

  1. e2e tests run without the sample content.
    There are two reasons for this:
    (a) we don't want to couple the e2e tests to the sample content, so that the sample content stays easy to update
    (b) updating the e2e tests would be a pain, because a lot of tests depend on the sample content not being there

However, I would argue we can get away with minimal test coverage using the following logic:

1. we promise users that "previously created content" will work after any upgrade in the future
2. we (should) have tests that enforce this property
3. whenever an upgrade happens, the sample content is always "previously created content", because it is created as a DB migration when the Metabase app DB is initialized
4. therefore, we have tests to enforce that the sample content will work after any upgrade in the future

Also

1. we have tests that check that content created with the Metabase UI works as expected
2. the sample content was created with the Metabase UI
3. therefore, we have tests that show the sample content works as expected
  2. Most of the backend tests run without the sample content. This is just to keep the test suite fast: with the sample content, the Postgres and MySQL test suites were taking over 90 minutes to complete. This carries substantial risk, but I don't have an alternative solution. There is potentially room to increase the number of tests that run with the sample content present, and that could be important, e.g. for the load-from-H2 tests. My worry is that any code which depends on an initialized Metabase instance having no content may pass tests but fail in production. As far as I'm aware no such code exists today, but it could in the future. At least we would catch the issue in development.
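As an illustration of the kind of backend check being argued for here (hypothetical, not a test that exists in this PR), a test could assert that a freshly initialized app DB contains the "Examples" collection; `query-collections` below is an assumed stand-in for the suite's DB access:

```clojure
(ns hypothetical.sample-content-test
  (:require [clojure.test :refer [deftest is]]))

(defn examples-collection-present?
  "True when a collection named \"Examples\" exists among the rows
  returned by `query-collections` (a hypothetical DB helper)."
  [query-collections]
  (boolean (some #(= "Examples" (:name %)) (query-collections))))

(deftest sample-content-present-after-setup-test
  ;; In a real suite this would query an app DB that was initialized
  ;; with the sample-content migration enabled; here the query is faked.
  (let [fake-query (constantly [{:name "Examples"} {:name "Other stuff"}])]
    (is (examples-collection-present? fake-query))))
```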

@metabase-bot metabase-bot bot added the .Team/BackendComponents also known as BEC label Mar 28, 2024
@calherries calherries changed the title initial dashboard loading Sample dashboard Mar 28, 2024

replay-io bot commented Mar 28, 2024

Status: Complete
Commit: a195d56
Results: ⚠️ 12 flaky, 2395 passed

@calherries calherries marked this pull request as draft March 28, 2024 18:05
@calherries calherries added the no-backport Do not backport this PR to any branch label Apr 1, 2024
@calherries calherries marked this pull request as ready for review April 1, 2024 14:28
@calherries calherries changed the title Sample dashboard Sample content Apr 1, 2024
@calherries calherries requested a review from npretto April 1, 2024 14:35
@calherries (Contributor, Author) commented Apr 4, 2024

> I feel that I'm missing something here re: substantial risk. Besides the "load-from-h2" scenario, what's the worst kind of issue you could anticipate slipping through?

This is a very good question, and I don't have a great answer. My worry is any code that depends on an initialized metabase instance having no content may pass tests but fail in production. As far as I'm aware there's no existing code of this nature but it's possible this could exist in the future. At least in development, we would catch the issue.

@albertoperdomo (Member) left a comment

  • It looks like we could tweak a few of the question names - @vbenedetti?
    • Some questions have "- Maz" in the name; we should probably drop that.
    • "(not binned yet)" - is this question still used, or was it replaced with a new one?
    • Maybe we should not use the auto-generated names?

Also, is it just me? I can't seem to find the dashboard itself.

@calherries (Contributor, Author) commented Apr 10, 2024

@albertoperdomo @vbenedetti I've tweaked the content to incorporate these changes:

@calherries (Contributor, Author) commented Apr 11, 2024

@albertoperdomo @vbenedetti I combed through the content and realised there were a few errors in the queries and places where the dashboard could be improved. I've made the following changes:

  • "Buyer's ages by segments" (which I renamed to "Buyers by age group") was showing the sum of counts of people by individual age, but the x-axis was the count of people at each age rather than the age itself. It now shows the number of people in each 5-year age range.
  • A few questions in "Demographics" had auto-binned ages, which produced odd bins of 7.5 years, e.g. "22.5-30". They now use 5-year bins, e.g. "20-25" (see the sketch after this list).
  • Trend charts like "Revenue per quarter" showed N/A for the previous quarter/month; they now report the last two quarters/months.
  • Time-series graphs had either hard-coded date filters or no date filters at all, which meant the data ran out to 2026. I added filters on orders.created_at for the last 24 months to these questions.
  • "Unique customers per month" was using people.created_at, which grouped customers by when they signed up. I changed it to group by orders.created_at so the data matches the title.
  • The Location filter (which applies to people.state) was an input box, so I changed it to a dropdown list.
  • I deleted questions we weren't using.
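As a rough illustration of the fixed 5-year binning mentioned above (plain Clojure, not Metabase's actual binning implementation):

```clojure
(defn age->bin
  "Map an age to a fixed 5-year bin label, e.g. 22 -> \"20-25\".
  Fixed-width bins like these replace the auto-binned 7.5-year
  buckets (e.g. \"22.5-30\") described above."
  [age]
  (let [lower (* 5 (quot age 5))]
    (str lower "-" (+ lower 5))))

;; (age->bin 22) ;=> "20-25"
;; (age->bin 37) ;=> "35-40"
```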

I'd appreciate it if you (or one of you) double-checked these changes against stats and approved this PR when you're done.

@calherries calherries merged commit bdb34a3 into master Apr 12, 2024
105 checks passed
@calherries calherries deleted the cal-sample-dashboard branch April 12, 2024 16:21

@calherries Did you forget to add a milestone to the issue for this PR? When and where should I add a milestone?
