scripts for rebuilding a dev environment (with sample data) #7256

pdurbin · 2020-09-09T18:57:22Z

Yesterday in tech hours we talked about things to help developers and I brought up what I think of as the "rebuild with sample data" scripts I've been using for years and years. (I stopped using them only recently because we switched to Payara.)

There are lots of reasons developers might want to rebuild from time to time:

You've been running integration tests and all your "nice" datasets are buried beneath them.
You're ready for a clean start with known (or no) data.
You're having trouble with database upgrade scripts.

By rebuild I don't mean that every single dependency is removed. In fact, the application server (Glassfish or Payara) is largely untouched in my scripts. That said, a number of major changes are made:

The database is dropped.
The data files are deleted.
Solr is cleared out.
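
As a rough illustration of the three destructive steps above, here is a minimal sketch that only builds the commands and prints them unless asked to run them. It is not the actual phoenix script. The database name, database user, files directory, and Solr core URL are all assumptions passed in by the caller, and the Solr step assumes the standard update handler accepts a delete-by-query.

```python
import shlex
import subprocess
from typing import List


def rebuild_plan(db_name: str, db_user: str, files_dir: str,
                 solr_core_url: str) -> List[List[str]]:
    """Return the destructive rebuild commands, in order, without running them."""
    # Refuse obviously dangerous targets for the recursive delete.
    if files_dir.strip() in ("", "/"):
        raise ValueError("refusing to delete an empty path or the filesystem root")
    return [
        # 1. Drop the database (--if-exists avoids an error on a fresh machine).
        ["dropdb", "--if-exists", "-U", db_user, db_name],
        # 2. Delete the data files.
        ["rm", "-rf", files_dir],
        # 3. Clear Solr by deleting every document and committing.
        ["curl", "-s", f"{solr_core_url.rstrip('/')}/update?commit=true",
         "-H", "Content-Type: text/xml",
         "--data-binary", "<delete><query>*:*</query></delete>"],
    ]


def run_plan(plan: List[List[str]], dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for argv in plan:
        print(" ".join(shlex.quote(arg) for arg in argv))
        if not dry_run:
            subprocess.run(argv, check=True)
```

Calling `run_plan(rebuild_plan(...))` with the default `dry_run=True` only prints the commands, which seems like a sensible default for something this destructive.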

These scripts eventually evolved into the ones we used on the phoenix server for many years. This server was rebuilt as described above on every run. The scripts can be found at https://github.com/IQSS/dataverse/tree/develop/scripts/deploy/phoenix.dataverse.org

After "rebuild" has run, the "post" script is executed. It starts with some setup...

Run setup-all.sh.
Run SQL scripts (reference data and create sequence).
Set DOI provider to FAKE.

... and then continues on to load some sample data. As I mentioned on the call, these scripts create some "birds and trees" users, dataverses, and datasets. (Even though our sample data repo is newer, it doesn't create users.) The Spruce Goose dataset (screenshot below) might be familiar.
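
The last of the setup steps listed above (setting the DOI provider to FAKE) can be sketched as a single HTTP call. This assumes the provider is a database setting exposed as `:DoiProvider` through the admin settings API on a local installation; the base URL and the helper name are illustrative, not taken from the scripts.

```python
import urllib.request


def doi_provider_request(base_url: str, provider: str = "FAKE") -> urllib.request.Request:
    """Build (but do not send) the PUT request that sets the DOI provider.

    Assumption: the setting lives at /api/admin/settings/:DoiProvider.
    """
    url = f"{base_url.rstrip('/')}/api/admin/settings/:DoiProvider"
    return urllib.request.Request(url, data=provider.encode("utf-8"), method="PUT")
```

To actually apply the setting, the request would be passed to `urllib.request.urlopen(...)` once the application is deployed and listening.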

When estimating this issue, here are some questions to consider:

Do we want the "birds and trees" data? Or would we rather have the sample data? Or both?
Should we defer worrying about sample data until a later issue?
Should we consider adding a rebuild or reinstallation of Payara as part of this? Or should we stick to the model above?
Should we use the bash scripts above as a starting point? Or should this be a (dangerous!) feature of the installer?

poikilotherm · 2020-09-10T12:46:44Z

Some ideas:

Move all SQL stuff to a Flyway baseline and/or migration (see "Flyway: disable DDL generation from EclipseLink" #5871).
Make DOI providers configurable via MicroProfile Config API, just like everything else (see "As a sysadmin I want to use MicroProfile Config API to configure my installation" #7000).
Move everything from setup.sh to bootstrapping code (see "Bootstrapping config on first deployment" #5361) while keeping it configurable via #7000.
Load test/sample data depending on environment/configuration, either with the same trick or with a Maven profile that runs Flyway.
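
To make the "depending on environment/configuration" idea a bit more concrete, here is a small sketch of how a property could be resolved from environment variables in the way MicroProfile Config maps names: exact name first, then non-alphanumeric characters replaced by underscores, then the upper-cased form. The property name `dataverse.sampledata.load` is purely hypothetical, and a real implementation would live in Java rather than Python.

```python
import re
from typing import List, Mapping, Optional


def env_candidates(prop: str) -> List[str]:
    """Environment variable names to try for a property, in lookup order."""
    sanitized = re.sub(r"[^A-Za-z0-9]", "_", prop)
    return [prop, sanitized, sanitized.upper()]


def lookup(prop: str, env: Mapping[str, str],
           default: Optional[str] = None) -> Optional[str]:
    """Return the first matching environment value, or the default."""
    for name in env_candidates(prop):
        if name in env:
            return env[name]
    return default


def should_load_sample_data(env: Mapping[str, str]) -> bool:
    """Hypothetical switch: load sample data only when explicitly enabled."""
    value = lookup("dataverse.sampledata.load", env, "false")
    return value.strip().lower() == "true"
```

With this, a dev machine could export `DATAVERSE_SAMPLEDATA_LOAD=true` while a production installation leaves it unset and gets no sample data.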

"Birds and trees" data sounds good. It should be quick to load. dataverse-sample-data is sometimes a bit slow, but we could change that.

djbrooke · 2020-09-30T18:19:08Z

This would be updating existing scripts to work with Payara.
A better outline/index to exemplify the characteristics of the sample data (see "New Index/Outline of Characteristics + Review/Revise/Iterate Data Files", dataverse-sample-data#23).


pdurbin · 2020-10-29T14:57:15Z

I've made two pull requests:

#7363 - add script for rebuilding dev environment #7256
dataverse-sample-data#25 - allow API token to be retrieved from environment variable #24
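
For readers wondering what "retrieved from environment variable" might look like, here is a minimal sketch, not the code from the pull request. The variable name `API_TOKEN` and the fallback to a configured value are assumptions for illustration.

```python
import os
from typing import Mapping, Optional


def get_api_token(config_token: Optional[str] = None,
                  env: Mapping[str, str] = os.environ,
                  var_name: str = "API_TOKEN") -> str:
    """Prefer the environment variable, fall back to the configured token."""
    token = env.get(var_name) or config_token
    if not token:
        raise RuntimeError(
            f"no API token: set {var_name} or provide one in the configuration"
        )
    return token
```

The point of the environment variable is that a rebuild script can create a user, capture the freshly generated token, and hand it to the sample data loader without editing a config file.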


djbrooke added this to Up Next 🛎 in IQSS/dataverse (TO BE RETIRED / DELETED in favor of project 34) Sep 30, 2020

djbrooke added the Medium label Sep 30, 2020

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Oct 23, 2020

Remove reference_data.sql usages. IQSS#7256

1a122d4

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Oct 23, 2020

Move index creation from reference_data.sql into Flyway baseline. IQSS#7256

fc3dbb6

poikilotherm mentioned this issue Oct 23, 2020

Epic: small footprint container usable for development, testing and production purposes #5292

Closed

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Oct 23, 2020

Replace initial data insert from reference_data.sql with Flyway afterMigrate callback SQL script. IQSS#7256

4611825

poikilotherm mentioned this issue Oct 23, 2020

7256 purge referencedata #7355

Merged

pdurbin moved this from Up Next 🛎 to IQSS Team - In Progress 💻 in IQSS/dataverse (TO BE RETIRED / DELETED in favor of project 34) Oct 26, 2020

pdurbin self-assigned this Oct 26, 2020

pdurbin added a commit that referenced this issue Oct 27, 2020

add script for rebuilding dev environment #7256

5b5e946

pdurbin mentioned this issue Oct 27, 2020

add script for rebuilding dev environment #7256 #7363

Merged

pdurbin removed this from IQSS Team - In Progress 💻 in IQSS/dataverse (TO BE RETIRED / DELETED in favor of project 34) Oct 29, 2020

pdurbin removed their assignment Oct 29, 2020

pdurbin added a commit that referenced this issue Oct 29, 2020

note that files dir must be the default #7256

ecde956

pdurbin added a commit that referenced this issue Oct 29, 2020

typo #7256

d3778f3

kcondon closed this as completed in #7363 Oct 30, 2020

kcondon added a commit that referenced this issue Oct 30, 2020

Merge pull request #7363 from IQSS/7256-dev-rebuild

0b268ca

add script for rebuilding dev environment #7256

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Nov 6, 2020

Add a comment to the first bootstrap SQL script. IQSS#7256

8993a35

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Nov 6, 2020

Add a comment to the first bootstrap SQL script. IQSS#7256

5132e04

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Nov 14, 2020

Update Flyway SQL files to reflect version change to 5.2. IQSS#7256

422bc26

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Dec 1, 2020

dataverse-k8s: remove reference_data.sql loading from bootstrap, as IQSS#7256 introduced a migration for this.

4422e59

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Jun 11, 2021

dataverse-k8s: remove reference_data.sql loading from bootstrap, as IQSS#7256 introduced a migration for this.

5ae67f3

poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Aug 23, 2021

dataverse-k8s: remove reference_data.sql loading from bootstrap, as IQSS#7256 introduced a migration for this.

c5ac53e
