Feature / database extract runs #1397

WilliamHPNielsen · 2018-11-27T10:11:20Z

Changes proposed in this pull request:

Add a function, extract_runs_into_db that takes a set of run_ids from one database file and inserts those runs into another database file.
Add a notebook explaining this function

The current limitations and behaviours are:

All runs to be inserted (in one function call) must come from the same experiment
The source database must be of the newest version. If it is not, calling the extraction function is a NOOP, and the user will be warned that an upgrade is needed. A boolean kwarg allows the user to have extract_runs_into_db upgrade the source DB.
The target DB will be created if it does not exist. If it does exist, ~~it will be upgraded~~ and is in an old version, the user will be warned similarly to how it works for the source DB.

@QCoDeS/core

astafan8

This is ready to go! It was a great freaking piece of work. I tested successfully on actual real .db files, and now everything looks fine :) Time to celebrate!

WilliamHPNielsen · 2018-11-30T12:13:30Z

@QCoDeS/core shall we merge once CI (minus codacy, who doesn't like fixtures) is happy?

Merge: b0ad951 557e627 Author: William H.P. Nielsen <whpn@mailbox.org> Merge pull request #1397 from WilliamHPNielsen/feat/database_copy_paste_runs

ThorvaldLarsen · 2019-01-02T14:24:34Z

@WilliamHPNielsen Concerning this comment when extracting to different databases:

"Note that the runs will have different run_ids in the new database. Their GUIDs are, however, the same (as they must be)."

This can cause a lot of confusion as currently the run_id and sample name are the unique identifiers that we use for a measurement run (that is the reason to have this information printed in the title of all plots). Is the idea that we should be using the guid as the unique identifier when putting datasets into presentations?
I think we should make sure that either the run_id sample name stays unique across all databases or change the identifier we use.

I am happy to meet and discuss/learn if these concerns have already been addressed elsewhere.

WilliamHPNielsen · 2019-01-03T08:21:37Z

@ThorvaldLarsen as discussed offline yesterday, the core team will get together and attack this. But just for completeness, I note here that I currently think changing the identifier you use is better. The dataset name seems like a good pick.

WilliamHPNielsen added 30 commits November 14, 2018 12:55

Add very rudimentary copy-paste module

063c15a

Add sql_placeholder_string function

bb51e60

Make Experiment take a connection and use that in funcs

89a4580

Add first partial copy function

3d98f58

Protect copy_single_dataset against inserting duplicates

de60f6b

Add experiment table update to run copy

6f7b0b7

Add basic database copy_paste test

01b2579

Add complete single run copy function

78e767b

Refactor table name formatting into function

2c04cac

Extend test to test for dataset equility

52f5a88

Move atomicity further out

eb8ee09

Handle existing vs non-existing experiments

edc32b7

Extend test to assert experiment equality

1dc6c44

Change source-target fixture to yield conns

084d1f0

Close connections

8ffc7e3

Adapt test to fixture change

1de9771

Remove wrong statement in docstring

1a2995d

Add test for experiment routing

3219453

Change function name

fed3b61

Pass on connections correctly

ff63bed

Add method for testing dataset similarity

ab05e69

Add a load_by_GUID method to data_set module

36a70c9

Extend tests to check dataset similarity

669cb37

Handle data column types correctly

7db1467

Improve two docstrings

8de6a8a

Move log initiation line

9add02b

Fix typo

95c2b49

Refactor run_id from guid into sqlite_base

f9a4285

Be correct about conn and path_to_db in docstring

d51455c

Move list definition closer to where it is used

41feda8

WilliamHPNielsen and others added 21 commits November 29, 2018 13:11

Remove dead line of code

043f03c

Fix critical typo in test

fd9d046

Also extract metadata

4a2b8c8

Refactor how snapshot get into dataset

3ac5677

Also extract snapshot

9200cce

Add a station to the example notebook

965be0b

Add version check of target DB file

7311b05

Make an error message more informative

b0b731d

Change the indentation in a docstring

c5d9bbb

Unindent a block

c813da9

Fix indentation error

08061c4

Remove print call

e20d4d0

Reword the docstring of a test

18cbcea

Assert some more

b3d2060

Fix typo in a comment

ea58bfd

Merge branch 'master' into feat/database_copy_paste_runs

bf56cd5

Replace SomeConnection with ConnectionPlus

848373b

Add test for old target db file

8eadd19

Add test for atomicity

5d7557e

Add test for column mismatch between target and source

c37c1fb

Use instrument fixture in tests to ensure instrument gets closed

114253d

astafan8 approved these changes Nov 29, 2018

View reviewed changes

WilliamHPNielsen added 2 commits November 30, 2018 09:26

Merge branch 'master' into feat/database_copy_paste_runs

1a9fae0

Merge branch 'master' into feat/database_copy_paste_runs

557e627

WilliamHPNielsen merged commit d20647c into microsoft:master Nov 30, 2018

WilliamHPNielsen deleted the feat/database_copy_paste_runs branch November 30, 2018 12:50

giulioungaretti pushed a commit that referenced this pull request Nov 30, 2018

Generated gh-pages for commit d20647c

9530782

Merge: b0ad951 557e627 Author: William H.P. Nielsen <whpn@mailbox.org> Merge pull request #1397 from WilliamHPNielsen/feat/database_copy_paste_runs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature / database extract runs #1397

Feature / database extract runs #1397

WilliamHPNielsen commented Nov 27, 2018 •

edited

astafan8 left a comment

WilliamHPNielsen commented Nov 30, 2018

ThorvaldLarsen commented Jan 2, 2019

WilliamHPNielsen commented Jan 3, 2019

Feature / database extract runs #1397

Feature / database extract runs #1397

Conversation

WilliamHPNielsen commented Nov 27, 2018 • edited

astafan8 left a comment

Choose a reason for hiding this comment

WilliamHPNielsen commented Nov 30, 2018

ThorvaldLarsen commented Jan 2, 2019

WilliamHPNielsen commented Jan 3, 2019

WilliamHPNielsen commented Nov 27, 2018 •

edited