[WIP] Reference by UUIDs #443

jhprinz · 2016-03-08T23:23:17Z

This implements the option to reference objects in the storage by UUID. Usually, when pickling, objects are referenced in a storage by the type of the object and its index in the store.

The user can now choose between the old (faster and simpler) way of referencing and UUIDs. Because looking up UUIDs adds significant overhead, using them does not make sense for the ToyEngine on a local machine. If you run large systems, the overhead is negligible.
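To make the two referencing modes concrete, here is a minimal sketch. `ToyStore` and its methods are hypothetical names invented for illustration and are not the netcdfplus API. It only shows the idea: an index reference needs the store name and a position, while a UUID reference needs neither but requires a lookup.

```python
import uuid


class ToyStore:
    """Hypothetical in-memory store; NOT the netcdfplus implementation."""

    def __init__(self, name):
        self.name = name
        self.objects = []          # position in this list is the integer index
        self.uuids = []            # UUID kept alongside each object
        self._uuid_to_index = {}   # cache so a UUID lookup avoids a linear scan

    def save(self, obj):
        idx = len(self.objects)
        uid = uuid.uuid4()
        self.objects.append(obj)
        self.uuids.append(uid)
        self._uuid_to_index[uid] = idx
        return idx, uid

    def ref_by_index(self, idx):
        # Index-style reference: only meaningful inside this one store.
        return {"_store": self.name, "_idx": idx}

    def ref_by_uuid(self, idx):
        # UUID-style reference: mentions neither the store nor the position.
        return {"_uuid": str(self.uuids[idx])}

    def load(self, ref):
        if "_uuid" in ref:
            idx = self._uuid_to_index[uuid.UUID(ref["_uuid"])]
        else:
            idx = ref["_idx"]
        return self.objects[idx]


store = ToyStore("snapshots")
idx, uid = store.save("snapshot A")
```

Both `store.load(store.ref_by_index(idx))` and `store.load(store.ref_by_uuid(idx))` return the same object. The extra dictionary lookup and the extra stored UUID are the kind of overhead discussed above.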

The benefit is that you can load, save, and analyze objects independently of the storage. I will add the possibility of a split storage. This way you can also run jobs independently and later join the data into one big dataset while maintaining the connections between objects.
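As a rough illustration of why UUID references survive a join, the sketch below merges two plain dictionaries keyed by UUID. The record layout and the `new_object` and `join` helpers are assumptions made up for this example, not functions from this PR.

```python
import uuid


def new_object(payload, refs=()):
    """Create a record that refers to other records only by UUID."""
    return {"uuid": uuid.uuid4(), "payload": payload, "refs": list(refs)}


def join(*stores):
    """Merge several {uuid: record} dicts; identical UUIDs collapse to one entry."""
    merged = {}
    for store in stores:
        merged.update(store)
    return merged


# A shared object (e.g. an engine) is written by both independent jobs.
engine = new_object("engine")
traj_a = new_object("trajectory from job A", refs=[engine["uuid"]])
traj_b = new_object("trajectory from job B", refs=[engine["uuid"]])

job_a = {o["uuid"]: o for o in (engine, traj_a)}
job_b = {o["uuid"]: o for o in (engine, traj_b)}

joined = join(job_a, job_b)

# References still resolve after the join, with no index renumbering.
resolved = [joined[r]["payload"] for r in joined[traj_b["uuid"]]["refs"]]
```

With integer indices, the second job's references would have to be renumbered during the join. With UUIDs, the shared engine is simply deduplicated.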

Missing

Currently, stored and cached CVs cannot be stored in a file with no snapshots. The split utility function will hence only work for systems where snapshots use kinetics and statics for storage. The split into two files therefore works such that one file contains all trajectories, snapshots, statics and kinetics, while the other contains everything except statics and kinetics.

There is a utility function to split storages and a function to join them.

update RemoteKernel for an example
implement DistributedStorage
implement file merger
implement file split
example where we run independent simulations and analyze the joint results ?

Move to other PRs

decoupling CVs from snapshot storage will be in a separate PR: [WIP] Decouple storage of CVs from stored Snapshots #503
more thorough testing: currently everything works as before, but I want to add more UUID-specific testing


jhprinz · 2016-03-08T23:23:56Z

First step towards an adaptive sampling scheme.

dwhswenson · 2016-03-09T09:31:03Z

> First step towards an adaptive sampling scheme.

How is this related to adaptive sampling?

jhprinz · 2016-03-09T14:16:16Z

Because I might use it later for adaptive sampling :) It has more to do with what I need in order to do adaptive sampling on a cluster. So it is a step towards making OPS more usable for MSM_TIS adaptive sampling...

jhprinz · 2016-03-09T14:21:41Z

Hmmm, this was more work than expected. I now have something working where you can select the way netcdfplus handles internal references: either by UUID or by integer reference. UUIDs are disabled by default. That means you can switch to multi-file support if you want to. Using UUIDs makes objects unique across multiple files, but has some overhead, although the tests run in almost the same time. The mstis_analysis uses 1.5 additional seconds for 500 MC steps. I guess the benefit for large systems outweighs this, but for small and test systems the current implementation (single file and short references) is much better.

Also, an additional UUID per object is required. For a million objects this is (currently) 36 MiB more. I can get this down to 16 MiB by storing the UUID as bytes rather than as a string. The implementation can also be improved, but in general this works.
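The per-object sizes behind these figures can be checked with Python's standard `uuid` module. This is only a back-of-the-envelope sketch: it counts the raw UUID payload and ignores any netCDF or container overhead.

```python
import uuid

u = uuid.uuid4()
as_text = str(u)      # canonical hyphenated hex form, 36 characters
as_bytes = u.bytes    # raw 128-bit value, 16 bytes

n_objects = 1_000_000
text_total = len(as_text) * n_objects    # bytes needed for string storage
bytes_total = len(as_bytes) * n_objects  # bytes needed for binary storage

# The binary form round-trips to the same UUID.
restored = uuid.UUID(bytes=as_bytes)
```

For a million objects this gives 36,000,000 bytes in string form versus 16,000,000 bytes in binary form, and `restored == u` confirms that no information is lost.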


jhprinz · 2016-03-13T23:47:36Z

Trying single trajectory files now. Seems to work very nicely and should make the distributed trajectory generation simpler.


jhprinz · 2016-06-23T09:02:30Z

Closes #98


jhprinz added 10 commits March 7, 2016 18:24

get works

3b99af3

get and set works

cbf9c5a

Merge branch 'master' of github.com:choderalab/openpathsampling into …

c22080b

…remote

reverted mstis example

3933ba5

cleanup

b1c5f39

cleanup

186bc24

cleanup

a7c8ef5

implement use of uuid

addb2cb

fix errors

cafa86a

fix uuid_cache

88642b8

jhprinz changed the title ~~UUIDs~~ [WIP] Reference by UUIDs Mar 8, 2016

jhprinz added 3 commits March 9, 2016 14:47

first working

a8a54d4

make uuid optional

d30b64b

make uuid optional

d334425

jhprinz added 6 commits March 9, 2016 17:51

add MergedStore

c8fdbc3

Merge branch 'master' of github.com:choderalab/openpathsampling into …

3215302

…uuid

Merge branch 'master' of github.com:choderalab/openpathsampling into …

812ecb0

…remote

Merge branch 'remote' into uuid

a8378e0

updates

87259da

updates

b49e1dc

jhprinz added 5 commits April 3, 2016 23:41

Merge branch 'master' of github.com:choderalab/openpathsampling into …

d858e37

…uuid

merge updates

acf7b2f

Merge branch 'master' of github.com:choderalab/openpathsampling into …

8e359f4

…uuid

fix storage of engine with pathmover

77be36a

fix storage of engine with pathmover

fdd906f

jhprinz added 2 commits May 10, 2016 02:24

fix traj attribute delegation

f95fc12

allos json picking of all objects

2041dc4

jhprinz mentioned this pull request May 16, 2016

[WIP] OPS on remote kernels #442

Closed

jhprinz added 4 commits May 17, 2016 12:38

fix setting of boxvectors

c6c9e51

fix openmm test

f16309f

fix openmm test

07dc302

Merge branch 'master' of github.com:choderalab/openpathsampling into …

45c27b9

…uuid

jhprinz mentioned this pull request May 19, 2016

Box_Vectors not set in openmm engine #483

Closed

jhprinz added 11 commits May 19, 2016 09:50

update examples

081c1e5

Merge branch 'master' of github.com:choderalab/openpathsampling into …

f0bd61d

…uuid

CUDA to CPU

c3a0ebc

fix nosetests

be04063

fix nosetests for both storage types

b02c6cb

fix requirements

f87e35a

speed tests

e3d76d8

faster rate matrix computation

63a79b1

fix nb test

865b1c6

fix nb test

024d900

Merge branch 'master' of github.com:choderalab/openpathsampling into …

40fa01a

…uuid

dwhswenson added this to the 1.0 milestone Jun 2, 2016

Merge branch 'master' of github.com:choderalab/openpathsampling into …

3e5ac2b

…uuid

jhprinz mentioned this pull request Jun 23, 2016

Splitting of netCDF4 files #98

Closed

jhprinz added 4 commits June 24, 2016 14:16

Merge branch 'master' of github.com:choderalab/openpathsampling into …

c9b3d4d

…uuid

add split function for storages

bb89c48

Merge branch 'master' of github.com:choderalab/openpathsampling into …

52f8f07

…uuid

move split to util.py

871c5b3

jhprinz mentioned this pull request Jul 12, 2016

New Storage implementation #517

Merged

36 tasks

dwhswenson closed this in #517 Aug 1, 2016

jhprinz deleted the uuid branch October 6, 2016 12:05
