Add VAMP, CKTests (MSM and VAMP) #25

marscher · 2019-09-10T15:27:27Z

This adds VAMP estimator/model and the infrastructure for lagged model validation (cktests).

While getting this to work, I noticed that calling fit on an estimator has unexpected side effects. That is why we need to take a copy of it in LaggedModelValidator. The factory pattern should make this copy unnecessary, but because we encapsulate the current model instance, we cannot work around it.
@clonker do you think it would be sane to call _create_model upon fit() to avoid this kind of hassle? How would we enforce this behavior without interfering with overridden fit methods?
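To make the side effect concrete, here is a minimal sketch with toy classes. `MeanModel` and `SharedModelEstimator` are hypothetical names invented for illustration and are not the sktime implementation; the sketch only assumes that the estimator keeps a single model instance and that fetch_model hands out a reference to it.

```python
import copy


class MeanModel:
    """Toy model that stores only a fitted statistic (hypothetical)."""

    def __init__(self):
        self.mean = None


class SharedModelEstimator:
    """Hypothetical estimator that reuses ONE model instance across fit() calls."""

    def __init__(self):
        self._model = MeanModel()

    def fit(self, data):
        # Mutates the encapsulated model in place.
        self._model.mean = sum(data) / len(data)
        return self

    def fetch_model(self):
        # Returns a reference, not a copy.
        return self._model


est = SharedModelEstimator()
first = est.fit([1.0, 2.0, 3.0]).fetch_model()
second = est.fit([10.0, 20.0, 30.0]).fetch_model()

# Both names point at the same object, so the first result was overwritten.
aliased = first is second
first_mean_after_refit = first.mean

# Workaround used by a validator: copy the model before fitting again.
safe = copy.deepcopy(est.fit([1.0, 2.0, 3.0]).fetch_model())
est.fit([10.0, 20.0, 30.0])
safe_mean = safe.mean
```

In this sketch the second fit silently changes the model obtained from the first one, which is the kind of surprise a validator that refits at several lag times would run into unless it copies.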

marscher · 2019-09-12T17:08:30Z

https://stackoverflow.com/a/34225828/3086470 describes some patterns, as well as design considerations.

marscher · 2019-09-12T17:17:54Z

the test failure in kmeans is totally unrelated and surprising!

clonker · 2019-09-14T18:43:01Z

well it is a randomized test 🤷‍♂️ and it just checks whether the callback was invoked twice.. could've converged after one iteration?

clonker · 2019-09-14T18:47:10Z

Would it help to always yield a copy when calling fetch_model?

marscher · 2019-09-16T13:37:48Z

On 14.09.19 20:47, Moritz Hoffmann wrote: Would it help to always yield a copy when calling fetch_model?

It would cure the symptoms, but I thought we decided against it in the beginning, since copying is a heavy (memory) operation and unexpected at this point.

clonker · 2019-09-16T15:41:16Z

It would cure the symptoms, but I thought we decided against it in the beginning, since copying is a heavy (memory) operation and unexpected at this point.

Symptoms implying there is a deeper underlying issue? To be fair I don't think it is very heavy in terms of memory since there is no data attached to models, just statistics. Also isn't it rather unexpected that a factory returns references?

marscher · 2019-09-17T09:41:13Z

I'll add the default copy then, but it also involves forcing implementations of fetch_model to do the copy. I think it would be much clearer if fit() produced a new instance.

marscher · 2019-09-17T11:48:50Z

Creating the instance prior to each invocation of fit is rather easy to do and minimally invasive, because people should derive from Estimator in any case. There is a note in the docstring describing this behavior, and we can also put it in the developer docs at some point.
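One way a base class could create a fresh model before every fit without touching the bodies of overridden fit methods is to wrap them at subclass-definition time. The sketch below uses `__init_subclass__` for that; it is an assumption about how such a hook might look, not the code merged in this PR, and `MeanModel`, `MeanEstimator`, and `_with_fresh_model` are hypothetical names.

```python
import functools


class Estimator:
    """Hypothetical base class; a sketch, not the sktime implementation."""

    _model = None

    def __init_subclass__(cls, **kwargs):
        super().__init_subclass__(**kwargs)
        # Wrap only a fit() that is defined directly on this subclass.
        if "fit" in cls.__dict__:
            cls.fit = Estimator._with_fresh_model(cls.__dict__["fit"])

    @staticmethod
    def _with_fresh_model(fit):
        @functools.wraps(fit)
        def wrapper(self, *args, **kwargs):
            # A new model instance is created before the user's fit() runs.
            self._model = self._create_model()
            return fit(self, *args, **kwargs)

        return wrapper

    def _create_model(self):
        raise NotImplementedError

    def fetch_model(self):
        return self._model


class MeanModel:
    def __init__(self):
        self.mean = None


class MeanEstimator(Estimator):
    def _create_model(self):
        return MeanModel()

    def fit(self, data):
        # The author of fit() only fills in the current model.
        self._model.mean = sum(data) / len(data)
        return self


est = MeanEstimator()
first = est.fit([1.0, 2.0, 3.0]).fetch_model()
second = est.fit([10.0, 20.0, 30.0]).fetch_model()
```

With this hook the two fetched models are distinct objects, so no copy is needed downstream. A caveat of this particular sketch: a subclass whose fit calls `super().fit()` on another wrapped fit would trigger model creation twice.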

clonker · 2019-09-17T14:50:31Z

Sounds good to me in principle, but is that compatible with partial fit? I just suggested fetch_model because of that.

marscher · 2019-09-17T15:09:57Z

The default Estimator constructor checks for this case and creates the model if partial_fit is implemented.
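A rough sketch of that constructor check follows, under the assumption that "implements partial_fit" can be detected with `hasattr`. The class names and the running-mean model are hypothetical and only illustrate why a streaming estimator needs a model before its first partial_fit call.

```python
class RunningMeanModel:
    """Toy model accumulating sufficient statistics (hypothetical)."""

    def __init__(self):
        self.count = 0
        self.total = 0.0

    @property
    def mean(self):
        return self.total / self.count if self.count else None


class Estimator:
    """Hypothetical base class; a sketch, not the sktime implementation."""

    def __init__(self):
        # Streaming estimators get a model up front so that the first
        # partial_fit() has something to update.
        self._model = self._create_model() if hasattr(self, "partial_fit") else None

    def _create_model(self):
        raise NotImplementedError

    def fetch_model(self):
        return self._model


class RunningMeanEstimator(Estimator):
    def _create_model(self):
        return RunningMeanModel()

    def partial_fit(self, chunk):
        # Updates the existing model; no new instance here.
        self._model.count += len(chunk)
        self._model.total += sum(chunk)
        return self

    def fit(self, data):
        # A full fit starts from a fresh model.
        self._model = self._create_model()
        return self.partial_fit(data)


est = RunningMeanEstimator()
est.partial_fit([1.0, 2.0]).partial_fit([3.0])
streamed_model = est.fetch_model()
streamed_mean = streamed_model.mean

refit_model = est.fit([10.0]).fetch_model()
```

Here partial_fit keeps accumulating into one model across calls, while fit replaces it, so a model fetched after streaming is left untouched by a later fit.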

clonker

very nitpicky probably 😇

sktime/base.py

sktime/decomposition/tica.py

sktime/decomposition/vamp.py

sktime/lagged_model_validator.py

sktime/numeric/__init__.py

marscher · 2019-09-24T16:37:50Z

@clonker figured it out by myself. Thanks for the rigorous review, it was very helpful!

clonker · 2019-09-24T20:24:49Z

sweet! what was it in the end?

marscher · 2019-09-24T20:31:11Z

On 24.09.19 22:24, Moritz Hoffmann wrote: sweet! what was it in the end?

An indexing issue (mlag0 offset+1) and too high a precision for testing (I just adopted the pyemma rtol and atol).
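For readers unfamiliar with rtol and atol, the sketch below shows how a combined relative and absolute tolerance behaves. The helper mirrors the criterion documented for `numpy.isclose`, `|actual - desired| <= atol + rtol * |desired|`, in plain Python. The numbers and tolerance values are made up for illustration; the actual pyemma tolerances are not stated in this thread.

```python
def allclose(actual, desired, rtol, atol):
    """Elementwise check: |a - d| <= atol + rtol * |d| for every pair."""
    return all(abs(a - d) <= atol + rtol * abs(d) for a, d in zip(actual, desired))


# Hypothetical reference values and estimates that differ by tiny numerical noise.
reference = [1.0, 0.5, 0.25]
estimate = [1.0 + 3e-9, 0.5 - 2e-9, 0.25 + 1e-9]

# A tolerance tighter than the noise level makes the comparison fail ...
too_strict = allclose(estimate, reference, rtol=1e-12, atol=0.0)

# ... while a looser (illustrative) tolerance accepts the same estimates.
relaxed = allclose(estimate, reference, rtol=1e-5, atol=1e-8)
```

The absolute term matters for entries near zero, where a purely relative tolerance would demand near-exact agreement.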

clonker · 2019-09-24T20:43:03Z

yeah indexing issues are nasty.. nice!

marscher force-pushed the add_vamp branch from c1c1b8c to ac30156 on September 11, 2019 17:01

marscher added 20 commits September 12, 2019 16:34

[vamp] added vamp, test_vamp

18f2535

wip

f696909

some refactoring

7920e9b

add Estimator.fetch_model(copy_flag)

e2b1278

ck works

756f240

added doublewell test data

6dc5fc5

use sktime dw test data

24dec2d

[vamp] fix ck

942845e

be explicit in type annotation

c9a45d5

wip

180f49e

ck fixes, 2 failing tests remain.

227ee74

[model] revert copy flag addition from fetch_model

ed81086

fixes deeptime-ml#20

a417d21

[laggedmodelvalidator] take copy of test model to avoid side-effects.

188011a

remove duped code

350834f

minor changes

c0d2d60

[lagged model validator] process models in seperate loops (cache misses?)

058776f

store lagtimes in LaggedModelValidation

4ee3910

[LaggedModelvalidator] avoid side-effects by copying test_estimator

0cc548c

Actually this should not be needed, as estimators are a plain model factory.

[test-cktest msm] needed to lower precision for estimates comparision

0ea8cfb

TODO: investigate why

marscher force-pushed the add_vamp branch from ae7ee67 to 0ea8cfb on September 12, 2019 16:23

marscher requested a review from clonker September 12, 2019 16:48

marscher mentioned this pull request Sep 12, 2019

ck-test: deal with smaller active set in propagate function (which refers to a larger set) #2

Closed

[ci] use strict channel priority

dff789f

marscher added 2 commits September 17, 2019 13:44

[base] create a new model instance upon fit calls.

301c8a5

adopt test cases to new model instances upon fit.

c885538

clonker reviewed Sep 17, 2019

marscher added 3 commits September 17, 2019 18:08

model estimator cleanup

bd312f9

[lagged model validator] set input lag time on construction. Warn for borked mlags.

7d58953

mdot iterative

26fa043

clonker approved these changes Sep 17, 2019

marscher added 2 commits September 17, 2019 18:30

[lagged model validators] disallow mlag=0

0b2e370

fix cktests for removed mlag=0 case.

1b3565b

@clonker: I'd be happy, if you could explain the deviation in msm cktest... I couldn't spot it :(

clonker approved these changes Sep 17, 2019

marscher changed the title ~~Add vamp~~ Add VAMP, CKTests (MSM and VAMP) Sep 24, 2019

marscher added 3 commits September 24, 2019 16:57

fix cktest msm

68032ed

[cktest] do not copy model (guaranteed by estimator-fit() new model instance).

3a9163f

[test-ck] use same precision as in pyemma.

7cc8380

marscher merged commit 071108c into deeptime-ml:master Sep 24, 2019

marscher deleted the add_vamp branch September 24, 2019 16:37
