[WIP] FFT based equilibration detection. #112

kyleabeauchamp · 2014-09-26T18:48:54Z

One issue is that some of the previous "tricks" are ineffective (e.g. fast=True), as the FFT-based approach calculates all lagtimes simultaneously.

For the search the discard region t, I've elected to just do binary search of the logarithmically-spaced lagtime grid.

kyleabeauchamp · 2014-09-26T18:49:39Z

Right now I'm using the correlation function code from statsmodels. We can backport their code to avoid an extra dependency, once we finalize the big picture roadmap here

jchodera · 2014-09-26T19:38:34Z

What about the code for this contributed from @trendelkampschroer ?

kyleabeauchamp · 2014-09-26T19:39:28Z

It was a major refactoring of our current code, which would require more work than the current PR.

jchodera · 2014-09-26T19:39:35Z

Also, we had discussed using only N ~ 50 origin time points to evaluate as potential time origins t0. Did you have any luck with that?

jchodera · 2014-09-26T19:41:42Z

One issue is that some of the previous "tricks" are ineffective (e.g. fast=True), as the FFT-based approach calculates all lagtimes simultaneously.

Do you mean all lag times simultaneously, or time origins t0 simultaneously?

kyleabeauchamp · 2014-09-26T19:42:35Z

All lag times, not time origins. The time origin still needs to be searched manually.

kyleabeauchamp · 2014-09-26T19:44:00Z

Basically, the idea here is:

Iteratate time origins t0 via binary search on logarithmically spaced values.
For each t0, calculate C(t) for all lagtimes t via FFT.
Find maximum value of t0 and repeat on a finergrid of t0.

jchodera · 2014-09-26T19:45:29Z

It was a major refactoring of our current code, which would require more work than the current PR.

I thought we only needed to replace the statistical_inefficiency method and grab the dependencies (not refactored, just new methods) from @trendelkampschroer's timeseries.py. That would make everything downstream of statistical_inefficiency, including detect_equilibration, fast.

I do like the idea of the binary search, but note that the computed effective number of samples isn't necessarily concave in t0.

jchodera · 2014-09-26T19:46:06Z

Also, I'm working on a short paper on the detect_equilibration. This could be a neat follow-up, or maybe we should combine...

kyleabeauchamp · 2014-09-26T19:49:20Z

The issue is that the current statistical inefficiency uses a heuristic for skipping lagtimes, which isn't as useful when you use the FFT approach, which gives you results for all lagtimes simultaneously.

Also, https://github.com/trendelkampschroer/pymbar/blob/master/src/pymbar/timeseries.py has lots of extra stuff in there.

kyleabeauchamp · 2014-09-26T19:50:45Z

I agree that our function is not concave, so my heuristic gives only a local maximum. However, we're already using heuristics, so it's not a deal-breaker.

jchodera · 2014-09-26T19:54:31Z

I agree that FFT is better than the multiresolution approach I coded.

There's extra stuff in @trendelkampschroer's timeseries.py, but we only need statistical_inefficiency and its dependencies. And that will speed up all statistical inefficiency calculations, including detect_equilibration. I would prefer this to hacking FFT in for only detect_equilibration.

The statsmodel dependency isn't awful---@trendelkampschroer uses scipy instead. Not sure either is a big headache.

Let's definitely try out the binary search of t0 idea!

kyleabeauchamp · 2014-09-26T19:58:00Z

So what about the Newton's method code? His code for statistical inefficiency calls a Newton optimizer. That's a big change from the current code.

kyleabeauchamp · 2014-09-26T19:58:04Z

Also, there are no tests.

kyleabeauchamp · 2014-09-26T20:01:04Z

I chose statsmodels to calculate the ACF to avoid duplicating code that is maintained (https://github.com/statsmodels/statsmodels/blob/master/statsmodels/tsa/stattools.py#L362) and tested (https://github.com/statsmodels/statsmodels/blob/master/statsmodels/tsa/tests/test_stattools.py) elsewhere.

jchodera · 2014-09-26T20:05:56Z

I'm happy with the statsmodel version, but would prefer this go into statistical_inefficiency (perhaps as a selectable method?) instead of just detect_equilibration. That way, all uses can benefit.

That sound reasonable?

kyleabeauchamp · 2014-09-26T20:11:44Z

It's just not possible for all uses to benefit, because the FFT approach only works for a single input observable. The current statisticalIneffiency(A, B) is more general. Also, the current statisticalIneffiency() has a Python while loop hard-coded, which we have avoided in the new code entirely.

How about I have statisticalIneffiency(A) call statisticalIneffiency_fft(A) when B is set to None? I think that will lead to the most readable code.

jchodera · 2014-09-26T20:22:58Z

Sounds great!

jchodera · 2014-10-17T02:30:26Z

I think there were still some planned changes here. Specifically, you were going to rework detect_equilibration() to use the normal statisticalInefficiency(), but statisticalInefficiency would use the FFT code when only one argument (A_t) was specified.

kyleabeauchamp · 2014-10-17T17:54:40Z

So I'm going to open an new PR for the FFT code.

kyleabeauchamp added 3 commits September 26, 2014 14:36

Added fft timeseries.

476f020

Updated docstring.

d9211ee

Added better docstring.

c6a7d22

kyleabeauchamp added 4 commits September 26, 2014 16:59

statisticalIneffiency() calls fft version when possible.

f4a19db

Fix docstring in timeseries

5114593

Merge remote-tracking branch 'upstream/nk' into fft

281509a

Added numexpr and statsmodels to requirements.

f319fb0

kyleabeauchamp mentioned this pull request Oct 16, 2014

Add FFT-based correlation time computation to timeseries module #7

Closed

Merge remote-tracking branch 'upstream/nk' into fft

5659ebb

Merge remote-tracking branch 'upstream/nk' into fft

9d617c7

kyleabeauchamp closed this Oct 17, 2014

kyleabeauchamp deleted the fft branch January 3, 2015 17:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] FFT based equilibration detection. #112

[WIP] FFT based equilibration detection. #112

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

jchodera commented Oct 17, 2014

kyleabeauchamp commented Oct 17, 2014

[WIP] FFT based equilibration detection. #112

[WIP] FFT based equilibration detection. #112

Conversation

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

kyleabeauchamp commented Sep 26, 2014

jchodera commented Sep 26, 2014

jchodera commented Oct 17, 2014

kyleabeauchamp commented Oct 17, 2014