Wildly varying time to run STAMP with query #36

Closed

ginoa opened this issue Dec 10, 2018 · 4 comments

ginoa commented Dec 10, 2018

Description
The running time of STAMP varies wildly with the size of the data. For some data sizes it even appears to hang without ever showing the progress bar.
For example, computing the distance profile between a dataset and a query of length 34 with a window size of 13, I observe the following:

  • average of 1 it/s for dataset of length 620579
  • average of 0.5 it/s for dataset of length 620578
  • average of 0.3 it/s for dataset of length 620577
  • average of 2.5 it/s for dataset of length 620576
  • average of 0.04 it/s for dataset of length 620574
  • for some dataset sizes, stamp appears to hang without an error; more likely it simply takes so long to reach the point of showing the progress bar that I lose patience and try to interrupt it, and I then have to hard-kill RStudio to stop the evaluation.

In other words, changing the dataset size by a single element has a large, nonlinear, and unintuitive effect on the execution time of stamp, and for some sizes the function appears to hang without an error.

Working examples
I could reproduce this behavior with random data.

ref_data <- rnorm(620576)
query_data <- rnorm(34)
tst <- tsmp::stamp(ref_data, query_data, window_size = 13)
## execution speed: 0.4 it/s

ref_data <- rnorm(620577)
query_data <- rnorm(34)
tst <- tsmp::stamp(ref_data, query_data, window_size = 13)
## execution speed: 2.5 it/s

Expected behavior
I would expect the running time to get shorter as the data gets smaller.

franzbischoff (Member) commented

Confirmed. This behaviour is not present with STOMP.
I'll have to profile the code and check where the bottleneck is.

Thanks.

franzbischoff self-assigned this Dec 10, 2018
franzbischoff added this to the v0.3.5 milestone Dec 10, 2018
franzbischoff (Member) commented

The bottleneck is stats::fft(); this may have something to do with the FFT algorithm. Padding the data with zeroes (the right amount) might solve it. I need to look into this.
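
For context, a minimal sketch of why padding helps (an illustration, not the package code): R's stats::fft() uses a mixed-radix algorithm and is documented to be fastest when the series length is highly composite, so a length with large prime factors can be dramatically slower than a slightly different one. Padding to the next power of two with stats::nextn() sidesteps this:

x <- rnorm(620574)                     # one of the "slow" lengths reported above
## 620574 == 2 * 3 * 293 * 353: large prime factors make the mixed-radix FFT slow
n_pad <- stats::nextn(length(x), 2)    # next power of two >= length(x)
x_pad <- c(x, rep(0, n_pad - length(x)))
system.time(stats::fft(x))             # slow at this length
system.time(stats::fft(x_pad))         # fast, despite transforming more points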

franzbischoff (Member) commented

Implementing mass_v3 solves this problem:

https://www.cs.unm.edu/~mueen/MASS_V3.m
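
For readers following along, here is a minimal sketch of the MASS_V3 idea in R (my own illustration under the assumed names mass_v2 and mass_v3, not the tsmp implementation): the data is processed in overlapping chunks whose length k is a power of two, so every FFT runs on an FFT-friendly size no matter what length the original series has.

mass_v2 <- function(data, query) {
  n <- length(data); m <- length(query)
  # sliding dot products via FFT: reverse the query, zero-pad to length n
  conv <- fft(fft(data) * fft(c(rev(query), rep(0, n - m))), inverse = TRUE) / n
  qt <- Re(conv)[m:n]
  # moving mean and sd of the data over windows of length m, via cumulative sums
  cs  <- cumsum(c(0, data)); cs2 <- cumsum(c(0, data^2))
  mu  <- (cs[(m + 1):(n + 1)] - cs[1:(n - m + 1)]) / m
  sig <- sqrt((cs2[(m + 1):(n + 1)] - cs2[1:(n - m + 1)]) / m - mu^2)
  qmu  <- mean(query)
  qsig <- sqrt(mean((query - qmu)^2))   # population sd, as in MASS
  # z-normalised Euclidean distance profile
  sqrt(pmax(2 * (m - (qt - m * mu * qmu) / (sig * qsig)), 0))
}

mass_v3 <- function(data, query, k = 2^16) {
  n <- length(data); m <- length(query)
  stopifnot(k >= m)
  dp <- numeric(n - m + 1)
  start <- 1
  while (start <= n - m + 1) {
    chunk <- data[start:min(start + k - 1, n)]   # at most k points, FFT-friendly
    d <- mass_v2(chunk, query)
    dp[start:(start + length(d) - 1)] <- d
    start <- start + k - m + 1                   # overlap by m - 1 so no window is split
  }
  dp
}

dp <- mass_v3(rnorm(620574), rnorm(13))   # distance profile at a previously "slow" length

With a window-length query this returns the same profile as a single full-length MASS call, but each fft() sees a power-of-two length, which is the property the fix relies on (the final, shorter chunk is left unpadded in this sketch).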

franzbischoff (Member) commented

Changes STAMP to use MASS_V3
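
If the fix works as intended, re-running the reproduction above (assuming no API change) should give comparable speeds for adjacent data lengths:

for (n in 620574:620579) {
  ref_data <- rnorm(n)
  query_data <- rnorm(34)
  tst <- tsmp::stamp(ref_data, query_data, window_size = 13)
  ## expected: roughly uniform it/s across all six lengths
}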
