
rolling functions, rolling aggregates, sliding window, moving average #2778

Open
jangorecki opened this issue Apr 21, 2018 · 26 comments
@jangorecki commented Apr 21, 2018

Creating this issue to gather requirements in a single place and refresh the ~4-year-old discussions covering the rolling functions feature (also known as rolling aggregates, sliding windows, or moving averages/moving aggregates).

rolling functions

  • rollmean
  • rollsum
  • rollmin
  • rollmax
  • rollmedian
  • rollprod
  • rollsd
  • rollvar
  • rollrank
  • rollapply (user provided FUN)
  • rollregression (highly demanded)
  • rollcor?
  • rollcov?

features

  • multiple columns at once
  • multiple windows at once
  • multiple columns and multiple windows at once
  • atomic vectors input and single window returns atomic vectors
  • various length list of vectors
  • align: left/center/right
  • handling NAs
  • fill constant
  • long vector support
  • partial window support (if needed can be found in ea766f2)
  • adaptive window support
  • use openmp to parallelize calculation of multiple columns/windows
  • rounding error correction
  • timing in verbose mode from parallel region (blocked by #3422, #3423)
@jangorecki commented Apr 21, 2018

Proposed rollmean implementation, simplified.

x = data.table(v1=1:5, v2=1:5)
k = c(2, 3)
# i - single column
# j - single window size
# m - integer index of the current row
# w - running sum of the current window
# r - answer for each i, j
for (i in x) {
  for (j in k) {
    r = rep(NA_real_, length(i))
    w = 0
    for (m in seq_along(i)) {
      w = w + i[m]
      if (m > j) w = w - i[m - j]   # drop the value that left the window
      if (m >= j) r[m] = w / j      # complete windows only; else NA
    }
  }
}
@MichaelChirico commented Apr 21, 2018 (comment minimized)

@st-pasha commented Apr 21, 2018

I always envisioned rolling window functionality as grouping the dataset into multiple overlapping groups (windows). Then the API would look something like this:

DT[i, j,
   by = roll(width=5, align="center")]

Then if j contains, say, mean(A), we can internally replace it with rollmean(A) -- exactly like we are doing with gmean() right now. Or j can contain an arbitrarily complicated functionality (say, run a regression for each window), in which case we'd supply .SD data.table to it -- exactly like we do with groups right now.

This way there's no need to introduce 10+ new functions, just one. And it feels data.table-y in spirit too.

@MichaelChirico commented Apr 21, 2018 (comment minimized)

@jangorecki commented Apr 21, 2018

@st-pasha interesting idea, and it does look like data.table-y spirit, but it would impose many limitations and isn't really appropriate for this category of functions.


  • how to perform rollmean by group?
    DT[, rollmean(V1, 3), by=V2]
  • how to calculate different window sizes for different columns?
    DT[, .(rollmean(V1, 3), rollmean(V2, 100))]
  • how to calculate rollmean outside of [.data.table, as we now allow for shift?
    rollmean(rnorm(10), 3)
  • how to support queries like
    DT[, .(rollmean(list(V1, V2), c(5, 20)), rollmean(list(V2, V3), c(10, 30)))]
  • how to call mean and rollmean in the same j call?
    DT[, .(rollmean(V1, 3), mean(V1)), by=V2]

Usually when using by we aggregate data to a smaller number of rows, while rolling functions always return a vector of the same length as the input. These types of functions in SQL have an API in the following style:

SELECT AVG(value) OVER (ROWS BETWEEN 99 PRECEDING AND CURRENT ROW)
FROM tablename;

You can still combine it with GROUP BY as follows:

SELECT AVG(value) OVER (ROWS BETWEEN 99 PRECEDING AND CURRENT ROW)
FROM tablename
GROUP BY group_columns;

So in SQL those functions stay in SELECT, which corresponds to j in DT.
In DT we could achieve the same with:

DT[, rollmean(value, 100)]
DT[, rollmean(value, 100), group_columns]

Rolling functions fit into the same category as shift, which also returns the same number of rows as it receives on input.
Shift in SQL looks like:

SELECT LAG(value, 1) OVER ()
FROM tablename;

mean and rollmean are not just different functions, they are different categories of functions: one is meant to aggregate according to groups, the other not to aggregate at all. This is easily visible in SQL, where we don't use GROUP BY for the rolling-function type, but we do need GROUP BY for aggregates like mean (eventually getting the grand total when the grouping clause is not present).
I don't see strong reasoning for applying the same optimization rules as we do for mean, especially when it doesn't really fit the use case, and all that just for the sake of data.table-y spirit. The current proposal is in data.table-y spirit too: it can easily be combined with :=, same as shift. It just adds a set of new functions currently not available in data.table.
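For illustration, a rolling function combines with := and by just like shift does; a minimal sketch (using the froll* naming eventually adopted, purely illustrative here):

```r
library(data.table)
DT = data.table(grp = rep(c("a", "b"), each = 5), value = 1:10)
# rolling mean within each group, appended by reference --
# same number of rows as the input, exactly like shift()
DT[, roll3 := frollmean(value, 3), by = grp]
```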

@st-pasha commented Apr 22, 2018

@jangorecki Thanks, these are all valid considerations. Of course different people have different experiences, and different views as to what should be considered "natural".

It is possible to perform rollmean by group: this is just a 2-level grouping: DT[, mean(V1), by=.(V2, roll(3))]. However, I don't see how to make different window sizes on different columns with my syntax...

I must admit I never seen SQL syntax for rolling joins before. It's interesting that they use standard aggregator such as AVG yet apply the windowing specification to it. Looking at the Transact-SQL documentation there are some interesting ideas there, for example the distinction between logical/physical row selection. They do allow different "OVER" operators on different columns, however in all examples they give, it is the same OVER clause repeated multiple times. So it suggests that this use-case is much more common, and hence using a single roll() group would result in less repetition.

Also, this SO question provides an interesting insight why the OVER syntax was introduced in SQL at all:

You can use GROUP BY SalesOrderID. The difference is, with GROUP BY you can only have the aggregated values for the columns that are not included in GROUP BY. In contrast, using windowed aggregate functions instead of GROUP BY, you can retrieve both aggregated and non-aggregated values. That is, although you are not doing that in your example query, you could retrieve both individual OrderQty values and their sums, counts, averages etc. over groups of same SalesOrderIDs.

So it appears that the syntax is designed to circumvent the limitation of standard SQL where group-by results could not be combined with unaggregated values (i.e. selecting both A and mean(A) in the same expression). However data.table does not have such a limitation, so it has more freedom in its choice of syntax.


Now, if we want to really get ahead of the curve, we need to think in a broader perspective: what are the "rolling" functions, what are they used for, how can they be extended, etc. Here's my take on this, coming from a statistician's point of view:

"Rolling mean" function is used to smooth some noisy input. Say, if you have observations over time and you want to have some notion of "average quantity", which would nevertheless vary over time although very slowly. In this case "rolling mean over last 100 observations" or "rolling mean over all previous observations" can be considered. Similarly, if you observe certain quantity over a range of inputs, you may smooth it out by applying "rolling mean over ±50 observations".

  • So, the first extension is to look at "smooth windows": imagine a mean over past observations where the further an observation in the past, the less its contribution is. Or an average of nearby observations over a Gaussian kernel.
  • Second are adaptive windows. For example, if you have a noisy input defined over an interval [0, 1], then smoothing it using a ±N window produces a biased result near the edges. An unbiased estimator would adapt the shape of the window based on the distance from the edge.
  • Resample smoothing: the restriction to produce exactly as many observations as in the source data is too limiting. If you think of your data as noisy observations of some underlying function, then it is perfectly reasonable to ask to compute the smoothed values of that function on a mesh that is coarser / finer than the original input. Or perhaps the original data is spaced irregularly and you want to resample it onto a regular grid.
  • Jackknife: for each observation you want to compute average/regression over all observations except the current.
  • K-fold split: view data as multiple groups where each group excludes only a small portion of the observations.

All of these can be implemented as extended grouping operators, with rolling windows being just one of the elements on this list. That being said, I don't see why we can't have it both ways.

@jangorecki commented Apr 23, 2018

I must admit I never seen SQL syntax for rolling joins before.

I assume you mean rolling functions; this issue has nothing to do with rolling joins.

They do allow different "OVER" operators on different columns, however in all examples they give, it is the same OVER clause repeated multiple times. So it suggests that this use-case is much more common, and hence using a single roll() group would result in less repetition.

It is just a matter of use case: if you are calling the same OVER() many times, you may find it more performant to use GROUP BY, build a lookup table, and re-use it in other queries. Whatever the examples show, repeating OVER() is required to retain the locality of each measure provided. My use cases from Data Warehouses were not as simple as those in the Microsoft docs.

In contrast, using windowed aggregate functions instead of GROUP BY, you can retrieve both aggregated and non-aggregated values.

In data.table we do := and by in one query to achieve it.

So it appears that the syntax is designed to circumvent the limitation of standard SQL where group-by results could not be combined with unaggregated values (i.e. selecting both A and mean(A) in the same expression). However data.table does not have such a limitation, so it has more freedom in its choice of syntax.

It isn't so much a limitation of SQL as the design of GROUP BY: it aggregates, the same way that our by aggregates. A new API was required to cover the new window functionality. Grouping for an SQL window function can be provided per function call using FUN() OVER (PARTITION BY ...), where the partition by acts as local grouping for a single measure. So to achieve the flexibility of SQL we would need to use j = mean(V1, roll=5) or j = over(mean(V1), roll=5), keeping that API in j. Still, this approach would not support all the use cases mentioned above.

you may smooth it out by applying "rolling mean over ±50 observations".

This is what align argument is used for.

So, the first extension is to look at "smooth windows": imagine a mean over past observations where the further an observation in the past, the less its contribution is. Or an average of nearby observations over a Gaussian kernel.

There are many variants (a virtually unlimited number) of moving averages; the most common smoothing window function other than rollmean/SMA is the exponential moving average (EMA). Which should be included and which not is not trivial to decide, and it is actually best to make that decision according to the feature requests that come from users; so far none like this has been requested.
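For reference, a minimal EMA sketch in plain R (not part of data.table; alpha is the usual smoothing factor):

```r
# exponential moving average: each result mixes the new value with the
# previous result, so older observations contribute geometrically less
ema = function(x, alpha) {
  r = numeric(length(x))
  r[1] = x[1]
  for (m in seq_along(x)[-1]) r[m] = alpha * x[m] + (1 - alpha) * r[m - 1]
  r
}
ema(c(1, 2, 3, 4), alpha = 0.5)  # 1.000 1.500 2.250 3.125
```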

All of these can be implemented as extended grouping operators, with rolling windows being just one of the elements on this list.

Surely they can, but if you look at SO and the issues created in our repo, you will see that the few rolling functions listed here are responsible for 95+% of user requests. I am happy to work on EMA and other MAs (although I am not sure data.table is the best place for those), but as a separate issue. Some users, me included, have been waiting for just a simple moving average in data.table for 4 years already.

Here's my take on this, coming from a statistician's point of view

My point of view comes from Data Warehousing (where I used window functions at least once a week) and price trend analysis (where I used tens of different moving averages).

jangorecki added a commit that referenced this issue Apr 24, 2018
@jangorecki commented Apr 24, 2018

rollmean draft is pushed to the roll branch. I found that most other packages implementing rolling mean are not able to deal well with na.rm=FALSE when NAs are present in the input. This implementation handles NA consistently with mean, which imposes some extra overhead because of ISNAN calls. We could allow the API to expose a faster but less safe version for users who are sure there are no NAs in the input.
PR is in #2795
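A small illustration of the NA behaviour described above, using the frollmean interface from that branch (na.rm defaults to FALSE, consistent with mean):

```r
library(data.table)
x = c(1, 2, NA, 4, 5)
frollmean(x, 2)                # NA 1.5 NA NA 4.5 -- NA poisons its windows
frollmean(x, 2, na.rm = TRUE)  # NA 1.5 2.0 4.0 4.5 -- NA skipped within window
```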

@jangorecki commented Apr 27, 2018

@mattdowle answering questions from PR

Why are we doing this inside data.table? Why are we integrating it instead of contributing to existing packages and using them from data.table?

  1. There were 3 different issues created asking for this functionality in data.table, as well as multiple SO questions tagged data.table. Users expect it to be in scope for data.table.
  2. data.table fits time-series data perfectly, and rolling aggregates are a pretty useful statistic there.

my guess is it comes down to syntax (features only possible or convenient if built into data.table; e.g. inside [...] and optimized) and building data.table internals into the rolling function at C level; e.g. froll* should be aware and use data.table indices and key. If so, more specifics on that are needed; e.g. a simple short example.

For me personally it is about speed and the lack of a chain of dependencies, which is nowadays not easy to achieve.
Key/indices could be useful for frollmin/frollmax, but it is unlikely that a user will create an index on a measure variable; we also haven't made this optimization for min/max yet. I don't see much sense in GForce optimization because the allocated memory is not released after the froll* call but returned as the answer (as opposed to non-rolling mean, sum, etc.).

If there is no convincing argument for integrating, then we should contribute to the other packages instead.

I listed some above; if you are not convinced, I recommend you put the question to data.table users, ask on Twitter, etc. to check the response. This feature has been requested for a long time and by many users. If the response doesn't convince you, then you can close this issue.

jangorecki added a commit that referenced this issue May 19, 2018
jangorecki added a commit that referenced this issue May 29, 2018
@harryprince commented Aug 14, 2018

I found that sparklyr can support rolling functions very well on very large-scale datasets.

@jangorecki commented Aug 15, 2018

@harryprince could you shed a little more light on this by providing example code of how you do it in sparklyr?
According to "Window functions" dplyr vignette

Rolling aggregates operate in a fixed width window. You won’t find them in base R or in dplyr, but there are many implementations in other packages, such as RcppRoll.

AFAIU you use a custom Spark API via sparklyr, for which the dplyr interface is not implemented, correct?

This issue is about rolling aggregates; other "types" of window functions have been in data.table for a long time.

@MichaelChirico commented Aug 15, 2018

Providing some example so we can compare (in-memory) performance vs sparklyr/SparkR would also be helpful.

@st-pasha commented Aug 15, 2018

It just occurred to me that this question:

how to calculate different window sizes for different columns?

has in fact a broader scope, and does not apply to rolling functions only.

For example, it seems perfectly reasonable to ask how to select the average product price by date, then by week, and then maybe by week+category -- all within the same query. If we were ever to implement such functionality, the natural syntax for it could be

DT[, .( mean(price, by=date), 
        mean(price, by=week), 
        mean(price, by=c(week, category)) )]

Now, if this functionality were already implemented, it would be a simple leap from there to rolling means:

DT[, .( mean(price, roll=5), 
        mean(price, roll=20), 
        mean(price, roll=100) )]

Not saying that this is unequivocally better than rollmean(price, 5) -- just throwing in some alternatives to consider...

@jangorecki commented Aug 15, 2018

@st-pasha

how to select the average product price by date, and then by week, and then maybe by week+category -- all within the same query.

AFAIU this is already possible using ?groupingsets, but not hooked into [.data.table yet.

jangorecki added a commit that referenced this issue Nov 11, 2018
@jangorecki jangorecki mentioned this issue Dec 15, 2018
@jangorecki jangorecki removed this from the 1.12.0 milestone Jan 5, 2019
@msummersgill commented Jan 29, 2019

@jangorecki , @st-pasha , and Co. -- Thanks for all your work on this! I'm curious why partial window support was removed from the scope; is there any potential for that functionality to make it back onto the roadmap? It would come in handy for me sometimes, and would fill a functionality gap that to my knowledge hasn't been filled in either zoo or RcppRoll.

This Stack Overflow Question is a good example of a rolling application that could benefit from a partial = TRUE argument.

@jangorecki commented Jan 30, 2019

@msummersgill Thanks for the feedback. In the first post I explicitly linked the commit sha where the partial window feature code can be found. The implementation that was there was later removed to reduce the complexity of the code; it also imposed a small performance cost even when the feature was not used. The feature can (and probably should) be implemented the other way around: compute the complete windows first, and then fill up the missing partial windows using an extra loop over 1:window_size, so its overhead is only noticeable when you actually use it. Nevertheless we do provide that functionality via the adaptive argument, where partial is just a special case of an adaptive rolling mean. There is an example of how to achieve partial using adaptive in the ?froll manual. Pasting it here:

d = as.data.table(list(1:6/2, 3:8/4))
an = function(n, len) c(seq.int(n), rep(n, len-n))
n = an(3, nrow(d))
frollmean(d, n, adaptive=TRUE)

Of course it will not be as efficient as a non-adaptive rolling function that uses an extra loop to fill up just the partial windows.
AFAIK zoo has the partial feature.
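The "extra loop" approach described above can be sketched as follows (partial_rollmean is a hypothetical helper, not data.table API):

```r
# compute complete windows first, then patch the k-1 leading partial
# windows with cumulative means over the available observations
partial_rollmean = function(x, k) {
  r = data.table::frollmean(x, k)
  for (m in seq_len(min(k - 1, length(x))))
    r[m] = mean(x[seq_len(m)])
  r
}
partial_rollmean(c(1, 2, 3, 4), 3)  # 1.0 1.5 2.0 3.0
```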

jangorecki added a commit that referenced this issue May 24, 2019
mattdowle added a commit that referenced this issue May 25, 2019
@jangorecki jangorecki mentioned this issue May 26, 2019
@waynelapierre commented May 29, 2019

Do you guys have any plans to add rolling regression functions to data.table?

@jangorecki commented May 29, 2019

@waynelapierre if there is demand for that, then yes. You have my +1

@randomgambit commented Jul 1, 2019

thanks, this is great. Just one question though: I only see simple rolling aggregates, like a rolling mean or rolling median. Are you also implementing more refined rolling functions, such as rolling DT data frames? Say, create a rolling DT using the last 10 obs and run an lm regression on it.

Thanks!

@jangorecki commented Jul 1, 2019

@randomgambit I would say it is out of scope, unless there is high demand for it. It wouldn't be very difficult to make it faster than base R/zoo just by handling the nested loop in C. But we should try to implement it using an "online" algorithm, to avoid the nested loop. This is more tricky, and since we could eventually do that for any statistic, we have to cut off the list of supported statistics at some point.
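To make the use case concrete, here is a nested-loop baseline in plain R (no data.table API implied); an online version would instead update the regression's sufficient statistics (sums of x, y, x^2, x*y) incrementally as the window slides:

```r
set.seed(1)
d = data.frame(x = rnorm(20), y = rnorm(20))
k = 10
# slope of lm(y ~ x) over each trailing window of k rows;
# incomplete leading windows are NA, matching rolling-function conventions
slopes = c(rep(NA_real_, k - 1),
           sapply(k:nrow(d), function(m) {
             w = (m - k + 1):m
             coef(lm(y ~ x, data = d[w, ]))[["x"]]
           }))
```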

@randomgambit commented Jul 1, 2019

@jangorecki interesting thanks. That means I will keep using tsibble to embed... DATA.TABLES in a tibble! mind blown :D

@MichaelChirico commented Aug 7, 2019

Tried to use frollmean to calculate a nonparametric "logistic curve" which shows P[y | x] for binary y using the nearest neighbors of x. I was surprised that y stored as logical was not cast automatically to integer:

DT = data.table(x = rnorm(1000), y = runif(1000) > .5)
DT[order(x), .(x, p_y = frollmean(y, 50L))]

Error in froll(fun = "mean", x = x, n = n, fill = fill, algo = algo, align = align, :
x must be of type numeric
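A workaround (assuming an explicit numeric coercion of the logical column is acceptable) is:

```r
DT[order(x), .(x, p_y = frollmean(as.numeric(y), 50L))]
```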

@jangorecki commented Aug 22, 2019

An example of how vectorized x/n arguments can impact performance:
AdrianAntico/RemixAutoML@d837071#r34784427
fewer loops, code that is easier to read, and a much faster run (10x-36x speedup).
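For context, froll* accepts multiple columns and multiple window sizes in a single call, returning one vector per column/window pair; a small sketch:

```r
library(data.table)
DT = data.table(v1 = 1:6, v2 = 6:1)
# 2 columns x 2 window sizes -> a list of 4 vectors, no explicit R-level loop
res = frollmean(DT, n = c(2, 3))
length(res)  # 4
```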

@jangorecki commented Sep 1, 2019

frollapply ready: #3600

    ### fun             mean     sum  median
    # rollfun          8.815   5.151  60.175
    # zoo::rollapply  34.373  27.837  88.552
    # zoo::roll[fun]   0.215   0.185      NA
    # frollapply       5.404   1.419  56.475
    # froll[fun]       0.003   0.002      NA
@MichaelChirico MichaelChirico mentioned this issue Oct 15, 2019
@jerryfuyu0104 commented Oct 28, 2019

hi guys, will FUN (user-defined) passed to frollapply be changed to allow returning an R object or a data.frame (data.table)? Could the x passed to frollapply be a data.table of character columns, not coerced to numeric, so that FUN could operate on labels and frollapply return a list? Then we could do rolling regression or rolling testing, such as Benford's testing or summaries on labels.

@jangorecki commented Oct 28, 2019

It is always useful to provide a reproducible example. To clarify... in such a scenario you would like frollapply(dt, 3, FUN) to return a list of length nrow(dt) where each list element is the data.table returned by FUN(dt[window])?
frollapply(x=dt, n=3, FUN=FUN)[[3]] equals FUN(dt[1:3])
frollapply(x=dt, n=3, FUN=FUN)[[4]] equals FUN(dt[2:4])
is that correct? @jerryfuyu0104

Currently we support multiple columns passed to the first argument, but we process them separately, looping. We would probably need some extra argument, say multi.var=FALSE, which when set to TRUE would not loop over x (as it does now: list(FUN(x[[1]]), FUN(x[[2]]))) but would pass all columns at once: FUN(x).
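Until something like that exists, a rolling FUN over whole multi-column windows can be emulated in plain R (multi.var above is only a hypothetical argument):

```r
dt = data.frame(a = 1:5, b = letters[1:5])
n = 3
# FUN(dt[window]) for each complete trailing window;
# leading entries with incomplete windows stay NULL
res = lapply(seq_len(nrow(dt)), function(m)
  if (m >= n) dt[(m - n + 1):m, ] else NULL)
res[[3]]  # first complete window: rows 1:3
```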
