
Implement TSMixer Model #2293

Merged · 38 commits merged Apr 8, 2024
Conversation

@cristof-r (Contributor) commented Mar 21, 2024

Add TSMixer model #1807 with several unit tests.
Adopted PyTorch implementation from this repository: https://github.com/ditschuk/pytorch-tsmixer/
The paper can be found here: https://arxiv.org/pdf/2303.06053.pdf
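For intuition, the core idea of TSMixer from the paper can be sketched in a few lines of NumPy. This is a hypothetical simplification, not the darts or pytorch-tsmixer implementation: normalization, dropout, and covariate handling are omitted, and all names are illustrative.

```python
# Minimal sketch of one TSMixer block: alternate "mixing" MLPs along
# the time axis and the feature axis of a (time_steps, n_features)
# input. Illustrative only; not the actual darts implementation.
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def mixer_block(x, w_time, w_feat):
    """One simplified mixer block: time-mixing then feature-mixing,
    each with a residual connection (norms and dropout omitted)."""
    # Time-mixing: linearly mix across the time dimension.
    x = x + relu(w_time @ x)   # (T, T) @ (T, F) -> (T, F)
    # Feature-mixing: linearly mix across the feature dimension.
    x = x + relu(x @ w_feat)   # (T, F) @ (F, F) -> (T, F)
    return x

rng = np.random.default_rng(0)
T, F = 8, 3                    # input length, number of features
x = rng.normal(size=(T, F))
w_time = rng.normal(size=(T, T)) * 0.1
w_feat = rng.normal(size=(F, F)) * 0.1
y = mixer_block(x, w_time, w_feat)
print(y.shape)                 # shape is preserved: (8, 3)
```

Stacking several such blocks, followed by a final projection from the input length to the forecast horizon, gives the essence of the architecture.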

@cristof-r cristof-r marked this pull request as draft March 21, 2024 12:46
@cristof-r (Contributor, Author)

Any feedback is very welcome; it looks good to me so far.
I can add it to the various markdown files and create an example notebook if you like the implementation.

@cristof-r cristof-r marked this pull request as ready for review March 21, 2024 14:31
@VascoSch92 left a comment

Whoa... nice job

darts/models/forecasting/tsmixer_model.py — review comments (outdated, resolved)
darts/tests/models/forecasting/test_tsmixer.py — review comments (outdated, resolved)
cristof-r and others added 10 commits March 22, 2024
Co-authored-by: Vasco Schiavo <115561717+VascoSch92@users.noreply.github.com>
@codecov-commenter commented Mar 22, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 94.01%. Comparing base (91c7087) to head (cef5678).


Additional details and impacted files
@@            Coverage Diff             @@
##           master    #2293      +/-   ##
==========================================
+ Coverage   93.95%   94.01%   +0.05%     
==========================================
  Files         136      137       +1     
  Lines       13687    13857     +170     
==========================================
+ Hits        12860    13027     +167     
- Misses        827      830       +3     


@dennisbader (Collaborator)

Hi @cristof-r, wow, this indeed looks amazing from a first glance!
Thanks a lot for this great PR, just give us some time to review it :) 🚀 🚀 🚀

@leoniewgnr

If there is anything I can help with, please let me know! Looking forward to this:)

@cristof-r (Contributor, Author)

@leoniewgnr: "If there is anything I can help with, please let me know! Looking forward to this :)"

It would be great if you have an interesting idea for a small notebook example to demonstrate TSMixer.
I was thinking of comparing it against the TFT model, showing its (hopefully) higher performance, as demonstrated in the original paper.
Unfortunately, using a bigger dataset like "ETTm1" (which was also used in the paper) takes too long to train and evaluate.

@dennisbader (Collaborator)

@cristof-r and @leoniewgnr, I'm currently reviewing the PR. There were a couple of things to change, so I started working on a new branch with some adaptations to this PR. I'll soon open a PR to merge into this one.

Among other things it will improve the performance and reduce training time drastically.

While working on it I also made a little notebook for testing with the ETTh1Dataset. The model works pretty well, very close to TiDEModel.

I'll keep you updated.

@cristof-r (Contributor, Author)

@dennisbader sounds great! Thank you very much


@dennisbader (Collaborator) left a comment

Thanks for this really nice PR @cristof-r. I took the liberty of pushing the changes directly into this branch. I'll comment on the most important ones:

tests

  • removed some of the tests that were either already handled in other test files, or tests that were taking a "long" time to complete (e.g. model performance (accuracy) tests which took ~30 seconds to complete)

model implementation

  • norms:
    • adapted TimeBatchNorm2d implementation to use actual 2d batch norm
    • removed support for RINorm, since it should be used via the model parameter use_reversible_instance_norm
  • modules
    • made all module classes private to hide them in the rendered documentation
    • removed ConditionalFeatureMixing and instead added the logic to _ConditionalMixerLayer
  • model parameters
    • lowered the default parameters values (e.g. blocks, hidden_size, ...) to make a lighter default version
  • main things that were fixed:
    • before, output_dim for all modules was set to hidden_output_size = hidden_size * output_dim * nr_params, whereas it should just be hidden_size. This was why the model was getting really slow for multivariate target series or probabilistic models.
    • multi-component static covariates were not properly handled. We need to flatten the static covariates, so that static_cov_dim = n_components * n_static_features
    • I believe the static mixing was not handled correctly before. It looks like it was done as described in the paper; however, I'm not sure the paper described it correctly.
      • before, static covariates were projected to hidden_size with a linear layer and then concatenated with x. In the first block, x has only the actual number of input features, so for the concatenation x has a much lower dimensionality than x_static.
      • now, at the beginning we apply feature mixing to historical, future, and static covariates separately, and then feed them together to the mixing layers.
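The static-covariate flattening mentioned above can be illustrated with a toy NumPy example (shapes and variable names are illustrative, not the darts internals):

```python
# Hedged illustration of flattening multi-component static covariates:
# an array of shape (n_components, n_static_features) is flattened to
# one vector, so static_cov_dim = n_components * n_static_features.
# Names are hypothetical, chosen only for this sketch.
import numpy as np

n_components, n_static_features = 3, 2
static_covs = np.arange(n_components * n_static_features).reshape(
    n_components, n_static_features
)
flat = static_covs.reshape(-1)         # shape (6,)
static_cov_dim = flat.shape[0]
print(static_cov_dim)                  # 6 = 3 components * 2 features
```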

model example notebook

  • added an example notebook comparing a probabilistic TSMixer with TiDEModel on a multivariate dataset, including future covariates (encoders) and static information

Let me know if you agree with the changes :) And again, thanks a lot for this great contribution, really appreciated!

@cristof-r (Contributor, Author)

@dennisbader thank you very much for the improvements, I learned a lot.
Also thank you very much for the darts library in general, it is really great.
Do you already know when the next release will be?

@dennisbader (Collaborator)

@cristof-r, we're aiming to release within the next two weeks.

@dennisbader dennisbader merged commit 0d5c722 into unit8co:master Apr 8, 2024
7 of 9 checks passed
@cristof-r cristof-r deleted the feature/ts_mixer_model branch April 9, 2024 08:22
6 participants