
[timeseries] Add wrappers for Statsforecast models #2758

Merged · 10 commits merged into autogluon:master on Jan 27, 2023

Conversation

@shchur (Collaborator) commented on Jan 25, 2023

Description of changes:

  • Add AutoETS, AutoARIMA and DynamicOptimizedTheta models from StatsForecast.

To Do:

  • Add tests
  • Benchmark & add models to presets
  • Expose the n_jobs parameter

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@github-actions

Job PR-2758-7212948 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2758/7212948/index.html

@github-actions

Job PR-2758-a376e62 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2758/a376e62/index.html

@tonyhoo (Collaborator) left a comment:

Are we planning to add them to any popular presets such as medium_quality or high_quality?

@@ -32,6 +32,7 @@
"torch>=1.9,<1.14",
"pytorch-lightning>=1.7.4,<1.9.0",
"networkx",
"statsforecast==1.4.0",
Collaborator:
Can we make 1.4.0 the lower bound?

@shchur (Collaborator, Author):
This way we protect ourselves from potentially breaking changes / regressions caused by newer versions of the dependencies. Both have already happened to us a few times (caused by minor releases of sktime & GluonTS), so I would rather be extra cautious here.
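
For illustration only, the two pinning strategies being discussed would look roughly like this in the install requirements (a sketch, not part of the diff):

# Illustrative sketch of the two dependency-pinning strategies discussed above.
install_requires_exact = ["statsforecast==1.4.0"]        # exact pin, as used in this PR
install_requires_ranged = ["statsforecast>=1.4.0,<1.5"]  # hypothetical lower bound with an upper cap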

"""
# TODO: Find a way to ensure that SF models respect time_limit
# Fitting usually takes >= 20 seconds
if time_limit is not None and time_limit < 20:
Collaborator:
The fit time should also depend on the instance type. If time_limit is not implemented and we expect the local model fit to be quick compared with other models, should we emit a warning message instead?

@shchur (Collaborator, Author) commented on Jan 26, 2023:

Updated this logic based on the discussion below.

# Fitting usually takes >= 20 seconds
if time_limit is not None and time_limit < 20:
    raise TimeLimitExceeded
super()._fit(train_data=train_data, time_limit=time_limit, verbosity=verbosity, **kwargs)
Collaborator:
One quick idea for enforcing time_limit is to take advantage of the timeout argument of ThreadPoolExecutor or ProcessPoolExecutor. Details can be found here.
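
For reference, a minimal sketch of this idea (not the implementation that landed): run the blocking StatsForecast call in a worker and give up after time_limit seconds. fit_fn below is a hypothetical zero-argument callable that wraps StatsForecast.forecast.

import concurrent.futures

def fit_with_time_limit(fit_fn, time_limit):
    # Run the blocking fit in a separate process and wait at most `time_limit` seconds.
    with concurrent.futures.ProcessPoolExecutor(max_workers=1) as executor:
        future = executor.submit(fit_fn)
        try:
            return future.result(timeout=time_limit)
        except concurrent.futures.TimeoutError:
            # Caveat: cancel() cannot stop an already-running task, and the executor's
            # shutdown still waits for the worker to finish, which is why a clean
            # solution would likely require changes inside StatsForecast itself.
            future.cancel()
            raise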

@shchur (Collaborator, Author):
I agree but, if I understand correctly, this would require monkey-patching StatsForecast. Is it fine if we leave it as-is for now and add a TODO comment?

Collaborator:
Let's add a TODO and follow up on that.

Comment on lines 84 to 86
AutoARIMA=30,
AutoETS=70,
DynamicOptimizedTheta=60,
Collaborator:
How do we determine the priorities of these stats models? How will they fit alongside the existing ARIMA and ETS implementations? Some accuracy metric comparison might be helpful to give insight into the ensemble results.

@shchur (Collaborator, Author):
I set the priorities inversely proportional to the average time it takes to fit these models (slower model -> lower priority).

@shchur (Collaborator, Author):
Also added models to the presets based on the discussion below.
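
For context, a hedged illustration of how priority values like the ones quoted above translate into an ordering (the dict name here is assumed, not taken from the diff): a higher priority means the model is attempted earlier, so the slowest model (AutoARIMA) comes last.

# Assumed illustration: higher priority -> attempted earlier within the time budget.
MODEL_PRIORITY = {"AutoARIMA": 30, "AutoETS": 70, "DynamicOptimizedTheta": 60}
training_order = sorted(MODEL_PRIORITY, key=MODEL_PRIORITY.get, reverse=True)
print(training_order)  # ['AutoETS', 'DynamicOptimizedTheta', 'AutoARIMA']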

@shchur (Collaborator, Author) commented on Jan 26, 2023:

After benchmarking AutoETS, AutoARIMA and DynamicOptimizedTheta models on 28 datasets on m5.4xlarge:

  • Fit time (i.e., prediction time for the validation set)
    • ≥25s for all datasets, median time is 45s (these are approximately the same for all models)
    • For 90% of the datasets, these times are slower than those of the respective Statsmodels-based models (ETS, ARIMA, Theta), by 30s in the median case
    • The runtime numbers are similar on my p3.8xlarge cloud desktop
    • 99% of the time is spent inside StatsForecast.forecast, so our wrapper does not introduce any noticeable overhead
  • Performance comparison (MASE) on the test set
    • AutoETS > ETS winrate = 71%
    • AutoARIMA > ARIMA winrate = 62%
    • DynamicOptimizedTheta > Theta winrate = 57%

Based on these findings, I suggest that we:

  • Add AutoETS to the medium_quality preset.
  • Add AutoETS and AutoARIMA to the high_quality & best_quality presets, and remove HPO for ARIMA and ETS in these presets.
  • Add DynamicOptimizedTheta to the best_quality preset only.
  • Raise TimeLimitExceeded if less than 10s remains, and warn that the model might exceed the time limit if less than 30s remains (see the sketch after this list).
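
A minimal sketch of that last rule, with thresholds taken from the bullet above and an assumed import path for AutoGluon's TimeLimitExceeded exception:

import logging

from autogluon.core.utils.exceptions import TimeLimitExceeded  # assumed import path

logger = logging.getLogger(__name__)

def check_time_limit(time_limit):
    # Hard stop below 10s, warning below 30s, as proposed above.
    if time_limit is None:
        return
    if time_limit < 10:
        raise TimeLimitExceeded
    if time_limit < 30:
        logger.warning("Model may exceed the given time_limit (less than 30s remaining).")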

I will do another round of benchmarking after adding TFT & updating the SFF model from the new GluonTS release. We can update the presets based on these results.
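
As a usage illustration (not part of the PR), the new models can be requested explicitly through the hyperparameters argument of TimeSeriesPredictor; the dataset below is a made-up placeholder, and the model keys are assumed to match the wrapper names added here.

import pandas as pd
from autogluon.timeseries import TimeSeriesDataFrame, TimeSeriesPredictor

# Placeholder long-format data: one item, 200 hourly observations.
df = pd.DataFrame({
    "item_id": ["A"] * 200,
    "timestamp": pd.date_range("2023-01-01", periods=200, freq="H"),
    "target": range(200),
})
train_data = TimeSeriesDataFrame.from_data_frame(df, id_column="item_id", timestamp_column="timestamp")

predictor = TimeSeriesPredictor(prediction_length=24, target="target")
predictor.fit(
    train_data,
    hyperparameters={"AutoETS": {}, "AutoARIMA": {}, "DynamicOptimizedTheta": {}},
)
predictions = predictor.predict(train_data)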

@github-actions

Job PR-2758-ed4706a is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2758/ed4706a/index.html

@shchur merged commit 106ba2f into autogluon:master on Jan 27, 2023
@shchur deleted the statsforecast branch on January 27, 2023 at 08:32