&SmirnGregHM [BUG] ensure forecasting tuners do not vectorize over columns (variables) #5145

fkiraly · 2023-08-23T13:44:44Z

This PR introduces parameters to forecasting tuners such as ForecastingGridSearchCV to control whether parameters are tuned overall, or for instances or variables.

The current behaviour is retained for the default, which is tuning per variable and for all instances.

As this is somewhat inconsistent, this PR also starts a deprecation period with an end state where the default is a single parameter tuned for all instances and variables.
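To make the distinction concrete, here is a minimal, library-free sketch of the two tuning modes. It is not the sktime implementation: `tune`, `backtest_error`, the `by_variable` flag and the toy data are all hypothetical names chosen for illustration, and the "forecaster" is just a moving average with a tunable window.

```python
from statistics import mean


def moving_average_forecast(history, window):
    """Forecast the next value as the mean of the last `window` observations."""
    return mean(history[-window:])


def backtest_error(series, window, n_test=3):
    """Mean absolute one-step-ahead error over the last `n_test` points."""
    errors = []
    for t in range(len(series) - n_test, len(series)):
        pred = moving_average_forecast(series[:t], window)
        errors.append(abs(series[t] - pred))
    return mean(errors)


def tune(columns, grid, by_variable):
    """Toy grid search over `window` (hypothetical, not the sktime API).

    by_variable=True: one best window per column, i.e. tuning is vectorized.
    by_variable=False: a single window minimising the error averaged over columns.
    """
    if by_variable:
        return {
            name: min(grid, key=lambda w: backtest_error(values, w))
            for name, values in columns.items()
        }
    return min(
        grid,
        key=lambda w: mean(backtest_error(v, w) for v in columns.values()),
    )


data = {
    "trend": [1, 2, 3, 4, 5, 6, 7, 8],
    "noisy": [5, 1, 5, 1, 5, 1, 5, 1],
}
per_variable = tune(data, grid=[1, 2, 4], by_variable=True)
overall = tune(data, grid=[1, 2, 4], by_variable=False)
print(per_variable)  # one window per column
print(overall)       # a single window for all columns
```

In the per-variable mode there is no single set of best parameters to report at the top level, which is the source of the confusion discussed in #5143.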

Fixes #5143; see there for a discussion of why this is a (logic) bug. The solution is now to move to the most intuitive setting as the default, but leave the option to tune by instance or variable to the user, and expose this clearly in the docstrings.

Also extends the tests as follows:

- adds test cases where the multivariate case is tested
- adds checks for presence of best_params in all relevant tests

SmirnGregHM

I have checked, it works as expected. Are you sure it can go without the deprecation warning though?


fkiraly · 2023-08-24T13:30:12Z

> I have checked, it works as expected. Are you sure it can go without the deprecation warning though?

Not entirely, that's why I would like to hear from @yarnabrina.

Whether we need one hinges on how many users would think this is a bug, and how many users have actually used it as-is, accessing the best_params_ via forecasters_ (despite that not being clearly documented).

If there's even a small number of the latter, we ought to run this through a deprecation cycle as it would break their code.

What do you think, @SmirnGregHM?

SmirnGregHM · 2023-08-24T13:51:09Z

@fkiraly it is hard for me to estimate the number of people who would use it as it is now. Can you maybe leave something like

class ForecastingGridSearchCV(...):
    ...
    
    @property
    def _forecasters(self):
        raise AttributeError(
            "`_forecasters` property was removed and the behaviour of `ForecastingGridSearchCV`"
            " for multivariate series changed in 0.22.1. Access `best_params_`, `cv_results_`,"
            " etc. directly from `ForecastingGridSearchCV` instead."
        )

Would it break something else?
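As a rough illustration of how such a property would behave at runtime, here is a self-contained sketch. `ToyTuner` is a hypothetical stand-in and not the sktime class; the attribute name `_forecasters` is taken from the suggestion above and the message is shortened.

```python
class ToyTuner:
    """Hypothetical stand-in for a tuner; not the sktime class."""

    def __init__(self):
        # placeholder value, only here so there is something to point users to
        self.best_params_ = {"window": 2}

    @property
    def _forecasters(self):
        # accessing the removed attribute fails loudly with a pointer to the new API
        raise AttributeError(
            "`_forecasters` was removed. Access `best_params_`, `cv_results_`, "
            "etc. directly from the tuner instead."
        )


tuner = ToyTuner()

# hasattr swallows AttributeError, so feature probes silently report False
probe = hasattr(tuner, "_forecasters")

# direct access raises, and the message carries the migration hint
try:
    tuner._forecasters
    message = None
except AttributeError as exc:
    message = str(exc)

print(probe)
print(message)
```

One thing to keep in mind with this pattern: code that checks for the attribute with `hasattr` will not see the message and will quietly take its fallback branch, while code that accesses the attribute directly gets the explanatory error.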

fkiraly · 2023-08-24T13:52:30Z

> Can you maybe leave something like

Hm, that sounds like a good compromise. I don't think it will break anything, since broadcasting doesn't happen after this PR.

fkiraly · 2023-08-24T15:09:05Z

added the error message, slightly reworded and with more pointers. Review appreciated, @SmirnGregHM. Also added a credit to you to this PR due to your contributions.

yarnabrina · 2023-08-24T17:01:51Z

I'd prefer this to go through deprecation, possibly through a temporary legacy_behaviour argument similar to the predict_interval/predict_quantiles methods.

(As is probably obvious from my recent questions on Discord and GitHub, I don't use this yet, so please feel free to treat this as low to zero priority, just a random opinion.)

yarnabrina

Does it make sense to add examples documenting how to access best model/score/parameters for both univariate/multivariate(/ForecastX ?) cases?

(In the GitHub discussion, I noticed that if the passed forecaster is ForecastX with the same model for both, the y and X forecasters get treated separately, which may not be obvious to users.)

fkiraly · 2023-08-25T13:52:33Z

@yarnabrina, point taken. FYI @SmirnGregHM

I now went with a deprecation solution after all - which is close to @SmirnGregHM's original suggestion to have additional parameters.

Why:

- following @yarnabrina's comment, I infer that some users may use the estimators with their current interface
- wrapping the grid search in ForecastByLevel or ColumnEnsembleForecaster to achieve the effect is not as intuitive as a dedicated parameter, and incurs more nested estimators
- deprecation is better managed by a dedicated parameter that remains, rather than by a less intuitive transition with a temporary legacy_interface parameter, and it is much simpler (switch of a default, single cycle)

SmirnGregHM

Great, @fkiraly! My only suggestion is to make the warning an explicit DeprecationWarning rather than the generic UserWarning. I have not tested the latest version yet; I will do so later today. Code-wise, all seems good.
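For readers unfamiliar with the pattern, the following is a minimal sketch of a "switch of a default" deprecation that emits a `DeprecationWarning`. It is not the code from `_tune.py`: the helper `resolve_tune_by_variable`, the parameter name `tune_by_variable` and the message text are assumptions made for illustration only.

```python
import warnings

# sentinel to tell "argument not passed" apart from an explicit True/False
_UNSET = object()


def resolve_tune_by_variable(tune_by_variable=_UNSET):
    """Return the effective setting, warning if the caller relied on the default.

    Hypothetical helper: during the deprecation period the old default (True)
    is kept, and users are told that it will change in a future release.
    """
    if tune_by_variable is _UNSET:
        warnings.warn(
            "The default of tune_by_variable will change from True to False "
            "in a future release. Set it explicitly to silence this warning.",
            DeprecationWarning,
            stacklevel=2,
        )
        return True  # legacy default, retained until the cycle ends
    return tune_by_variable


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")  # make sure nothing is filtered out
    legacy = resolve_tune_by_variable()           # relies on default, warns
    explicit = resolve_tune_by_variable(False)    # explicit choice, silent

print(legacy, explicit, [w.category.__name__ for w in caught])
```

A practical difference between the two categories: Python's default filters show `UserWarning` everywhere, but show `DeprecationWarning` only when it is triggered by code running in `__main__`, so warnings raised on behalf of other library code may stay hidden unless the user or test runner enables them.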

SmirnGregHM · 2023-08-29T11:28:15Z

sktime/forecasting/model_selection/_tune.py

+                "to version 0.24.0."
+            )


"to version 0.24.0.", DeprecationWarning

thanks, changed


SmirnGregHM · 2023-08-29T12:56:26Z

Good, works as expected, and old behavior is kept with a warning that it will change soon


Update _tune.py

ebc864f

fkiraly added the module:forecasting and bugfix labels Aug 23, 2023

fkiraly requested review from achieveordie, benHeid and yarnabrina as code owners August 23, 2023 13:44

fkiraly mentioned this pull request Aug 23, 2023

[BUG] ForecastingGridSearchCV does not save best_forecaster_, cv_results_ and other properties for multivariate time series #5143

Closed

fkiraly added 4 commits August 23, 2023 15:56

ensure multivariate pandas type present

0762091

docstring

e330fc0

tests

9c39e15

Update test_tune.py

e847d8b

SmirnGregHM approved these changes Aug 24, 2023


fkiraly changed the title ~~[BUG] ensure forecasting tuners do not vectorize over columns (variables)~~ &SmirnGregHM [BUG] ensure forecasting tuners do not vectorize over columns (variables) Aug 24, 2023

error message

39df7cf

yarnabrina reviewed Aug 24, 2023


deprecation

4cd5f97

fkiraly added 3 commits August 25, 2023 14:58

Update _tune.py

8c52df3

Update test_tune.py

ca25f30

fix wrong variable name

e6d011b

fkiraly requested review from SmirnGregHM and yarnabrina August 25, 2023 16:01

SmirnGregHM approved these changes Aug 29, 2023


DeprecationWarning

c31cc8d

remove superfluous longley

91b78f6

fkiraly merged commit 0df3754 into main Sep 2, 2023
23 of 24 checks passed

fkiraly deleted the gridsearch-no-columnvec branch September 2, 2023 11:36
