[ENH] ensure that all estimators have two test parameter sets #3429

fkiraly · 2022-09-14T22:38:40Z

We should ensure that all estimators (that have parameters) possess at least two test parameter sets.

The two (or more) parameter sets should:

be fast to run together - fit is the bottleneck (so we should not overdo it with too many parameter sets)
cover substantially different settings for all the important parameters, i.e., substantially different typical cases and/or important edge cases

Recipe:

search for estimators which have parameters but only a single test parameter set. These are estimators with no get_test_params implemented, or get_test_params returning only a single dictionary instead of a list of two (or more) dictionaries.
post here in this issue which estimator you picked (to avoid duplication of work)
come up with a parameter set satisfying the above constraints and add it to the return (should be list of two or more dictionaries)
make a PR

An example PR that adds second parameter sets for some estimators can be found here: #3428

Finding some estimators that have only one parameter set can also be done speedily by using this PR #2862 which adds a test for two parameter sets - either run the test suite from the branch locally, or look into the failing CI.

Locally running code which does this:

from sktime.registry import all_estimators

all_ests = all_estimators()
[x[0] for x in all_ests if (len(x[1].get_test_params())<2 or isinstance(x[1].get_test_params(), dict)) and len(x[1].get_param_names())>0]

Current output:

The text was updated successfully, but these errors were encountered:

Towards #3429. This adds a second parameter set for all estimators checked via `check_estimator` in the `no-softdeps` CI element. This is generally useful, and also allows #2862 to pass that CI element. Also fixes a bug discovered through this: `ExponentTransformer.inverse_transform` breaking if `power` is close to zero. This is now dealt with by a skip and a warning.

Abelarm · 2022-10-01T14:26:02Z

Hi @fkiraly

I’d like to to tackle this if it’s ok for you.

fkiraly · 2022-10-01T15:21:37Z

sure, @Abelarm - pick an estimator!

…gressorPipeline` (#3857) `set_params` bug in `ClassifierPipeline` and `RegressorPipeline` was broken, it would not correctly update parameters. The failure to detect this is an instance of the known problem #3429 The bug has been fixed, and is now covered by appropriate tests (addition of a second parameter set). The fix is as follow: * the issue was in `set_params` which accidentally had one too many layer of nesting in the param dict indexing, e.g., `classifier__` etc. This would materialize only for doubly nested estimators. * this was fixed, with a concomitant extension of the dict subset utility in the `_HetereogenousMetaEstimator`. Depends on #3858 as the `DummyClassifier` is used as one of two param sets.

…ts per estimator to 2 or larger (#4043) This PR adds test parameter sets to some estimators which have only one, towards issue #3429. This ensures that the test suite can properly test set and reset of parameters, and increases test coverage by ensuring that more parameters have more than one value setting in the tests. Test that detects estimators with only one parameter set: #2862 (related, not a dep) Depends on fixes of bugs detected through the new parameter sets: * #4047 * #4049 * #4057

…ts per estimator to 2 or larger (sktime#4043) This PR adds test parameter sets to some estimators which have only one, towards issue sktime#3429. This ensures that the test suite can properly test set and reset of parameters, and increases test coverage by ensuring that more parameters have more than one value setting in the tests. Test that detects estimators with only one parameter set: sktime#2862 (related, not a dep) Depends on fixes of bugs detected through the new parameter sets: * sktime#4047 * sktime#4049 * sktime#4057

Second test parameter set for `ARIMA`, towards #3429. Split off from #2862 where it must have ended up accidentally.

Towards #3429, test parameter sets for performance metrics.

This adds a second test parameter set to `AutoETS`. Towards #3429 Related: #4587, as the second set has `auto=True`

janpipek · 2023-07-22T09:47:47Z

I am picking SARIMAX.

julia-kraus · 2023-07-24T17:36:53Z

which estimators are still left?

fkiraly · 2023-07-24T18:25:22Z

@julia-kraus, the failures in this diagnostic PR #2862 correspond to the ones that do have only one - it might not be 100% up to date, I'll restart it so it is:

namita0210 · 2024-03-18T14:02:10Z

Hi , can I take up this issue for the estimator: "TimeSeriesForestClassier"
@fkiraly

fkiraly · 2024-03-18T15:23:01Z

@namita0210, absolutely! All yours!

#### What does this implement/fix? Explain your changes.  Implemented the standard 'get_test_params' class method with the appropriate docstring and applicable parameters. Added a couple test params for `RNNNetwork` contributing towards #3429. One test param that covers the default set and another that covers the 'units' parameter.

Towards #3429 Adds a second test parameter set for shapeDTW

MMTrooper · 2024-03-21T16:56:22Z

Hi, I will try to tackle the MatrixProfileClassifier. @fkiraly

fkiraly · 2024-03-21T23:29:14Z

great, thanks, @MMTrooper!

shankariraja · 2024-03-23T06:52:58Z

Hi @fkiraly,

I'm currently working on adding new test parameter sets for the estimators identified in the issue. I'll be focusing on TimeSeriesKMeansTslearn.
I'll create a pull request once I've completed the changes and tests. In the meantime, please let me know if you have any suggestions.

Thanks!

) - Introduce two test parameter sets for ``TimeSeriesKMeansTslearn`` in the ``get_test_params`` function. - Reference Issues Towards : #3429 - Tests passed: pytest sktime\clustering\tests\test_k_means.py

KaustubhUp025 · 2024-03-23T22:05:08Z

Hello @fkiraly , I will try to work on the estimator:- LogTransformer

Z-Fran · 2024-03-25T12:48:18Z

Hi @fkiraly , I will try to work on KNeighborsTimeSeriesRegressor.

@ianspektor

This PR enforces a stricter condition on `get_test_params`, namely that it should always run, even if all sensible instances require soft dependencies. This is to make the inspection contracts simpler and unconditional as regards dependencies. Two instances where this has recently caused problems is the `TemporianTransformer` in #5956 (FYI @ianspektor, @achoum, @javiber), and the #5880 (FYI @benHeid, @astrogilda). Having breaking `get_test_params` will also prevent the code snippet in the entry issue #3429 from running, which is causing problems from new contributors, as that issue is presented as a simple entry task. The code snippet is now covered by guaranteeing that `get_test_params` always runs. Includes: fix for `TSBootstrapAdapter`, which was the only non-compliant estimator.

… 3429 (#6209) #### What does this implement/fix? Explain your changes. Implemented get_test_params for both ```CNNNetwork``` and ```ResnetNetwork``` for issue #3429 . Fixed a couple typos inside the docstring for ```CNNNetwork```, changed from nb_conv_layers to n_conv_layers.

… and non-precomputed mode to improve memory efficiency (#6217) #### Reference Issues/PRs #3429 (comment) #### What does this implement/fix? Explain your changes. adds test parameter sets for `KNeighborsTimeSeriesRegressor`; adds support for non-brute algorithms and non-precomputed mode, mirroring #5937

shlok191 · 2024-04-05T00:28:50Z

Hello @fkiraly,
Could I please work on the LSTMFCNNetwork?

Thank you!!

fkiraly · 2024-04-05T12:16:39Z

do you mean, you would like to work on LSTMFCNNetwork, or are you asking me to?

shlok191 · 2024-04-05T13:28:17Z

@fkiraly, Oh I'm sorry about that! I meant if I could work on this! I wrote that while traveling, so sorry again!

fkiraly · 2024-04-05T15:40:30Z

Sure! No worries, and thanks for contributing!

For this estimator, kindly be aware of:

failures reported in [BUG] test failures in deep learning classifiers, regressors #6153 which you might encounter
the requirement to remove the test skip after fixing, as described in [BUG] test failures in deep learning classifiers, regressors #6153 (or tests will not run)

shlok191 · 2024-04-05T15:52:56Z

@fkiraly, Thank you so much for letting me help out!
I'll keep my eye on the possible failures and I'll remove the test skips as well :)

#### What does this implement/fix? Explain your changes. Towards #3429 I decided to add the estimator parameter and set it to the scikit-learn classifier `KNeighborsClassifier`. I also added my self as a contributor. Let me know if it was appropriate or I need to make a better implementation.

shlok191 · 2024-04-09T08:50:33Z

Hello @fkiraly,

I hope that you're having a good start to your week! I wanted to let you know that I added 2 test parameters for the LSTMFCNNetwork here. I also checked tests/_config.py to make sure that this estimator is included in CI tests. ☺️

I am really excited to test out the test parameters and getting your feedback! I can try to test out the changes locally first if that is the preferred protocol. I learned a lot about LSTMs in the context of time series from this. I would really love to possibly contribute more after this estimator's parameters are completed if that is okay.

Thank you so much again!

fkiraly · 2024-04-09T10:31:31Z

Great, @shlok191!

I recommend you open a pull request, where core developers can discuss your contribution further and possiby merge it!

shlok191 · 2024-04-09T19:43:48Z

@fkiraly, I just added a PR. Thanks a lot again for letting me contribute! 😃

fkiraly · 2024-04-09T20:36:38Z

sktime is an open project, so everyone can contribute!

Thanks for your contribution!

fkiraly added good first issue Good for newcomers maintenance Continuous integration, unit testing & package distribution module:tests test framework functionality - only framework, excl specific tests enhancement Adding new functionality labels Sep 14, 2022

This was referenced Sep 14, 2022

Good first issues & getting started, for new contributors #1147

Open

[ENH] test for more than one parameter sets per estimator #2862

Merged

[ENH] second param sets for selected estimators #3428

Merged

Abelarm mentioned this issue Oct 2, 2022

&fkiraly [BUG] fix sklearn interface non-conformance for estimators in _proximity_forest.py, add further test parameter sets #3520

Merged

fkiraly mentioned this issue Nov 30, 2022

[BUG] fix unreported set_params bug in ClassifierPipeline and RegressorPipeline #3857

Merged

This was referenced Jan 10, 2023

[ENH] add test parameter sets to increase number of test parameter sets per estimator to 2 or larger #4043

Merged

[ENH] second test parameter set for AutoETS #4091

Draft

fkiraly mentioned this issue Jan 11, 2023

[ENH] second set of test parameters for ARIMA #4099

Merged

mateuja mentioned this issue Jan 14, 2023

[ENH] Refactor/simplify sktime.forecasting.model_selection._split.BaseSplitter._split_vectorized #4108

Merged

5 tasks

fkiraly added a commit that referenced this issue Jan 23, 2023

[ENH] second set of test parameters for ARIMA (#4099)

b9b6e2e

Second test parameter set for `ARIMA`, towards #3429. Split off from #2862 where it must have ended up accidentally.

fkiraly mentioned this issue Feb 18, 2023

[ENH] additional test parameter sets for performance metrics #4246

Merged

fkiraly added a commit that referenced this issue Feb 28, 2023

[ENH] additional test parameter sets for performance metrics (#4246)

4ec4576

Towards #3429, test parameter sets for performance metrics.

This was referenced May 13, 2023

[BUG] AutoETS predict_quantile and predict_interval fail in sktime 0.17.2 #4587

Closed

[ENH] add more test parameter sets to AutoETS #4588

Merged

fkiraly added a commit that referenced this issue May 17, 2023

[ENH] add more test parameter sets to AutoETS (#4588)

868c433

This adds a second test parameter set to `AutoETS`. Towards #3429 Related: #4587, as the second set has `auto=True`

janpipek mentioned this issue Jul 22, 2023

[ENH] improve SARIMAX test parameter coverage #4932

Merged

2 tasks

Gigi1111 mentioned this issue Jul 22, 2023

[ENH] add test cases for Croston and ExponentialSmoothing #4935

Merged

2 tasks

julian-fong mentioned this issue Mar 17, 2024

[ENH] added test params to RNNNetwork on 3429 #6155

Merged

6 tasks

fkiraly pushed a commit that referenced this issue Mar 21, 2024

[ENH][BUG] Second test parameter set for shapeDTW (#6093)

533fe8c

Towards #3429 Adds a second test parameter set for shapeDTW

shankariraja mentioned this issue Mar 23, 2024

[ENH] add new test parameter sets for TimeSeriesKMeansTslearn #6195

Merged

julian-fong mentioned this issue Mar 25, 2024

[ENH] added test params toCNNNetwork and ResnetNetwork on 3429 #6209

Merged

6 tasks

Z-Fran mentioned this issue Mar 26, 2024

[ENH] k-nearest neighbors regressor: support for non-brute algorithms and non-precomputed mode to improve memory efficiency #6217

Merged

3 tasks

fkiraly mentioned this issue Mar 27, 2024

[ENH] stricter condition for get_test_params not failing #6223

Merged

fkiraly mentioned this issue Apr 7, 2024

[ENH] added test parameters for MatrixProfileClassifier #6193

Merged

3 tasks

shlok191 mentioned this issue Apr 9, 2024

[ENH] Added test parameters for the LSTM FCNN network #6281

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] ensure that all estimators have two test parameter sets #3429

[ENH] ensure that all estimators have two test parameter sets #3429

fkiraly commented Sep 14, 2022 •

edited

Abelarm commented Oct 1, 2022

fkiraly commented Oct 1, 2022

janpipek commented Jul 22, 2023

julia-kraus commented Jul 24, 2023

fkiraly commented Jul 24, 2023

namita0210 commented Mar 18, 2024

fkiraly commented Mar 18, 2024

MMTrooper commented Mar 21, 2024

fkiraly commented Mar 21, 2024

shankariraja commented Mar 23, 2024

KaustubhUp025 commented Mar 23, 2024 •

edited

Z-Fran commented Mar 25, 2024

shlok191 commented Apr 5, 2024 •

edited

fkiraly commented Apr 5, 2024

shlok191 commented Apr 5, 2024

fkiraly commented Apr 5, 2024

shlok191 commented Apr 5, 2024

shlok191 commented Apr 9, 2024

fkiraly commented Apr 9, 2024

shlok191 commented Apr 9, 2024

fkiraly commented Apr 9, 2024

[ENH] ensure that all estimators have two test parameter sets #3429

[ENH] ensure that all estimators have two test parameter sets #3429

Comments

fkiraly commented Sep 14, 2022 • edited

Abelarm commented Oct 1, 2022

fkiraly commented Oct 1, 2022

janpipek commented Jul 22, 2023

julia-kraus commented Jul 24, 2023

fkiraly commented Jul 24, 2023

namita0210 commented Mar 18, 2024

fkiraly commented Mar 18, 2024

MMTrooper commented Mar 21, 2024

fkiraly commented Mar 21, 2024

shankariraja commented Mar 23, 2024

KaustubhUp025 commented Mar 23, 2024 • edited

Z-Fran commented Mar 25, 2024

shlok191 commented Apr 5, 2024 • edited

fkiraly commented Apr 5, 2024

shlok191 commented Apr 5, 2024

fkiraly commented Apr 5, 2024

shlok191 commented Apr 5, 2024

shlok191 commented Apr 9, 2024

fkiraly commented Apr 9, 2024

shlok191 commented Apr 9, 2024

fkiraly commented Apr 9, 2024

fkiraly commented Sep 14, 2022 •

edited

KaustubhUp025 commented Mar 23, 2024 •

edited

shlok191 commented Apr 5, 2024 •

edited