[ENH] should we have a `reset` method? #1614

fkiraly · 2021-11-12T15:02:42Z

Should sktime estimators have a reset method that sets the estimator's entire state to what it would be after initialization?

The main reason is that this would be a clean way to reset the estimator at the beginning of fit whenever called for the second or any repeated time.

This would prevent bugs such as in #1595 which can lead to unexpected and hard-to-debug behaviour in grid search etc, where fit might be called multiple times.

Of course, a "clean" implementation would use clone, but not all contributors/extenders are aware of the convention.

A more implementer friendly logic is the described behaviour, where fit resets the estimator when called a second time - this is "like" a call to sklearn clone, but it requires no proactivity and advanced knowledge of sklearn internals.

FYI @mloning, @aiwalter, @ltsaprounis, @danbartl, @TonyBagnall, opinions?

The text was updated successfully, but these errors were encountered:

ltsaprounis · 2021-11-14T21:54:08Z

@fkiraly - Should we also raise another issue to add clone in all Cross Validation and Grid Search classes/functions?

fkiraly · 2021-11-15T22:52:05Z

@ltsaprounis, excellent idea! Could you kindly open this and point to the instances where it's not done?

ltsaprounis · 2021-11-19T00:36:41Z

Will do shortly, need to find where we need to do it and make a checklist.

@miraep8

Provides the functionality discussed in #1614, i.e., a `reset` method in `BaseObject` that resets attributes and internal state of an estimator/object to its post-`__init__` state (while keeping parameters), equivalent to overwriting `self` with a `sklearn.clone`. This is motivated by the bug in `MultiplexForecaster` which @miraep8's test suite uncovered in combination with #2458, and the observation that: * boilerplate needs to be copied from `__init__` to `_fit` to fix it, and * that the same issue possibly exists in a number of other composites which are not tested for subsequent `fit` with different parameters. This PR also adds `reset` at the start of `BaseForecaster.fit`, addressing a family of potential bugs where not initializing in `_fit` causes unexpected behaviour in a sequence of calls `__init__`, `set_params`, `fit` (the bug present in the original #2458).

fkiraly added implementing framework Implementing or improving framework for learning tasks, e.g., base class functionality API design API design & software architecture labels Nov 12, 2021

fkiraly mentioned this issue Apr 22, 2022

[ENH] BaseObject reset functionality #2531

Merged

fkiraly linked a pull request Apr 22, 2022 that will close this issue

[ENH] BaseObject reset functionality #2531

Merged

fkiraly closed this as completed in #2531 Apr 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] should we have a `reset` method? #1614

[ENH] should we have a `reset` method? #1614

fkiraly commented Nov 12, 2021

ltsaprounis commented Nov 14, 2021

fkiraly commented Nov 15, 2021

ltsaprounis commented Nov 19, 2021

[ENH] should we have a reset method? #1614

[ENH] should we have a reset method? #1614

Comments

fkiraly commented Nov 12, 2021

ltsaprounis commented Nov 14, 2021

fkiraly commented Nov 15, 2021

ltsaprounis commented Nov 19, 2021

[ENH] should we have a `reset` method? #1614

[ENH] should we have a `reset` method? #1614