Classifier base class #1517

TonyBagnall · 2021-10-13T19:01:50Z

What does this implement/fix? Explain your changes.

Some changes to the base class for classifiers and the tags

Remove classification from the tags "X-y-must-have-same-index" and "enforce_index_type". They are not used by any classifiers
Add the capability tags to the base class. think capabilities should be checked against data in the check_X functionality.
some not dependent on the data could be move down, but will be defined so may as well have them in the base imo
Changes in comments. Note it claims to allow 2D arrays, which is not true, but I left them because it will soon be true.
remove unnecessary argument to get_tag
coerce_to_numpy = self.get_tag("coerce-X-to-numpy")
removed the classifier_list
its not complete and probably should not be here, I imagine I put it there.
Remove default _predict behaviour, add default _predict_proba behaviour. The enhancement template states predict must be implemented. I think that is correct

note there will be a few more refinements, but this will allow us to blast through the fit/_fit refactor

…g-institute/sktime into classifier_base_class

_predict made abstract, _ predict_proba given a default implmentation n_classes_ added to base class

fkiraly · 2021-10-17T20:59:54Z

quick question, while you're at it:

do you want to add the planned support for the various Panel formats?

All it would take is adding one or two convert from datatypes to the functions.

TonyBagnall · 2021-10-18T06:46:50Z

quick question, while you're at it:

do you want to add the planned support for the various Panel formats?

All it would take is adding one or two convert from datatypes to the functions.

Not on this PR, I want to finish refactor to fit first then look at checking and transforming data, one thing at a time

MatthewMiddlehurst

Pushed a fix to the default predict_proba and introduced a threading tag.

Happy with this, next step is to finish the refactor.

sktime/base/_base.py

sktime/classification/base.py

fkiraly

docstrings of BaseObject.get_tag are changed from sth that's correct imo to sth that's incorrect? This is a very small change, but a blocker.
I think the default behaviour in _predict should remain? Perhaps with a check whether an infinite look occurs (in the public part of the function) that raises an error.
Happy with the changes to the BaseClassifier, but I think we need to add a section to the docstring that explains the state change caused by fit etc. See BaseForecaster. Good to have but not a blocker.

…into classifier_base_class

…g-institute/sktime into classifier_base_class

TonyBagnall · 2021-10-19T09:33:27Z

docstrings of BaseObject.get_tag are changed from sth that's correct imo to sth that's incorrect? This is a very small change, but a blocker.

yes, fixed, my misunderstanding (see above)

I think the default behaviour in _predict should remain? Perhaps with a check whether an infinite look occurs (in the public part of the function) that raises an error.

I dont. Your extension guidelines say that _predict must be implemented. I see no value in this default behaviour, it will allow the user to run code which should raise an error, if, say, they implement _predicted instead of _predict

Happy with the changes to the BaseClassifier, but I think we need to add a section to the docstring that explains the state change caused by fit etc. See BaseForecaster. Good to have but not a blocker.

yes I agree, on the next PR for base when I change the checking/recasting/capabilities, I want to push through the fit refactor first.

…g-institute/sktime into classifier_base_class

Use markus's github instead of full name, added matthew.

…g-institute/sktime into classifier_base_class

fkiraly · 2021-10-21T09:00:16Z

I think the default behaviour in _predict should remain? Perhaps with a check whether an infinite look occurs (in the public part of the function) that raises an error.

@TonyBagnall, generally please ask for re-review if changes have been requested and don't merge.

The above has not been addressed, it's an interface breaking change, so it should not have been merged.

It breaks the interface for anyone who has locally implemented a classifier and has only implemented _predict_proba, you are breaking the extension contract.

I don't think we should remove the "create _predict from _predict_proba" logic since this is standard in scikit-learn.

But in-principle I'm fine with this as long as we deprecate properly.

TonyBagnall · 2021-10-21T09:13:38Z

your own extension template stated that _predict must be implemented, and did not even mention _predict_proba. I recently had a student who implemented _predict as instructed on the template, but it failed when calling _predict_proba. Feel free to revise, I will review, but please leave the default predic_proba in there. This is how I think it should be

TonyBagnall · 2021-10-21T09:24:13Z

I'm ok with a default predict, but you then need to change the extension template to make it clear you need one of them. I'm not that concerned at the moment about this, so will leave it to you.

fkiraly · 2021-10-21T11:36:54Z

I recently had a student who implemented _predict as instructed on the template, but it failed when calling _predict_proba. Feel free to revise, I will review, but please leave the default predict_proba in there. This is how I think it should be

Three things here:

the extension template states the "implementer"/"extender" contract, the base class should state the "user" contract (cf BaseForecaster).
I added the "implementer" contract in the extension template because there wasn't one and people were extending. That is, because there wasn't one, not because I want it to be a certain way.
I didn't consider predict_proba to be a mandatory part of the "user" contract because not all classifiers can predict probabilities. But there is also no "user" contract written down.

I'd be fine if the "user" contract guarantees predict_proba, but in that case I'd strongly recommend you write down the user contract as a module docstring for BaseClassifier, just like for BaseForecaster.

I'm in-principle fine with any "should" state that you put out there, as long as the contracts are clearly stated and consistent; and as long as interface changes are properly deprecated.

Tony Bagnall added 6 commits October 13, 2021 17:08

base

f01b032

enforce univariate in base class

3c14b91

remove unnecessary classifier tags

498b8dd

predict and predict_proba

8b72f4d

tweaks to classifier base class

b79f65b

formatting 1

9d36d91

TonyBagnall added the module:classification classification module: time series classification label Oct 13, 2021

Tony Bagnall added 15 commits October 13, 2021 20:13

formatting 3

4ccd2df

formatting 4

5503c54

formatting 6?

50ca231

blank lines or no blank lines?

f7dc729

remove unnecessary argument to get_tag

5919fbb

negate tag correctly, remove unnecessary get_tag argument

4bdb457

Merge branch 'classifier_base_class' of https://github.com/alan-turin…

721052f

…g-institute/sktime into classifier_base_class

correct tag negation

6accbc6

Merge branch 'main' into classifier_base_class

aa1ca8e

Merge branch 'main' into classifier_base_class

5734947

Merge branch 'main' into classifier_base_class

cee784b

_predict _predict_proba

d408408

_predict made abstract, _ predict_proba given a default implmentation n_classes_ added to base class

formatting 1

b17f9a7

formatting 2

57347f7

Merge branch 'main' into classifier_base_class

5af98a0

TonyBagnall marked this pull request as ready for review October 16, 2021 22:17

TonyBagnall requested a review from mloning as a code owner October 16, 2021 22:17

HC comments an experiments fixes

8b29a99

TonyBagnall requested review from fkiraly and chrisholder October 18, 2021 06:47

MatthewMiddlehurst previously approved these changes Oct 18, 2021

View reviewed changes

fkiraly reviewed Oct 18, 2021

View reviewed changes

sktime/base/_base.py Outdated Show resolved Hide resolved

fkiraly reviewed Oct 18, 2021

View reviewed changes

sktime/classification/base.py Outdated Show resolved Hide resolved

fkiraly requested changes Oct 18, 2021

View reviewed changes

Update base.py

b31afcd

TonyBagnall dismissed MatthewMiddlehurst’s stale review via b31afcd October 19, 2021 09:16

Tony Bagnall and others added 4 commits October 19, 2021 10:19

Update base.py

26069f7

Merge branch 'main' of https://github.com/alan-turing-institute/sktime …

2bb1d48

…into classifier_base_class

fix comments in get_tags

27949f6

Merge branch 'classifier_base_class' of https://github.com/alan-turin…

a226422

…g-institute/sktime into classifier_base_class

Tony Bagnall and others added 5 commits October 19, 2021 10:34

remove debug

031a18b

get_tag comment revert

dee524b

format 1

feade0e

Merge branch 'classifier_base_class' of https://github.com/alan-turin…

8cdddff

…g-institute/sktime into classifier_base_class

doc consistency

87dfa72

TonyBagnall requested a review from fkiraly October 19, 2021 11:48

change contributors on both base

d85c4c6

Use markus's github instead of full name, added matthew.

TonyBagnall requested a review from aiwalter as a code owner October 19, 2021 12:38

Matthew Middlehurst and others added 4 commits October 19, 2021 14:00

class dictionary useage in predict_proba default

f0b64bc

Merge branch 'classifier_base_class' of https://github.com/alan-turin…

77779dc

…g-institute/sktime into classifier_base_class

code quality

aa7f500

Update base.py

a4b5b1a

chrisholder approved these changes Oct 19, 2021

View reviewed changes

MatthewMiddlehurst approved these changes Oct 19, 2021

View reviewed changes

TonyBagnall merged commit 005152a into main Oct 19, 2021

TonyBagnall deleted the classifier_base_class branch October 19, 2021 16:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Classifier base class #1517

Classifier base class #1517

TonyBagnall commented Oct 13, 2021 •

edited

fkiraly commented Oct 17, 2021

TonyBagnall commented Oct 18, 2021

MatthewMiddlehurst left a comment

fkiraly left a comment

TonyBagnall commented Oct 19, 2021

fkiraly commented Oct 21, 2021 •

edited

TonyBagnall commented Oct 21, 2021

TonyBagnall commented Oct 21, 2021

fkiraly commented Oct 21, 2021 •

edited

Classifier base class #1517

Classifier base class #1517

Conversation

TonyBagnall commented Oct 13, 2021 • edited

What does this implement/fix? Explain your changes.

fkiraly commented Oct 17, 2021

TonyBagnall commented Oct 18, 2021

MatthewMiddlehurst left a comment

Choose a reason for hiding this comment

fkiraly left a comment

Choose a reason for hiding this comment

TonyBagnall commented Oct 19, 2021

fkiraly commented Oct 21, 2021 • edited

TonyBagnall commented Oct 21, 2021

TonyBagnall commented Oct 21, 2021

fkiraly commented Oct 21, 2021 • edited

TonyBagnall commented Oct 13, 2021 •

edited

fkiraly commented Oct 21, 2021 •

edited

fkiraly commented Oct 21, 2021 •

edited