New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ENH] New transformation based pipeline classifiers #1721
Conversation
…into transform-classifiers
we already have a pipeline, or do you mean a wrapper for a pipeline where transformer and classifier are specified in constructor? Something like a PipelineClassifier? Seems a bit superfluous and I thought you liked using pipelines directly, but I see no reason why not have one. I agree its best to not have commented out code. Maybe put it all in a directory in contrib called regenerate tests or somesuch? |
RE: Commented out sections for tests, that is a bigger issue than this PR (same pattern follows for most tests), but I don't mind doing it in a separate PR with all the others. |
lets discuss at the dev meeting, I'll put together an agenda |
…into summary-classifier
@fkiraly could you rereview in the light of comments above please? ta |
we agreed at the dev meeting to centralise the commented out code in a separate PR. @fkiraly does this answer your issues? |
Yes, happy if there's a separate issue that says what needs to be done. What did you decide on the boilerplate/DRY issue? |
I still don't really see an issue with the current set-up, issues such as SummaryClassifier multivariate output and extra functionality like train estimates in FreshPrince would not lend well to a generic pipeline. Im not against having a generic pipeline wrapper for users. Using such a wrapper in the current pipelines would require a bit of unwrapping for the above cases, however. |
so to summarise, we thought a wrapper containing a configurable pipeline was a layer of wrapping too far, and it is in fact preferable to encourage users to use a Pipeline, since sklearn users are familiar with them. There is not that much code repetition and not all classifiers could use the base class. |
…into summary-classifier
cc31960
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
questions answered, looks fine to me now.
Can you please remove commented out code before you merge?
this is deferred to here #1765 |
Implements new feature based pipelines for summary statistics, random intervals (with a new transformer to generate them) and a TSFresh based pipeline from an upcoming workshop paper by me and @TonyBagnall. We can probably close #1063 after this unless there are any objections.
There is quite a lot going on here, but the majority of new lines and file changes are just tests.
Does your contribution introduce a new dependency? If yes, which one?
No
What should a reviewer concentrate their feedback on?
The new estimators, they are relatively simple and most are basic transformer > classifier pipelines.
Files of interest are:
sktime/classification/feature_based/_fresh_prince.py
sktime/classification/feature_based/_random_interval_classifier.py
sktime/classification/feature_based/_summary_classifier.py
sktime/transformations/panel/random_intervals.py