-
-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ENH] Implementing SplitterSummarizer Transformer #4759
Conversation
Thanks for your contribution.
Unfortunately, your code style is a bit different the rest of the codebase, and this is being checked by the configured hooks in What you can do are the following:
Please review the changes before you stage. Ideally, they should match what you see here. Also, feel free to use any commit message of your choice. For your next commits, all the configured git hooks will run automatically for the staged files while you run |
do you still need help on this, or input, @BensHamza? |
Sorry for the time it takes and thank you for the follow-up. I needed some time to get used to the codebase. I had first a problem understanding how to make the pipeline mentionned in #4664 work since This last code version works fine on this last example, Can you please tell me if it is the right guess? Also, there are surely some improvements to this version. I would appreciate any feedback. Thank you ! |
That's an excellent point! I made a mistake here, I was thinking of a (hypothetical) PCA transformer that produces PCA loadings probably (a primitive/row vector), rather than the actual PCA transforme that is currently in the code base (this produces scores, column vectors).
Yes, you understand correctly imo! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@BensHamza, looks great! This is exactly what we were looking for!
Let's see whether the tests run through, nice contribution!
I left some comments above. We could merge as is, but it would be great to straighten out the docstrings and clarify the index
parameter.
Additional features we can leave for later, e.g., an option to remember data seen in fit
and use the union of data seen in fit
/ transform
- that might be important for forecasting use cases.
Re test failures, this is due to the test parameters using the catch22 transformer, which has some soft dependencies ( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great!
Would approve if all tests pass.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Really great, good to go!
I've made two minor changes to expedite this for the release:
- added the estimator to the API reference, so users can find it easily
- fixed the docstring - for multi-line cells, use
...
and not>>>
(see the latest commits for this)
FYI, I think you did not change the default from |
Thank you @fkiraly for these changes and the explanations.
My bad. I did the changes. I was having a problem passing the |
sure - let's check where/if it still fails. |
FYI, I shortened the windows in some test cases in |
Thank you @fkiraly , I learned a lot in the process. |
well, @BensHamza, hopefully this is just the start of a long and fruitful collaboration! |
Reference Issues/PRs
Fixes #4664 .
What does this implement/fix? Explain your changes.
This PR introduces a new series-to-series transformer SplitterSummarizer.
Does your contribution introduce a new dependency? If yes, which one?
No
Any other comments?
The testing isn't working
PR checklist
For all contributions
How to: add yourself to the all-contributors file in the
sktime
root directory (not theCONTRIBUTORS.md
). Common badges:code
- fixing a bug, or adding code logic.doc
- writing or improving documentation or docstrings.bug
- reporting or diagnosing a bug (get this pluscode
if you also fixed the bug in the PR).maintenance
- CI, test framework, release.See here for full badge reference
See here for further details on the algorithm maintainer role.
For new estimators
docs/source/api_reference/taskname.rst
, follow the pattern.Examples
section.python_dependencies
tag and ensureddependency isolation, see the estimator dependencies guide.