Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ENH] feature set benchmarks after Fulcher et al for TSC #2983

Open
fkiraly opened this issue Jul 12, 2022 · 3 comments
Open

[ENH] feature set benchmarks after Fulcher et al for TSC #2983

fkiraly opened this issue Jul 12, 2022 · 3 comments
Labels
enhancement Adding new functionality module:classification classification module: time series classification module:metrics&benchmarking metrics and benchmarking modules module:transformations transformations module: time series transformation, feature extraction, pre-/post-processing

Comments

@fkiraly
Copy link
Collaborator

fkiraly commented Jul 12, 2022

It would be really great if we could get sktime to a state where we can easily replicate standard feature set benchmarks (e.g., time series classification) as presented by @benfulcher.

Example experiment:
using the algorithm [transformer][1-knn classif] or [transformer][linear svm], compute/compare performance as supervised time series classifier on the UCI data set repository.

Relevant feature sets, useable as transformers, are listed in #2982

Caveat mentioned by @benfulcher: catch-22 should not be used in isolation on the UCI repository (or not only), since it does not contain summary features. Better option: catch-24 = catch-22 plus mean, var; or, similar: feature union of catch-22 transformer and SummaryTransformer.

FYI @ltsaprounis, @DBCerigo, @TNTran92, interesting use case for kotsu and/or benchmarking work?

@fkiraly fkiraly added module:classification classification module: time series classification module:transformations transformations module: time series transformation, feature extraction, pre-/post-processing enhancement Adding new functionality module:metrics&benchmarking metrics and benchmarking modules labels Jul 12, 2022
@fkiraly fkiraly changed the title [ENH] feature set benchmarks [ENH] feature set benchmarks after Fulcher et al for TSC Jul 12, 2022
@fkiraly
Copy link
Collaborator Author

fkiraly commented Jul 12, 2022

FYI @alex-hh

@benfulcher
Copy link

Some feature sets (constructed to focus only on properties of dynamics, and which are therefore insensitive to rescalings, and thus shifts in mean or variance) would be put at a disadvantage on problems in which there are trivial class differences (like class 1 has a higher mean than class 2). pycatch22 has an option to set catch24=True and include two simple additional features (mean + std) for problems in which this may be class-informative.

@fkiraly
Copy link
Collaborator Author

fkiraly commented Jul 13, 2022

pycatch22 has an option to set catch24=True and include two simple additional features (mean + std) for problems in which this may be class-informative.

Thanks for reminding us 😄. SummaryTransformer has mean, std, and quantile features, so that should also work if in FeatureUnion with catch22.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Adding new functionality module:classification classification module: time series classification module:metrics&benchmarking metrics and benchmarking modules module:transformations transformations module: time series transformation, feature extraction, pre-/post-processing
Projects
None yet
Development

No branches or pull requests

2 participants