Extend FeatureRanking interface for regression tasks #684

Open
sabbatinif opened this issue Jul 22, 2021 · 2 comments

@sabbatinif

It may be useful to have a feature ranking procedure applicable not only to classification tasks (e.g. SignalNoiseRatio and SumSquaresRatio, implementing the FeatureRanking interface), but also to regression tasks.
At the moment the FeatureRanking interface only accepts integer target vectors for calculating the feature rank.
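To illustrate the request, here is a minimal sketch of what a regression counterpart could look like: the same kind of `rank` call, but with a `double[]` target. The class name `UnivariateRegressionRanking` and its signature are hypothetical and are not part of Smile. The score used here (the squared Pearson correlation of each feature with the target, converted to an F statistic with 1 and n - 2 degrees of freedom) is only one possible criterion, chosen as an assumption for the example.

```java
/** Hypothetical sketch, not part of Smile: ranks features against a real-valued target. */
final class UnivariateRegressionRanking {
    private UnivariateRegressionRanking() {}

    /**
     * Returns one score per column of x; a higher score means a stronger
     * linear relationship with y. Assumes x is rectangular and n > 2.
     */
    public static double[] rank(double[][] x, double[] y) {
        int n = x.length;
        int p = x[0].length;

        double meanY = 0.0;
        for (double v : y) meanY += v;
        meanY /= n;

        double[] scores = new double[p];
        for (int j = 0; j < p; j++) {
            double meanX = 0.0;
            for (int i = 0; i < n; i++) meanX += x[i][j];
            meanX /= n;

            double sxy = 0.0, sxx = 0.0, syy = 0.0;
            for (int i = 0; i < n; i++) {
                double dx = x[i][j] - meanX;
                double dy = y[i] - meanY;
                sxy += dx * dy;
                sxx += dx * dx;
                syy += dy * dy;
            }

            // A constant feature (or constant target) carries no information.
            if (sxx == 0.0 || syy == 0.0) {
                scores[j] = 0.0;
                continue;
            }

            // Squared Pearson correlation, clamped against rounding error.
            double r2 = Math.min(1.0, (sxy * sxy) / (sxx * syy));
            // F statistic of the single-feature linear regression.
            scores[j] = (r2 == 1.0) ? Double.POSITIVE_INFINITY : r2 / (1.0 - r2) * (n - 2);
        }
        return scores;
    }
}
```

Calling `UnivariateRegressionRanking.rank(x, y)` with a `double[][]` feature matrix and a `double[]` target returns one score per feature, analogous to how the existing classification rankers are used with integer labels.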

@haifengl
Owner

What feature selection criteria for regression are of interest?

@sabbatinif
Author

I have no strong preferences about the criteria. I can suggest something similar to scikit-learn's feature_selection.f_regression. It consists of a sequential algorithm that iteratively and greedily selects the most relevant features of a dataset. It starts by training a temporary regressor on a single feature (the one most correlated with the output values) and keeps repeating this operation, adding one feature at a time and always picking the one that most increases the temporary regressor's predictive performance. At the end of this process, the features are ranked by relevance. But any other criterion would be useful to me.
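The greedy procedure described above can be sketched as follows. Note that scikit-learn's `f_regression` itself scores each feature separately with a univariate F-test, so the sequential description is closer to a forward selection. This is a sketch under stated assumptions, not a proposed Smile API: the class name `ForwardFeatureRanking` is hypothetical, the temporary regressor is assumed to be ordinary least squares with an intercept, and "predictive performance" is measured as the in-sample reduction of the residual sum of squares rather than a cross-validated score. Under those assumptions, each step can be computed by orthogonalising the remaining features against the ones already chosen, without refitting a model from scratch.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch, not part of Smile: greedy forward ranking for regression. */
final class ForwardFeatureRanking {
    private static final double EPS = 1e-12;

    private ForwardFeatureRanking() {}

    /** Returns feature indices ordered from most to least relevant. */
    public static int[] rank(double[][] x, double[] y) {
        int n = x.length;
        int p = x[0].length;

        // Centering the target and every column accounts for the intercept.
        double[] residual = centered(y);
        double[][] cols = new double[p][];
        for (int j = 0; j < p; j++) {
            double[] c = new double[n];
            for (int i = 0; i < n; i++) c[i] = x[i][j];
            cols[j] = centered(c);
        }

        List<Integer> remaining = new ArrayList<>();
        for (int j = 0; j < p; j++) remaining.add(j);

        int[] order = new int[p];
        for (int step = 0; step < p; step++) {
            int best = -1;
            double bestGain = -1.0;
            for (int j : remaining) {
                double norm = dot(cols[j], cols[j]);
                double proj = dot(cols[j], residual);
                // Drop in residual sum of squares if feature j joins the model.
                double gain = norm < EPS ? 0.0 : proj * proj / norm;
                if (gain > bestGain) {
                    bestGain = gain;
                    best = j;
                }
            }
            order[step] = best;
            remaining.remove(Integer.valueOf(best));

            double norm = dot(cols[best], cols[best]);
            if (norm < EPS) continue; // redundant or constant feature, nothing to remove

            // Remove the chosen direction from the residual and the other features.
            subtractProjection(residual, cols[best], norm);
            for (int k : remaining) subtractProjection(cols[k], cols[best], norm);
        }
        return order;
    }

    private static double[] centered(double[] v) {
        double mean = 0.0;
        for (double a : v) mean += a;
        mean /= v.length;
        double[] out = new double[v.length];
        for (int i = 0; i < v.length; i++) out[i] = v[i] - mean;
        return out;
    }

    private static double dot(double[] a, double[] b) {
        double s = 0.0;
        for (int i = 0; i < a.length; i++) s += a[i] * b[i];
        return s;
    }

    private static void subtractProjection(double[] v, double[] q, double qNorm) {
        double coef = dot(v, q) / qNorm;
        for (int i = 0; i < v.length; i++) v[i] -= coef * q[i];
    }
}
```

The first feature picked is the one most correlated with the target, as in the description. Swapping the in-sample gain for a held-out or cross-validated score would follow the same loop but require refitting the temporary regressor at each step.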
