Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PUBDEV-8927: Remove redundant predictors if found for backward mode #6446

Merged

Conversation

wendycwong
Copy link
Contributor

@wendycwong wendycwong commented Dec 2, 2022

This PR fixes the issue in JIRA: https://h2oai.atlassian.net/browse/PUBDEV-8927

I added support to remove multiple predictors at the beginning if we detected collinear columns for backward mode of modelselection.

Basically, if I detected that we are the beginning of the model building process, after the first model is built, we will check and see if there are columns that are not active. This is done by looking at the coefficient indices stored in best submodel idxs. If the idxs is null, that means all coefficients are used.

Added a python test to make sure the fix is correct.

…election.

PUBDEV-8927: Add check to stop model building when no more predictors are available.
Copy link
Contributor

@tomasfryda tomasfryda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, thank you @wendycwong !

@wendycwong wendycwong merged commit d2c4bb3 into rel-zygmund Dec 11, 2022
@wendycwong wendycwong deleted the wendy_pubdev_8927_remove_collinear_columns_backward branch December 11, 2022 22:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants