getFeaturesCols() should not return the fold column or weight column #4508
Comments
Jakub Hava commented: It should probably print a warning, letting the user know about this behavior. Also, from a data science point of view, Lauren DiPerna, can you verify that excluding these 2 columns for scoring makes sense? I don't have the DS background. Thanks!
Lauren DiPerna commented: Hi Jakub Hava, yes, I think it makes sense to exclude these two columns from a DS perspective, because the fold column and weights column shouldn't be used as features during training, and the fold column in particular shouldn't be required during scoring. Hope this helps!
Jakub Hava commented: Cool! Thank you, Lauren DiPerna. This change will overlap with the big API cleanup introduced by SW-1231 (which will go into a major release).
Marek Novotny commented: PR: #1177
JIRA Issue Migration Info (cont'd): Jira issue created date: 2019-04-25T17:40:22.836-0700
Currently, getFeaturesCols will return the fold column and the weights column. We should update this method so that it only returns columns that are used to train on.
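To make the requested behavior concrete, here is a minimal sketch in plain Python. It is not the Sparkling Water implementation: `training_feature_cols`, `fold_col`, and `weight_col` are hypothetical names chosen for illustration. It assumes the features list may contain the fold and weight columns, filters them out, and emits a warning when it does so, as suggested in the comments.

```python
import warnings
from typing import List, Optional


def training_feature_cols(
    features_cols: List[str],
    fold_col: Optional[str] = None,
    weight_col: Optional[str] = None,
) -> List[str]:
    """Hypothetical helper: drop the fold and weight columns from a feature list."""
    # Only columns that are actually configured can be excluded.
    excluded = {c for c in (fold_col, weight_col) if c is not None}
    dropped = [c for c in features_cols if c in excluded]
    if dropped:
        # Tell the user which columns were removed instead of dropping them silently.
        warnings.warn(
            "Excluding non-feature columns from the features list: "
            + ", ".join(dropped)
        )
    # Preserve the original order of the remaining columns.
    return [c for c in features_cols if c not in excluded]


cols = ["age", "income", "fold", "weight"]
print(training_feature_cols(cols, fold_col="fold", weight_col="weight"))
# prints ['age', 'income']
```

When neither column is set, the list comes back unchanged and no warning is raised, so callers that do not use fold or weight columns see no difference.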