[python] ignore pandas ordered categorical columns by default #2115

Merged
merged 5 commits into from Apr 19, 2019

StrikerRUS
Collaborator

Closed #2108.

  else:
-     categorical_feature = list(categorical_feature) + list(cat_cols)
+     categorical_feature = list(categorical_feature) + cat_cols_not_ordered
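
For context, here is a minimal sketch of how the two lists named in this change might be derived from a DataFrame. The helper `split_categorical_columns` is hypothetical and is not taken from the LightGBM source; it only assumes that pandas exposes the ordering flag of a categorical column through `.cat.ordered`.

```python
import pandas as pd


def split_categorical_columns(df):
    """Hypothetical helper: return (cat_cols, cat_cols_not_ordered)."""
    # All columns with pandas "category" dtype, ordered or not.
    cat_cols = list(df.select_dtypes(include=["category"]).columns)
    # Keep only nominal (unordered) categoricals; ordered ones are skipped.
    cat_cols_not_ordered = [col for col in cat_cols if not df[col].cat.ordered]
    return cat_cols, cat_cols_not_ordered


df = pd.DataFrame({
    "color": pd.Categorical(["red", "green", "red"]),  # nominal
    "size": pd.Categorical(["S", "M", "L"],
                           categories=["S", "M", "L"],
                           ordered=True),               # ordinal
    "price": [1.0, 2.5, 3.0],                           # numeric
})

cat_cols, cat_cols_not_ordered = split_categorical_columns(df)
print(cat_cols)              # ['color', 'size']
print(cat_cols_not_ordered)  # ['color']
```

With a split like this, an ordered column such as `size` would no longer be added to the categorical features by default, which matches the intent stated in the PR title.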
Collaborator Author

I wonder whether it is a good decision to add the default categorical columns even when the user explicitly specifies categorical features. Firstly, this behavior is not documented. Secondly, it is inconvenient for users: the only way to disable the default categorical features is to convert those columns in the source DataFrame. I think it would be more convenient simply not to include them in the list of categorical features. The intuition is that a user wants either to use the default categorical features taken from the categorical columns or to specify categorical features via the parameter, but not both at the same time.
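
To make the difference concrete, here is a small sketch contrasting the merge shown in the diff with the either/or behavior described in this comment. Both function names are hypothetical, and treating the string `"auto"` as the "nothing specified" value is an assumption made for illustration, not a statement about how the follow-up PR will be implemented.

```python
def merge_with_defaults(categorical_feature, cat_cols_not_ordered):
    """Behavior in this PR's diff: defaults are appended to the user's list."""
    if categorical_feature == "auto":  # assumed sentinel for "not specified"
        return list(cat_cols_not_ordered)
    return list(categorical_feature) + list(cat_cols_not_ordered)


def either_or(categorical_feature, cat_cols_not_ordered):
    """Proposed behavior: an explicit list replaces the defaults entirely."""
    if categorical_feature == "auto":  # assumed sentinel for "not specified"
        return list(cat_cols_not_ordered)
    return list(categorical_feature)


defaults = ["color"]  # unordered pandas categorical columns found in the data

print(merge_with_defaults(["city"], defaults))  # ['city', 'color']
print(either_or(["city"], defaults))            # ['city']
print(either_or("auto", defaults))              # ['color']
```

Under the either/or variant, a user who passes an explicit list can leave `color` out without first converting that column in the source DataFrame.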

Collaborator

Good point, this is indeed inconvenient.

Collaborator Author

@guolinke Cool! I'll address this in the next PR, OK?

Collaborator

yeah, sure

StrikerRUS merged commit d115769 into master Apr 19, 2019
StrikerRUS deleted the pandas_cat branch April 19, 2019 11:28
lock bot locked as resolved and limited conversation to collaborators Mar 11, 2020
Development

Successfully merging this pull request may close these issues.

Are SKLEARN API estimators can distinguish PANDAS nominal type feature and PANDAS ordinal type feature?
2 participants