Use make_column_selector where appropriate. #92

Merged
merged 1 commit into from Nov 18, 2020

lesteve
Collaborator

@lesteve lesteve commented Nov 17, 2020

I left the manual column selection in the data exploration notebook unchanged, because at that stage we don't want to introduce make_column_selector yet.

Collaborator

@glemaitre glemaitre left a comment

Just a single question; otherwise this is good to merge.

'workclass', 'education', 'marital-status', 'occupation',
'relationship', 'race', 'native-country', 'sex']
categorical_columns_selector = selector(dtype_include=object)
categorical_columns = categorical_columns_selector(data)
Collaborator

Maybe we can split using a new cell here just to show the output of using the selector?
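
For reference, a minimal sketch of what showing the selector output in its own cell could look like. The toy DataFrame below is hypothetical and only stands in for the notebook's dataset; the column dtypes are forced to object so that `dtype_include=object` matches them regardless of the pandas version.

```python
import pandas as pd
from sklearn.compose import make_column_selector as selector

# Hypothetical toy data (an assumption, not the dataset used in the notebook).
data = pd.DataFrame({
    "age": [25, 38, 52],
    "workclass": ["Private", "Local-gov", "Private"],
    "education": ["11th", "HS-grad", "Bachelors"],
}).astype({"workclass": object, "education": object})

# First cell: build the selector and apply it to the data.
categorical_columns_selector = selector(dtype_include=object)
categorical_columns = categorical_columns_selector(data)

# Second cell: display the result so readers see which columns were picked.
print(categorical_columns)

# The complementary selection, using dtype_exclude instead.
numerical_columns = selector(dtype_exclude=object)(data)
print(numerical_columns)
```

Calling the selector on a DataFrame returns a plain list of column names, which is why displaying it in a separate cell makes the selection easy to inspect.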

Collaborator Author

I think we already do this in some earlier notebooks, e.g. here:
https://inria.github.io/scikit-learn-mooc/python_scripts/03_categorical_pipeline.html#working-with-categorical-variables

I am wondering whether it is better to do this in all notebooks or only in one of the first notebooks.

@glemaitre
Collaborator

glemaitre commented Nov 17, 2020 via email

@lesteve lesteve merged commit c58b74d into INRIA:master Nov 18, 2020
@lesteve lesteve deleted the use-make-column-selector-everywhere branch November 18, 2020 07:30