Use make_column_selector where appropriate. #92

lesteve · 2020-11-17T11:22:09Z

I left the one in the data exploration notebook because at this stage we don't want to introduce make_column_selector.

review-notebook-app · 2020-11-17T11:22:25Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

glemaitre

Just a single question; otherwise this is good to merge.

glemaitre · 2020-11-17T14:08:37Z

python_scripts/04_parameter_tuning_search.py

-    'workclass', 'education', 'marital-status', 'occupation',
-    'relationship', 'race', 'native-country', 'sex']
+categorical_columns_selector = selector(dtype_include=object)
+categorical_columns = categorical_columns_selector(data)


Maybe we can split this into a new cell here, just to show the output of the selector?
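For illustration, here is a minimal sketch of what such a cell could display. The small `data` frame below is a hypothetical stand-in for the dataset loaded in the notebook, and the numerical selector is added only for contrast; neither is taken from the diff.

```python
import pandas as pd
from sklearn.compose import make_column_selector as selector

# Hypothetical toy frame standing in for the notebook's dataset.
# The string columns are created with an explicit object dtype so that
# dtype_include=object matches them regardless of the pandas version.
data = pd.DataFrame({
    "age": [25, 38, 28],
    "workclass": pd.Series(["Private", "Private", "Local-gov"], dtype=object),
    "education": pd.Series(["11th", "HS-grad", "Assoc-acdm"], dtype=object),
    "hours-per-week": [40, 50, 40],
})

# Same two lines as in the diff: build the selector, then call it on the data.
categorical_columns_selector = selector(dtype_include=object)
categorical_columns = categorical_columns_selector(data)

# Displaying the result in its own cell shows which columns were picked.
print(categorical_columns)

# For contrast (an addition, not part of the diff): everything that is not object.
numerical_columns = selector(dtype_exclude=object)(data)
print(numerical_columns)
```

Calling the selector returns a plain list of column names, so it can replace the hard-coded list without other changes to the code that follows.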

I guess we already do this in some earlier notebooks, e.g. here:
https://inria.github.io/scikit-learn-mooc/python_scripts/03_categorical_pipeline.html#working-with-categorical-variables

I am wondering whether it is better to do this in all notebooks or only once, in one of the first notebooks.

glemaitre · 2020-11-17T15:28:38Z

Yep, it is true that this is the fourth notebook. This is also fine.

…

On Tue, 17 Nov 2020 at 15:41, Loïc Estève wrote:

> In python_scripts/04_parameter_tuning_search.py:
>
> >  from sklearn.preprocessing import OrdinalEncoder
> > -categorical_columns = [
> > -    'workclass', 'education', 'marital-status', 'occupation',
> > -    'relationship', 'race', 'native-country', 'sex']
> > +categorical_columns_selector = selector(dtype_include=object)
> > +categorical_columns = categorical_columns_selector(data)
>
> I guess we already do this in some earlier notebooks, e.g. here:
> https://inria.github.io/scikit-learn-mooc/python_scripts/03_categorical_pipeline.html#working-with-categorical-variables
>
> I am wondering whether it is better to do this in all notebooks or only once, in one of the first notebooks.

-- Guillaume Lemaitre Scikit-learn @ Inria Foundation https://glemaitre.github.io/

Use make_column_selector where appropriate.

763d5e9

glemaitre approved these changes Nov 17, 2020


lesteve merged commit c58b74d into INRIA:master Nov 18, 2020

lesteve deleted the use-make-column-selector-everywhere branch November 18, 2020 07:30

lesteve added a commit that referenced this pull request Nov 26, 2020

Use make_column_selector where appropriate. (#92)

829f4cf
