Expose get_feature_names on OneHotEncoder #1193

angela97lin · 2020-09-17T21:12:50Z

Questions:

scikit-learn only allows ohe.get_feature_names(input_features) to be called on the entire data set (https://github.com/scikit-learn/scikit-learn/blob/0fb307bf3/sklearn/preprocessing/_encoders.py#L580). Do we want to loosen this restriction? Unclear since I'm not sure exactly what context this will be used in. For now, could be sufficient to just follow impl and add as a later PR if the need arises.

codecov · 2020-09-17T21:19:04Z

Codecov Report

Merging #1193 into main will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##             main    #1193   +/-   ##
=======================================
  Coverage   99.92%   99.92%           
=======================================
  Files         196      196           
  Lines       11978    11987    +9     
=======================================
+ Hits        11969    11978    +9     
  Misses          9        9

Impacted Files	Coverage Δ
...components/transformers/encoders/onehot_encoder.py	`100.00% <100.00%> (ø)`
...alml/tests/component_tests/test_one_hot_encoder.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e9330de...0e3540b. Read the comment docs.

evalml/pipelines/components/transformers/encoders/onehot_encoder.py

dsherry

@angela97lin looks great!

I wonder if we'll eventually want to adapt get_feature_names to be defined in Transformer. So that we can access the names of features generated by any component.

evalml/pipelines/components/transformers/encoders/onehot_encoder.py

angela97lin · 2020-09-18T15:21:34Z

@dsherry Yeah, I could see something like that for the Transformer method too in the future. We already have input_feature_names on the pipelines, and getting the output could also be useful.

init

8aace59

angela97lin added this to the September 2020 milestone Sep 17, 2020

angela97lin self-assigned this Sep 17, 2020

angela97lin added 2 commits September 17, 2020 17:13

release notes

c403982

Merge branch 'main' into 1183_get_feature_names

f582d61

move release notes

b11c2b4

angela97lin requested review from dsherry, freddyaboulton, bchen1116 and jeremyliweishih September 17, 2020 21:36

freddyaboulton reviewed Sep 17, 2020

View reviewed changes

evalml/pipelines/components/transformers/encoders/onehot_encoder.py Outdated Show resolved Hide resolved

dsherry approved these changes Sep 17, 2020

View reviewed changes

evalml/pipelines/components/transformers/encoders/onehot_encoder.py Outdated Show resolved Hide resolved

evalml/pipelines/components/transformers/encoders/onehot_encoder.py Show resolved Hide resolved

Merge branch 'main' into 1183_get_feature_names

c54d414

angela97lin added 2 commits September 18, 2020 12:28

remove input_features param

52e3628

Merge branch 'main' into 1183_get_feature_names

0e3540b

angela97lin merged commit 94af4f0 into main Sep 18, 2020

angela97lin deleted the 1183_get_feature_names branch September 18, 2020 18:05

This was referenced Sep 23, 2020

OneHotEncoder.get_feature_names errors #773

Closed

Release v0.14.1 #1241

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose get_feature_names on OneHotEncoder #1193

Expose get_feature_names on OneHotEncoder #1193

angela97lin commented Sep 17, 2020

codecov bot commented Sep 17, 2020 •

edited

Loading

dsherry left a comment

angela97lin commented Sep 18, 2020

Expose get_feature_names on OneHotEncoder #1193

Expose get_feature_names on OneHotEncoder #1193

Conversation

angela97lin commented Sep 17, 2020

codecov bot commented Sep 17, 2020 • edited Loading

Codecov Report

dsherry left a comment

Choose a reason for hiding this comment

angela97lin commented Sep 18, 2020

codecov bot commented Sep 17, 2020 •

edited

Loading