wip: addressing comments #159

angela97lin · 2019-10-28T21:39:21Z

Addressing comments for Pipeline v2.0 PR (#108)

codecov · 2019-10-29T17:24:22Z

Codecov Report

❗ No coverage uploaded for pull request base (pipeline_v2@8eb43df).
The diff coverage is 93.58%.

@@              Coverage Diff               @@
##             pipeline_v2     #159   +/-   ##
==============================================
  Coverage               ?   96.58%           
==============================================
  Files                  ?       88           
  Lines                  ?     2140           
  Branches               ?        0           
==============================================
  Hits                   ?     2067           
  Misses                 ?       73           
  Partials               ?        0
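As a quick sanity check on the figures above, the coverage percentage can be recomputed from the hit and line counts. This is only a minimal sketch with illustrative variable names. The observation that the reported 96.58% matches truncation rather than rounding is inferred from these numbers alone, not from any Codecov documentation.

```python
import math

# Counts copied from the coverage diff above.
hits, misses, lines = 2067, 73, 2140

# Hits and misses should account for every line (there are no partials).
assert hits + misses == lines

ratio = hits / lines * 100                  # coverage as a percentage
truncated = math.floor(ratio * 100) / 100   # cut off after two decimals
rounded = round(ratio, 2)                   # round to two decimals

print(truncated, rounded)
```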

| Impacted Files | Coverage Δ |
|---|---|
| evalml/pipelines/regression/random_forest.py | `100% <ø> (ø)` |
| ...ml/pipelines/classification/logistic_regression.py | `100% <ø> (ø)` |
| evalml/pipelines/components/utils.py | `97.87% <ø> (ø)` |
| evalml/models/auto_base.py | `92.16% <ø> (ø)` |
| ...valml/pipelines/components/estimators/estimator.py | `76.47% <ø> (ø)` |
| evalml/problem_types/utils.py | `100% <ø> (ø)` |
| evalml/pipelines/regression/linear_regression.py | `100% <100%> (ø)` |
| evalml/pipelines/classification/random_forest.py | `100% <100%> (ø)` |
| ...l/pipelines/components/transformers/transformer.py | `100% <100%> (ø)` |
| evalml/tests/component_tests/test_components.py | `100% <100%> (ø)` |
| ... and 5 more | |

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 8eb43df...d45ef64.

evalml/pipelines/pipeline_base.py

angela97lin · 2019-10-31T15:10:15Z

Note: currently investigating scikit-learn/scikit-learn#5523

jeremyliweishih

Everything looks good, but maybe we can add a test checking that the input feature names retain the column headers for the set pipelines for now.
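A test of the kind requested here might look roughly like the sketch below. `Table` and `TinyPipeline` are hypothetical stand-ins written for illustration only; they are not evalml's `PipelineBase` or any real evalml API. The sketch assumes the input exposes a `columns` attribute, as a pandas DataFrame does.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Table:
    """Hypothetical stand-in for a DataFrame: named columns plus rows."""
    columns: List[str]
    rows: List[Sequence[float]]


class TinyPipeline:
    """Hypothetical pipeline stand-in; NOT evalml's PipelineBase."""

    def __init__(self) -> None:
        self.input_feature_names: Optional[List[str]] = None

    def fit(self, X, y) -> "TinyPipeline":
        # Keep the column headers if the input exposes them;
        # otherwise fall back to positional names ("0", "1", ...).
        columns = getattr(X, "columns", None)
        if columns is not None:
            self.input_feature_names = [str(c) for c in columns]
        else:
            self.input_feature_names = [str(i) for i in range(len(X[0]))]
        return self


def check_feature_names_retained() -> None:
    X = Table(columns=["age", "income"], rows=[(31, 52.0), (45, 88.5)])
    y = [0, 1]
    pipeline = TinyPipeline().fit(X, y)
    # The headers should survive fitting unchanged and in order.
    assert pipeline.input_feature_names == ["age", "income"]


check_feature_names_retained()
```

In a real test suite the same assertion would presumably be repeated for each predefined pipeline, with the stand-in replaced by the actual pipeline class.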

* adding skeleton for component_base
* oops, added nested components folders, fixing
* linting
* adding skeleton estimator/transfomer classes
* adding basic estimator components for merge merge
* WIP: Components (#107)
* Added imputer
* Clean up imputer
* added onehot and standard scaler
* Need fix selectfrommodel and validating
* Add selectfrommodel
* Cleanup and added basic init test
* lint
* cleaning up, more merging, combining tests
* adding componenttype enum, a little more merging
* linting
* Moved to estimator
* lint fix :P
* pipeline v2 base
* pushing new base for pipeline and simple test
* Faulty scaler for LR
* Working LR
* Broken RF
* Fixed SelectFromModel
* Added RF pipeline Fixed SelectFromModel
* changing xgboost to use our components
* Added RF Regression
* beginning to fix broken pieces
* continuing to fix tests
* fixing tests in autoclassifier
* linting
* fixing silly typo bug
* linting
* cleaning up pipeline and pipelinebase classes
* adding describe to components and pipeline
* linting and fixing minor bug
* adding check for estimator in pipeline
* Pipeline indexing (#118)
* Added indexing and basic tests
* Switched to pipelinebase for slicing
* Clean up and add docstrings
* lint
* lint again
* Clean up and add error for setting
* adding default value to next() to prevent StopIteration error
* oops, actually fixing...
* fixing docstrings and cleaning imports
* adding simple tests for describe
* linting
* Autogenerated pipeline names (#122)
* Basic name without check
* Add assert
* lint
* Changed name format and name constants
* Cleanup
* moving files to subfolders and removing hyperparameters as class var
* updating file hierarchy
* linting
* removing duplicate
* Add feature_importance tests
* adding extra tests
* adding test, cleanup
* Added linear regression and test for pipeling fitting (#131)
* Added linear regression pipeline and added test for fitting
* Remove unnecessary fit for xgboost component (#148)
* Remove fit for xgboost
* Remove kwargs
* addressing pr comments: rename, abstract feature_importances, del __init__, cleanup, etc. (#149)
* addressing pr comments: rename, del __init__, cleanup
* cleaning up describe
* feature importances + cleanup of components, added subclasses encoder + feature_selector
* linting, fixing errors
* adding less specific version
* addressing comments
* import errors
* String and component_type component (#153)
* added handling str and component type for component list
* Adding model_type / problem_type to PipelineBase to allow inference (#157)
* adding problem_type and model_type to pipeline base
* problem_type --> problem_types
* cleaning up self.component_list and init pipeline, typo
* forgot to remove print
* removing comments
* changelog
* removing generic SelectFromModel
* Jeremy changes (#164)
* cleanup components utils
* Switch to category_encoder and cleanup RF Select
* add feature_importance to estimator
* More switching to CE
* Move parameters to class attributes and make test
* lint
* Separate changelog test
* Text cleanup
* wip: addressing comments (#159)
* wip: addressing comments
* feature names?
* adding fix for no estimator in generate_name
* fixing test
* addressing more comments on feature_importance
* feature_importances fixed and cleaned :)
* oops, missed merge conflict
* change of name
* minor cleanup
* adding basic test for two feature selectors and adding default components in DEFAULT_COMPONENTS
* adding tests for retaining feature names in input_feature_names
* addressing comments
* cleanup
* fixing
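One entry in the commit list mentions adding a default value to `next()` to prevent a `StopIteration` error. The general Python pattern can be sketched as follows. The component tuples and the `find_estimator` helper are hypothetical, made up for illustration, and are not taken from the evalml code.

```python
from typing import List, Optional, Tuple

# Hypothetical (name, kind) pairs standing in for pipeline components.
Component = Tuple[str, str]


def find_estimator(components: List[Component]) -> Optional[str]:
    """Return the name of the first estimator, or None if there is none.

    next(iterator) raises StopIteration when the iterator is exhausted;
    passing a second argument makes it return that default instead.
    """
    return next(
        (name for name, kind in components if kind == "estimator"),
        None,  # default returned when no estimator is present
    )


with_estimator = [("imputer", "transformer"), ("rf", "estimator")]
without_estimator = [("imputer", "transformer"), ("scaler", "transformer")]

print(find_estimator(with_estimator))
print(find_estimator(without_estimator))
```

Returning `None` lets the caller decide how to handle a pipeline without an estimator, instead of having an unrelated-looking `StopIteration` propagate.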

angela97lin added 5 commits October 28, 2019 17:37

wip: addressing comments

3962107

feature names?

52b13e4

Merge branch 'pipeline_v2' of github.com:FeatureLabs/evalml into pv2_review

b4f5950

adding fix for no estimator in generate_name

8d8fe8d

fixing test

4b5a7de

addressing more comments on feature_importance

6a42929

angela97lin commented Oct 29, 2019

evalml/pipelines/pipeline_base.py

angela97lin added 5 commits October 31, 2019 14:18

feature_importances fixed and cleaned :)

7cbe958

merging

1483b76

oops, missed merge conflict

a367558

change of name

c4975ed

minor cleanup

a9b4c2e

angela97lin requested a review from jeremyliweishih October 31, 2019 18:49

angela97lin added 2 commits October 31, 2019 15:02

adding basic test for two feature selectors and adding default components in DEFAULT_COMPONENTS

56e66f2

Merge branch 'pipeline_v2' of github.com:FeatureLabs/evalml into pv2_review

9d16675

angela97lin mentioned this pull request Oct 31, 2019

Support two feature selection steps in pipelines #162

Closed

jeremyliweishih requested changes Oct 31, 2019

adding tests for retaining feature names in input_feature_names

d45ef64

angela97lin requested a review from jeremyliweishih October 31, 2019 21:44

jeremyliweishih approved these changes Oct 31, 2019

angela97lin merged commit 81fd22c into pipeline_v2 Oct 31, 2019

angela97lin deleted the pv2_review branch October 31, 2019 21:55
