Users can now pass in all valid kwargs to Estimators #1157

freddyaboulton · 2020-09-10T16:45:34Z

Pull Request Description

After creating the pull request: in order to pass the release_notes_updated check you will need to update the "Future Release" section of docs/source/release_notes.rst to include this pull request by adding :pr:123.

codecov · 2020-09-10T16:51:33Z

Codecov Report

Merging #1157 into main will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##             main    #1157   +/-   ##
=======================================
  Coverage   99.91%   99.91%           
=======================================
  Files         197      197           
  Lines       11663    11705   +42     
=======================================
+ Hits        11653    11695   +42     
  Misses         10       10

Impacted Files	Coverage Δ
...ents/estimators/classifiers/logistic_regression.py	`100.00% <ø> (ø)`
...ents/estimators/classifiers/catboost_classifier.py	`100.00% <100.00%> (ø)`
...ts/estimators/classifiers/elasticnet_classifier.py	`100.00% <100.00%> (ø)`
...onents/estimators/regressors/catboost_regressor.py	`100.00% <100.00%> (ø)`
.../tests/component_tests/test_catboost_classifier.py	`100.00% <100.00%> (ø)`
...l/tests/component_tests/test_catboost_regressor.py	`100.00% <100.00%> (ø)`
evalml/tests/component_tests/test_components.py	`100.00% <100.00%> (ø)`
evalml/tests/component_tests/test_en_classifier.py	`100.00% <100.00%> (ø)`

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update b38b51b...62d3b30.

jeremyliweishih

This looks great to me! I really like the coverage but just had one clarifying question.

jeremyliweishih · 2020-09-11T18:17:03Z

evalml/tests/component_tests/test_components.py

+    estimator = estimator_class()
+    if estimator._component_obj is None:
+        pytest.skip(f"Skipping {estimator_class} because does not have component object.")
+    params = estimator._component_obj.get_params()


Does this test whether every estimator accepts the default keyword arguments and values of its component_obj?
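A rough sketch of the round trip the snippet above appears to exercise, assuming the next step of the test is to pass those params back into the estimator's constructor (the rest of the test is not shown in this thread). `FakeBackend`, `WrappedEstimator`, and `accepts_backend_defaults` are hypothetical stand-ins for illustration, not evalml code.

```python
import inspect


class FakeBackend:
    """Hypothetical stand-in for a third-party model (not a real library class)."""

    def __init__(self, penalty="l2", C=1.0, solver="lbfgs"):
        self.penalty = penalty
        self.C = C
        self.solver = solver

    def get_params(self):
        # Mirror the scikit-learn convention: one entry per __init__ argument.
        names = list(inspect.signature(self.__init__).parameters)
        return {name: getattr(self, name) for name in names}


class WrappedEstimator:
    """Hypothetical wrapper: named arguments plus **kwargs forwarded to the backend."""

    def __init__(self, penalty="l2", C=1.0, **kwargs):
        self.parameters = {"penalty": penalty, "C": C}
        self.parameters.update(kwargs)
        self._component_obj = FakeBackend(**self.parameters)


def accepts_backend_defaults(estimator_class):
    """Return True if the wrapper can be rebuilt from its backend's own parameters."""
    params = estimator_class()._component_obj.get_params()
    try:
        rebuilt = estimator_class(**params)
    except TypeError:
        # The wrapper rejected a keyword that the backend itself reports.
        return False
    return rebuilt._component_obj.get_params() == params
```

A wrapper whose `__init__` lacks `**kwargs` would fail this check as soon as the backend reports a parameter the wrapper does not name.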

angela97lin

Looooks good! Thanks for the cleanup 😁

dsherry · 2020-09-18T14:00:32Z

evalml/pipelines/components/estimators/regressors/catboost_regressor.py

+                      'silent': silent}
+        if kwargs.get('allow_writing_files', False):
+            warnings.warn("Parameter allow_writing_files is being set to False in CatBoostRegressor")
+        kwargs["allow_writing_files"] = False


@freddyaboulton it looks like this code updates allow_writing_files so that it will always be False. Why is this necessary? If a user wants to set this parameter, why not allow them? We can still encode a default like so, without adding this parameter to the parameters dict:

cb_parameters['allow_writing_files'] = kwargs.get('allow_writing_files', False)

This question also applies to the changes for the CatBoost classifier and the ElasticNet classifier.

Discussed with @freddyaboulton.

Options:

1. Allow users to set allow_writing_files to whatever they want. Make it a named field.

2. Add the ability to have an allow-list and deny-list for parameters, i.e. if someone passes in allow_writing_files, we raise an error.

3. Keep the code as-is.

We liked the first option. Let's expose allow_writing_files as a named value in init, defaulting to False, and delete the warning code here.
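A minimal sketch of what the first option could look like, assuming the wrapper simply forwards its parameters to the underlying model. `FakeCatBoost` and `CatBoostWrapper` are hypothetical names, and the `silent=True` default is an assumption made for illustration; neither is taken from the evalml source.

```python
class FakeCatBoost:
    """Hypothetical stand-in for the CatBoost model; it only records its arguments."""

    def __init__(self, **params):
        self.params = params


class CatBoostWrapper:
    """Hypothetical wrapper illustrating option 1; not the evalml class."""

    def __init__(self, silent=True, allow_writing_files=False, **kwargs):
        # The default is visible in the signature, and callers may override it,
        # so no warning or forced overwrite is needed.
        parameters = {"silent": silent,
                      "allow_writing_files": allow_writing_files}
        parameters.update(kwargs)
        self.parameters = parameters
        self._component_obj = FakeCatBoost(**parameters)


default = CatBoostWrapper()
custom = CatBoostWrapper(allow_writing_files=True, depth=4)
```

Here `default.parameters["allow_writing_files"]` stays `False`, while `custom` passes the user's value straight through to the backend.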

dsherry · 2020-09-18T14:03:10Z

evalml/pipelines/components/estimators/classifiers/logistic_regression.py

@@ -26,8 +26,6 @@ def __init__(self, penalty="l2", C=1.0, n_jobs=-1, random_state=0, **kwargs):
        parameters.update(kwargs)

        lr_classifier = LogisticRegression(random_state=random_state,
-                                           multi_class="auto",
-                                           solver="lbfgs",


@freddyaboulton is there a reason why we shouldn't add these as named parameters to init, so that the defaults are still clear?

def __init__(self, ..., multi_class='auto', solver='lbfgs', ..., **kwargs):

Discussed with @freddyaboulton. Plan: let's do this.
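One way to picture the suggestion, as a sketch only: the signature below extends the one in the diff header with `multi_class` and `solver`, and the backend model is left out so the example stays self-contained. `LogisticRegressionSketch` and `named_defaults` are hypothetical names, and keeping `random_state` outside the parameters dict is an assumption.

```python
import inspect


class LogisticRegressionSketch:
    """Hypothetical wrapper; the real backend model is omitted here."""

    def __init__(self, penalty="l2", C=1.0, n_jobs=-1, random_state=0,
                 multi_class="auto", solver="lbfgs", **kwargs):
        parameters = {"penalty": penalty, "C": C, "n_jobs": n_jobs,
                      "multi_class": multi_class, "solver": solver}
        # Any other valid keyword still flows through untouched.
        parameters.update(kwargs)
        self.parameters = parameters
        self.random_state = random_state


def named_defaults(cls):
    """Collect the defaults that are discoverable from the __init__ signature."""
    signature = inspect.signature(cls.__init__)
    return {name: param.default
            for name, param in signature.parameters.items()
            if param.default is not inspect.Parameter.empty}
```

With the defaults in the signature, tools such as `inspect.signature` or `help()` can report `multi_class` and `solver` without anyone reading the body of `__init__`.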

freddyaboulton force-pushed the 1155-cant-pass-in-certain-kwargs branch from 2f1e873 to 3b01f7a Compare September 11, 2020 16:00

freddyaboulton marked this pull request as ready for review September 11, 2020 16:31

freddyaboulton requested review from angela97lin, dsherry, eccabay, jeremyliweishih, bchen1116 and christopherbunn September 11, 2020 16:31

jeremyliweishih approved these changes Sep 11, 2020

angela97lin approved these changes Sep 11, 2020

freddyaboulton force-pushed the 1155-cant-pass-in-certain-kwargs branch from 3b01f7a to dcef397 Compare September 11, 2020 18:48

freddyaboulton added 2 commits September 15, 2020 09:55

Users can now pass in all valid kwargs to LogisticRegressionClassifier, ENClassifier, and Catboost estimators.

c59859c

Adding PR 1157 to release notes.

62d3b30

freddyaboulton force-pushed the 1155-cant-pass-in-certain-kwargs branch from dcef397 to 62d3b30 Compare September 15, 2020 13:56

freddyaboulton merged commit 54f6767 into main Sep 15, 2020

This was referenced Sep 17, 2020

Release v0.14.0 #1191

Closed

Release v0.13.2 #1192

Merged

dsherry reviewed Sep 18, 2020

freddyaboulton mentioned this pull request Sep 18, 2020

Adding some new kwargs to Catboost and LogisticRegression #1202

Merged

freddyaboulton deleted the 1155-cant-pass-in-certain-kwargs branch October 22, 2020 18:28
