Adding some new kwargs to Catboost and LogisticRegression #1202

freddyaboulton · 2020-09-18T17:09:31Z

Pull Request Description

In #1157, we agreed to add the multi_class and solver kwargs to the logistic regression init (discussion) and to add allow_writing_files to the Catboost init (discussion).

This doesn't change the default behavior of these estimators because we are using the same defaults as before.

After creating the pull request: in order to pass the release_notes_updated check you will need to update the "Future Release" section of docs/source/release_notes.rst to include this pull request by adding :pr:123.

codecov · 2020-09-18T17:16:27Z

Codecov Report

Merging #1202 into main will decrease coverage by 0.00%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main    #1202      +/-   ##
==========================================
- Coverage   99.74%   99.74%   -0.01%     
==========================================
  Files         196      196              
  Lines       12020    11998      -22     
==========================================
- Hits        11989    11967      -22     
  Misses         31       31

| Impacted Files | Coverage Δ |
|---|---|
| ...ents/estimators/classifiers/catboost_classifier.py | `100.00% <ø> (ø)` |
| ...onents/estimators/regressors/catboost_regressor.py | `100.00% <ø> (ø)` |
| .../tests/component_tests/test_catboost_classifier.py | `100.00% <ø> (ø)` |
| ...l/tests/component_tests/test_catboost_regressor.py | `100.00% <ø> (ø)` |
| evalml/tests/pipeline_tests/test_pipelines.py | `99.22% <ø> (ø)` |
| ...ents/estimators/classifiers/logistic_regression.py | `100.00% <100.00%> (ø)` |
| evalml/tests/component_tests/test_components.py | `99.30% <100.00%> (ø)` |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

Powered by Codecov. Last update 8a71685...c91b5fd.

jeremyliweishih

Looks good - really like these changes!

angela97lin

Nice, this looks clean! I should have chimed in before but allow_writing_files was previously set to default to False so that evalml wouldn't also dump all of the catboost training logs, but it's also great to provide this as an option to users 😁 Thanks, @freddyaboulton!

freddyaboulton · 2020-09-24T14:08:44Z

@dsherry Is this good to merge despite codecov/project failure? I think it would be ok because the number of misses has not increased compared to main (still 9). I think the issue is that the coverage in this branch is 11982/11991 = 0.999249, and the coverage in main is 12004/12013 = 0.999250 which is technically a decrease 💀 . Maybe in the future we can change codecov to fail only if the number of missed lines increases?
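The arithmetic behind this can be checked with a minimal sketch. It uses only the hit and line counts quoted in the comment above; the helper name `coverage` and the "fail only when misses increase" rule are illustrative assumptions, not Codecov's actual configuration or API.

```python
from fractions import Fraction


def coverage(hits: int, lines: int) -> Fraction:
    """Return the exact covered fraction hits / lines."""
    return Fraction(hits, lines)


# Counts quoted in the comment above.
branch_hits, branch_lines = 11982, 11991
main_hits, main_lines = 12004, 12013

branch_cov = coverage(branch_hits, branch_lines)
main_cov = coverage(main_hits, main_lines)

branch_misses = branch_lines - branch_hits
main_misses = main_lines - main_hits

# With the same number of misses, fewer total lines gives a slightly lower ratio.
ratio_decreased = branch_cov < main_cov

# Hypothetical alternative rule: fail only if the number of missed lines grows.
misses_increased = branch_misses > main_misses

print(f"branch: {float(branch_cov):.6f} ({branch_misses} misses)")
print(f"main:   {float(main_cov):.6f} ({main_misses} misses)")
print("ratio decreased:", ratio_decreased)
print("misses increased:", misses_increased)
```

Under these numbers a ratio-based check reports a decrease even though no new lines are missed, while a miss-count check would pass.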

dsherry

@freddyaboulton looks good to me!

dsherry · 2020-09-18T18:14:58Z

evalml/pipelines/components/estimators/classifiers/catboost_classifier.py

@@ -33,16 +32,14 @@ class CatBoostClassifier(Estimator):
    SEED_MAX = SEED_BOUNDS.max_bound

    def __init__(self, n_estimators=10, eta=0.03, max_depth=6, bootstrap_type=None, silent=True,
-                 random_state=0, **kwargs):
+                 allow_writing_files=False, random_state=0, **kwargs):


Awesome, thanks!

I think we should stick to this rule: if we do anything with a kwargs param other than pass it along to the component obj, we should declare it as a named parameter. It'll keep our code clean.
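A minimal sketch of that convention, not EvalML's actual implementation: `SketchEstimator` and `_FakeComponent` are hypothetical stand-ins, and only the parameter names and defaults in the signature come from the diff above. Any argument the wrapper itself inspects is a named parameter; everything else in `**kwargs` is forwarded untouched.

```python
class _FakeComponent:
    """Hypothetical stand-in for the wrapped third-party model."""

    def __init__(self, **params):
        self.params = params


class SketchEstimator:
    """Hypothetical wrapper illustrating the named-parameter convention."""

    def __init__(self, n_estimators=10, eta=0.03, max_depth=6, bootstrap_type=None,
                 silent=True, allow_writing_files=False, random_state=0, **kwargs):
        # Parameters the wrapper knows about are named, so their defaults
        # are visible in the signature and recorded explicitly.
        self.parameters = {
            "n_estimators": n_estimators,
            "eta": eta,
            "max_depth": max_depth,
            "silent": silent,
            "allow_writing_files": allow_writing_files,
        }
        # Assumed behavior for this sketch: forward bootstrap_type only if set.
        if bootstrap_type is not None:
            self.parameters["bootstrap_type"] = bootstrap_type
        # Anything left in kwargs is passed along unchanged, never inspected.
        self.parameters.update(kwargs)
        self.random_state = random_state
        self._component_obj = _FakeComponent(**self.parameters)


# Defaults match the signature; extra keywords are simply passed through.
default = SketchEstimator()
custom = SketchEstimator(allow_writing_files=True, some_other_option=2)
print(default.parameters["allow_writing_files"])
print(custom._component_obj.params["some_other_option"])
```

Here `some_other_option` is a made-up keyword that only demonstrates the pass-through path.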

dsherry · 2020-09-24T22:37:44Z

@freddyaboulton yes let's do it. We probably need to change our codecov thresholds...

You want me to merge it?

freddyaboulton · 2020-09-24T22:42:10Z

@dsherry Yep, I can merge! Thank you

freddyaboulton force-pushed the tidy-up-kwargs-catboost-logistic branch from b31da9f to b21601b Compare September 18, 2020 17:51

freddyaboulton requested a review from dsherry September 18, 2020 18:04

freddyaboulton marked this pull request as ready for review September 18, 2020 18:04

freddyaboulton requested review from jeremyliweishih and angela97lin September 18, 2020 18:04

freddyaboulton force-pushed the tidy-up-kwargs-catboost-logistic branch from b21601b to d2a6ff0 Compare September 18, 2020 18:06

jeremyliweishih approved these changes Sep 18, 2020

angela97lin approved these changes Sep 18, 2020

freddyaboulton force-pushed the tidy-up-kwargs-catboost-logistic branch 3 times, most recently from f8c3e2a to 7b5b1e7 Compare September 22, 2020 20:37

freddyaboulton self-assigned this Sep 23, 2020

freddyaboulton force-pushed the tidy-up-kwargs-catboost-logistic branch 2 times, most recently from 57544c8 to c505632 Compare September 23, 2020 18:11

dsherry approved these changes Sep 24, 2020

freddyaboulton added 2 commits September 24, 2020 18:44

Adding allow_writing_files to Catboost init and solver and multiclass to logistic regression init.

ec6c0b8

Adding PR 1202 to release notes.

c91b5fd

freddyaboulton force-pushed the tidy-up-kwargs-catboost-logistic branch from c505632 to c91b5fd Compare September 24, 2020 22:44

dsherry merged commit 6b0f75d into main Sep 24, 2020

freddyaboulton deleted the tidy-up-kwargs-catboost-logistic branch September 24, 2020 22:58

angela97lin mentioned this pull request Sep 29, 2020

Release v0.14.1 #1241

Merged
