Update automl to use default of max_batches=1 #1452

dsherry · 2020-11-21T00:26:34Z

We have 8 classification estimators and 7 regression estimators now! We should use all of them by default.

Docs changes visible here.

codecov · 2020-11-24T01:09:52Z

Codecov Report

Merging #1452 (752f469) into main (ccfd9eb) will increase coverage by 0.1%.
The diff coverage is 100.0%.

@@            Coverage Diff            @@
##             main    #1452     +/-   ##
=========================================
+ Coverage   100.0%   100.0%   +0.1%     
=========================================
  Files         223      223             
  Lines       15001    15013     +12     
=========================================
+ Hits        14994    15006     +12     
  Misses          7        7

| Impacted Files | Coverage Δ | |
|---|---|---|
| evalml/automl/automl_search.py | `99.7% <100.0%> (+0.1%)` | ⬆️ |
| evalml/tests/automl_tests/test_automl.py | `100.0% <100.0%> (ø)` | |
| .../automl_tests/test_automl_search_classification.py | `100.0% <100.0%> (ø)` | |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update ccfd9eb...752f469.

freddyaboulton · 2020-11-24T16:39:41Z

docs/source/user_guide/automl.ipynb

@@ -74,7 +74,7 @@
   "source": [
    "The AutoML search will log its progress, reporting each pipeline and parameter set evaluated during the search.\n",
    "\n",
-    "There are a number of mechanisms to control the AutoML search time. One way is to set the maximum number of candidate models to be evaluated during AutoML using `max_iterations`. By default, AutoML will search a fixed number of iterations and parameter pairs (`max_iterations=5`). The first pipeline to be evaluated will always be a baseline model representing a trivial solution. "
+    "There are a number of mechanisms to control the AutoML search time. One way is to set the `max_batches` parameter which controls the maximum number of rounds of AutoML to evaluate, where each round may train and score a variable number of pipeline. Another way is to set the `max_iterations` parameter which controls the maximum number of candidate models to be evaluated during AutoML. By default, AutoML will search for a single batch. The first pipeline to be evaluated will always be a baseline model representing a trivial solution. "


Not related to this PR but when are we making _pipelines_per_batch public?

@freddyaboulton yeah good q, no plans to do so currently, let's chat at standup
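
To make the budget described in the docs change above concrete, here is a minimal sketch of the arithmetic. It is not EvalML code: both function names are hypothetical, and it assumes that the baseline is counted separately, that the first batch holds one pipeline per estimator, and that every later batch holds a fixed number of pipelines. The value `pipelines_per_batch=5` is only an illustrative stand-in for the private `_pipelines_per_batch` setting mentioned in this thread.

```python
def pipelines_in_batch_budget(n_estimators, max_batches, pipelines_per_batch=5):
    """Count pipelines evaluated under a batch budget (assumed rule, not EvalML's code)."""
    if max_batches < 1:
        raise ValueError("max_batches must be at least 1 in this sketch")
    baseline = 1  # the trivial baseline pipeline is always evaluated first
    first_batch = n_estimators  # assumption: one pipeline per estimator
    later_batches = pipelines_per_batch * (max_batches - 1)  # assumption: fixed size
    return baseline + first_batch + later_batches


def estimators_reached(n_estimators, max_iterations):
    """Count estimators that get a turn under an iteration budget, after the baseline."""
    return max(0, min(n_estimators, max_iterations - 1))


if __name__ == "__main__":
    print(pipelines_in_batch_budget(8, max_batches=1))  # classification: prints 9
    print(pipelines_in_batch_budget(7, max_batches=1))  # regression: prints 8
    print(estimators_reached(8, max_iterations=5))      # old default: prints 4
```

Under these assumptions, the old default of `max_iterations=5` would reach only 4 of the 8 classification estimators after the baseline, while a single batch reaches all of them, which matches the motivation given in the PR description.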

bchen1116

LGTM! Left a few comments and nitpicks, but nothing blocking 🦃

angela97lin

LGTM!! 🚢 ⚓

angela97lin · 2020-11-24T17:53:16Z

evalml/automl/automl_search.py

+        if not isinstance(max_time, (int, float, str, type(None))):
+            raise TypeError(f"Parameter max_time must be a float, int, string or None. Received {str(max_time)}.")
+        if isinstance(max_time, (int, float)) and max_time <= 0:
+            raise ValueError(f"Parameter max_time must be None or non-negative. Received {max_time}.")


Hmm, the ValueError message doesn't align with the check (max_time <= 0 vs non-negative)

(Same with max_batches and max_iterations)

Maybe 'strictly positive' rather than 'non-negative'?

Good point. I'll change the boundary conditions here to be non-negative and if we wanna change this in the future we can!
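
A minimal sketch of what the agreed resolution could look like, with the check relaxed so that it matches the "non-negative" wording of the message. The standalone helper `validate_max_time` is hypothetical and exists only for illustration; the code that was actually merged may differ.

```python
def validate_max_time(max_time):
    """Validate a max_time-style argument (hypothetical helper, not EvalML's API)."""
    if not isinstance(max_time, (int, float, str, type(None))):
        raise TypeError(
            f"Parameter max_time must be a float, int, string or None. Received {max_time}."
        )
    # Boundary now agrees with the message: zero is allowed, negatives are rejected.
    if isinstance(max_time, (int, float)) and max_time < 0:
        raise ValueError(
            f"Parameter max_time must be None or non-negative. Received {max_time}."
        )
    return max_time


if __name__ == "__main__":
    for candidate in (None, 0, 2.5, "10 minutes", -1, []):
        try:
            validate_max_time(candidate)
            print(f"{candidate!r}: accepted")
        except (TypeError, ValueError) as err:
            print(f"{candidate!r}: rejected ({type(err).__name__})")
```

The alternative raised in the review, keeping `<= 0` and changing the message to say "strictly positive", would be equally consistent; the point is only that the check and the message should describe the same boundary.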

freddyaboulton

Looks good to me @dsherry !

dsherry force-pushed the ds_update_max_batches_default branch from 2379738 to 276defe, November 24, 2020 01:02

dsherry marked this pull request as ready for review November 24, 2020 01:15

dsherry added the documentation and enhancement labels and removed the documentation label Nov 24, 2020

dsherry self-assigned this Nov 24, 2020

dsherry force-pushed the ds_update_max_batches_default branch from 9b41693 to aac9988, November 24, 2020 16:34

dsherry requested review from freddyaboulton, angela97lin, christopherbunn, bchen1116 and ParthivNaresh November 24, 2020 16:35

freddyaboulton reviewed Nov 24, 2020

dsherry force-pushed the ds_update_max_batches_default branch from f74ab35 to afae52e, November 24, 2020 17:18

dsherry requested a review from freddyaboulton November 24, 2020 17:35

bchen1116 approved these changes Nov 24, 2020

angela97lin approved these changes Nov 24, 2020

angela97lin reviewed Nov 24, 2020

evalml/tests/automl_tests/test_automl.py (outdated)

dsherry added 10 commits November 24, 2020 12:56

Update automl to use default of max_batches=1

6de93b3

Fix max pipeline tests

8769204

Fix test_max_batches_works

82be306

Fix unit tests

082c803

Update docs

8de59ef

Changelog

d07f994

Lint

1a06cd0

Codecov

444fc48

Lint

cfeeea0

PR comments

8d14e00

dsherry added 2 commits November 24, 2020 13:00

Reset test value, not needed

8f53ba5

Change boundary conditions

06f826e

dsherry force-pushed the ds_update_max_batches_default branch from afae52e to 06f826e, November 24, 2020 18:01

freddyaboulton approved these changes Nov 24, 2020

Fix broken tests

752f469

dsherry merged commit e10cb02 into main Nov 24, 2020

dsherry mentioned this pull request Nov 24, 2020

Release v0.16.0 #1468

Merged

freddyaboulton deleted the ds_update_max_batches_default branch May 13, 2022 14:58
