Remove recall objective #784

Merged: 15 commits merged into master from 476-remove_recall_objective on May 21, 2020
Conversation

@eccabay (Contributor) commented May 19, 2020

Closes #476: removes all the recall objectives from the OPTIONS in objectives/utils.py. The classes still exist in objectives/standard_metrics.py, but all uses of any recall classes in tests or docs are removed or replaced.
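
For orientation, here is a hedged sketch of what the change to objectives/utils.py might look like: the recall entries drop out of the OPTIONS registry, so name-based lookup fails, while the Recall class itself stays importable from standard_metrics.py. The neighboring entries are illustrative placeholders, not the actual contents of the file.

# Sketch of evalml/objectives/utils.py after this PR; every entry other
# than the removed recall line is a hypothetical placeholder.
from .standard_metrics import F1, Precision  # Recall is intentionally not registered

OPTIONS = {
    'f1': F1(),
    'precision': Precision(),
    # 'recall': Recall(),  # removed: objective='recall' now raises
    #                      # ObjectiveNotFoundError (see the test later in this thread)
}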

@eccabay linked issue #476 on May 19, 2020 that may be closed by this pull request
codecov bot commented May 19, 2020

Codecov Report

Merging #784 into master will increase coverage by 0.00%.
The diff coverage is 100.00%.


@@           Coverage Diff           @@
##           master     #784   +/-   ##
=======================================
  Coverage   99.40%   99.40%           
=======================================
  Files         150      150           
  Lines        5578     5588   +10     
=======================================
+ Hits         5545     5555   +10     
  Misses         33       33           
Impacted Files Coverage Δ
evalml/objectives/utils.py 100.00% <ø> (ø)
evalml/tests/automl_tests/test_autobase.py 100.00% <ø> (ø)
...ts/automl_tests/test_auto_classification_search.py 100.00% <100.00%> (ø)
evalml/tests/objective_tests/test_objectives.py 96.55% <100.00%> (ø)
...ation_pipeline_tests/test_binary_classification.py 100.00% <100.00%> (ø)
evalml/tests/pipeline_tests/test_pipelines.py 99.63% <100.00%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2f26183...b9a9ebe.

docs/source/changelog.rst: review thread resolved (outdated)
@dsherry (Contributor) left a comment:
@eccabay this is looking great! I left a few comments. My main request is that we add a unit test which ensures we can't use recall as an objective in the automl search. Check out this test which is a basic example of how to create an automl search with a particular objective, and this test which is an example of how to expect errors.

automl = AutoClassificationSearch(objective=Recall(), max_pipelines=1)
automl.search(X, y)
assert isinstance(automl.objective, Recall)
assert automl.best_pipeline.threshold is not None
@dsherry (Contributor) commented:
Why add this check? We set binary classification thresholds to 0.5 here, for all binary classification problems, so I'm not sure this check is the right thing to do here.

@eccabay (Contributor, Author) replied:
I wasn't sure what aspects to check for this test, so I just added elements I saw in other tests. I can definitely remove this if it's unnecessary.

@dsherry (Contributor) replied:
Got it. Yeah, I'd ask the following: what is this test trying to cover? I think it's checking that we can pass an instance of the recall objective into automl search. In that case, all we really need to do is run the search and make sure there aren't errors. If that seems correct, perhaps all the other lines can be deleted.

X, y = X_y
error_msg = 'Could not find the specified objective.'
with pytest.raises(ObjectiveNotFoundError, match=error_msg):
AutoClassificationSearch(objective='recall', max_pipelines=1)
@dsherry (Contributor) commented:
Awesome, this test looks good.

@dsherry (Contributor) left a comment:
This is getting close! I left another comment about that second unit test. Other than that, this is good to go, and I'll approve once it's updated.

def test_recall_object(X_y):
X, y = X_y
automl = AutoClassificationSearch(objective=Recall(), max_pipelines=1)
automl.search(X, y)
@dsherry (Contributor) commented:
Looks good, thanks.

Have you ever used test mocking? Unfortunately we don't have a comprehensive pattern in place, but if you search through the auto* tests you'll see some usages of the patch decorator from unittest.mock. In particular, for some automl search tests, we're able to mock the pipelines' fit and predict, or the objective's score. This can be nice because a) most of the time spent during search goes to fitting the pipelines, and making that a no-op saves time, and b) mocking the output of predict and score means we can test for specific edge cases.

In this particular case, I wonder if a good thing to do would be to mock BinaryClassificationPipeline.fit to be a no-op, mock Recall.score to return a predetermined number (say 0.314159), and then mock BinaryClassificationPipeline.predict to return zeros (of the same length as the input) to prevent errors.

You don't need to do that in this PR. But I wanted to throw it out there as food for thought for the future. We're gradually trying to transition most of our automl search tests to use mocking, because it saves a lot of unnecessary unit testing runtime.
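
For future reference, a mocked version of this test along the lines described above might look like the sketch below. The patch targets (the module paths for BinaryClassificationPipeline and Recall) and the use of unittest.mock.patch are assumptions about evalml's layout at the time, not code from this PR.

# Hedged sketch of the mocked test described above; patch targets are
# assumed module paths, not confirmed against this version of evalml.
from unittest.mock import patch

import numpy as np

from evalml import AutoClassificationSearch
from evalml.objectives import Recall

@patch('evalml.pipelines.BinaryClassificationPipeline.fit')      # no-op fit: skips training
@patch('evalml.objectives.Recall.score', return_value=0.314159)  # predetermined score
@patch('evalml.pipelines.BinaryClassificationPipeline.predict')
def test_recall_object_mocked(mock_predict, mock_score, mock_fit, X_y):
    X, y = X_y
    # predict returns zeros of the same length as the input, so code
    # downstream of the (never actually fitted) pipeline has valid output
    mock_predict.return_value = np.zeros(len(y))
    automl = AutoClassificationSearch(objective=Recall(), max_pipelines=1)
    automl.search(X, y)  # runs quickly: fit never trains anything
    mock_fit.assert_called()
    mock_score.assert_called()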

@@ -8,11 +8,13 @@

from evalml import AutoClassificationSearch
from evalml.automl.pipeline_search_plots import SearchIterationPlot
from evalml.exceptions import ObjectiveNotFoundError
@dsherry (Contributor) commented:
Good find using this!

@dsherry (Contributor) left a comment:
Well done! 👏 😁 Looks great. Approved pending resolution of conflicts and green checkin tests.

@dsherry mentioned this pull request May 21, 2020
@eccabay merged commit 729dcee into master May 21, 2020
@eccabay deleted the 476-remove_recall_objective branch May 21, 2020 18:10
Linked issue closed by this pull request: #476, Disallow recall as an automl objective.