Bin class pipeline: use predictions for "true" class in score #798

dsherry · 2020-05-22T14:26:05Z

Fixes #797, a bug introduced in #787 . The problem is that binary classification pipelines are no longer taking the "true" class from the predicted probs and passing that into the score math.

dsherry · 2020-05-22T14:26:34Z

evalml/tests/pipeline_tests/test_pipelines.py

+def test_score_auc(X_y, lr_pipeline):
+    X, y = X_y
+    lr_pipeline.fit(X, y)
+    lr_pipeline.score(X, y, ['auc'])


This was my reproducer. I will expand on this coverage before merging.

You know what, I'd like to merge this now to unblock master and then get another PR up with more coverage later today.

codecov · 2020-05-22T14:28:01Z

Codecov Report

Merging #798 into master will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master     #798   +/-   ##
=======================================
  Coverage   99.51%   99.51%           
=======================================
  Files         150      150           
  Lines        5718     5727    +9     
=======================================
+ Hits         5690     5699    +9     
  Misses         28       28

Impacted Files	Coverage Δ
evalml/pipelines/binary_classification_pipeline.py	`100.00% <100.00%> (ø)`
evalml/tests/pipeline_tests/test_pipelines.py	`99.74% <100.00%> (+<0.01%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8b1024a...c226873. Read the comment docs.

dsherry · 2020-05-22T14:28:52Z

evalml/pipelines/binary_classification_pipeline.py

+        """
+        if predictions.ndim > 1:
+            predictions = predictions[:, 1]
+        return ClassificationPipeline._score(X, y, predictions, objective)


@angela97lin @kmax12 what do you think of this solution?

Pros: it fixes the bug. And it keeps the binary-classification-specific code in the binary classification pipeline definition.
Cons: there may be a cleaner way to organize this. For example, we do the same indexing in BinaryClassificationPipeline.predict above, and ideally perhaps we'd have one method for computing this. But idk if its worth messing with that right now.

I think I prefer this! It makes more sense to me to do it here since after all, we just need this indexing for score, so predict shouldn't need to handle it.

would it be better for this to be PipelineBase._score since that is where it is actually defined?

i think this organization is fine. my only thought is that ClassificationPipeline._score should be a utility rather than a static method. it just feels off

or maybe even define a ObjectiveBase.safe_score method that has this behavior

Thanks @angela97lin @kmax12 !

@kmax12 , I agree this doesn't feel ideal yet. And yes, perhaps moving this functionality to a util or to the objectives would make more sense, I like those ideas.

I'll plan to update the tests and merge this fix, and then we can circle back and put something better in place later. This doesn't alter our public API so we have flexibility.

First attempt at a fix

a155430

dsherry requested review from kmax12 and angela97lin May 22, 2020 14:26

dsherry commented May 22, 2020

View reviewed changes

auto-assign bot assigned dsherry May 22, 2020

dsherry commented May 22, 2020

View reviewed changes

Changelog

c226873

angela97lin approved these changes May 22, 2020

View reviewed changes

dsherry merged commit 7786dd2 into master May 22, 2020

dsherry deleted the ds_797_fix_auc_score branch May 22, 2020 16:42

angela97lin mentioned this pull request May 29, 2020

Release v0.10.0 May 29, 2020 #822

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bin class pipeline: use predictions for "true" class in score #798

Bin class pipeline: use predictions for "true" class in score #798

dsherry commented May 22, 2020

dsherry May 22, 2020

dsherry May 22, 2020

codecov bot commented May 22, 2020 •

edited

Loading

dsherry May 22, 2020

angela97lin May 22, 2020

kmax12 May 22, 2020

kmax12 May 22, 2020 •

edited

Loading

dsherry May 22, 2020

Bin class pipeline: use predictions for "true" class in score #798

Bin class pipeline: use predictions for "true" class in score #798

Conversation

dsherry commented May 22, 2020

dsherry May 22, 2020

Choose a reason for hiding this comment

dsherry May 22, 2020

Choose a reason for hiding this comment

codecov bot commented May 22, 2020 • edited Loading

Codecov Report

dsherry May 22, 2020

Choose a reason for hiding this comment

angela97lin May 22, 2020

Choose a reason for hiding this comment

kmax12 May 22, 2020

Choose a reason for hiding this comment

kmax12 May 22, 2020 • edited Loading

Choose a reason for hiding this comment

dsherry May 22, 2020

Choose a reason for hiding this comment

codecov bot commented May 22, 2020 •

edited

Loading

kmax12 May 22, 2020 •

edited

Loading