Move ROC and confusion matrix to pipeline plot utils (#696) #704

dsherry · 2020-04-23T19:47:41Z

Preface: this is a clone of PR #696 -- I merged that to master this morning, but ran into RTD timeouts (described in #702). Reset master and now trying again.

Fix #639 #365 #392 #427 #608 #610 #620

Prior to #346 , plot data functions like ROC and confusion matrix were treated as objectives. #346 and follow-on work separated them from objectives, and made it so that these plot data functions aren't precomputed during automl. We then filed epic #639 to figure out where to put the code for plot data functions like ROC and confusion matrix.

This PR has the following changes:

Move ROC and confusion matrix functions to evalml/pipelines/plot_utils.py as standalone functions which take actual vs predicted
Delete plot_metrics input from automl, and delete plot_data field from automl results, because those were made unnecessary when we took ROC/confusion out of automl
Added a couple more gen_utils methods to the API docs (like import_or_raise)
Delete MSLE objective for codecov -- it was disabled with a comment
Add a couple misc tests to satisfy codecov.

Future work:

Add utils for displaying ROC and confusion matrix plots (deleted from here in Remove calculating plot metrics from AutoML #615) : tracked by Add methods to plot ROC and confusion matrix #697
Later, we could choose to move all of these methods into the pipeline API, like ClassificationPipeline.confusion_matrix, ClassificationPipeline.plot_confusion_matrix, BinaryClassificationPipeline.roc_curve, BinaryClassificationPipeline.plot_roc_curve. This would make it convenient to get plot data directly from pipelines

codecov · 2020-04-23T19:49:57Z

Codecov Report

Merging #704 into master will increase coverage by 0.10%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #704      +/-   ##
==========================================
+ Coverage   99.09%   99.19%   +0.10%     
==========================================
  Files         139      140       +1     
  Lines        4954     4952       -2     
==========================================
+ Hits         4909     4912       +3     
+ Misses         45       40       -5

Impacted Files	Coverage Δ
evalml/objectives/__init__.py	`100.00% <ø> (ø)`
evalml/objectives/standard_metrics.py	`100.00% <ø> (+0.43%)`	⬆️
evalml/objectives/utils.py	`100.00% <ø> (+5.55%)`	⬆️
evalml/tests/utils_tests/test_gen_utils.py	`100.00% <ø> (ø)`
evalml/utils/gen_utils.py	`100.00% <ø> (+1.69%)`	⬆️
evalml/automl/auto_base.py	`97.05% <100.00%> (+0.63%)`	⬆️
evalml/automl/auto_classification_search.py	`100.00% <100.00%> (ø)`
evalml/pipelines/__init__.py	`100.00% <100.00%> (ø)`
evalml/pipelines/plot_utils.py	`100.00% <100.00%> (ø)`
evalml/tests/automl_tests/test_autobase.py	`100.00% <100.00%> (ø)`
... and 7 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update fe7e929...87f2f2e. Read the comment docs.

dsherry · 2020-04-23T20:24:38Z

All checkin tests are green. The RTD build is green.

Because of #702, before I merge this, I will try to build this locally using the RTD docker container. I'll also rekick the job on RTD a couple times. I'm not sure what else to try!

dsherry · 2020-04-23T21:13:58Z

I rekicked the RTD job for this branch again:

Run 1
Run 2

I also rekicked latest:

Run 1
Run 2

All passed with flying colors.

I also followed the instructions on #586 to use the RTD docker container to build the docs locally. That succeeded without any issues. I was able to view the fresh html files and they looked fine.

* Impl * Rename file * Update test * Forgot to add the new file * Fix imports and lint * Fix test * Changelog * Fix api docs * Add missing markdown descr. Delete some ipynb elements which were failing validation * Update docstrings * Delete MSLE for codecov * Increase test coverage for codecov * Disallow unsupported options for normalize_confusion_matrix * Add test coverage for get_objective w\ nonetype (for codecov) * Update docstring * Remove unnecessary raise in test mock * Update test. * Update docstring

angela97lin

Hope it works this time!

dsherry · 2020-04-24T14:10:06Z

I hope so too!

I just rebased. I'll watch RTD pass once more, and then merge.

dsherry · 2020-04-24T14:27:37Z

The RTD build passed and the docs look good. Merging.

dsherry · 2020-04-24T14:47:12Z

Latest RTD passed on master!

dsherry marked this pull request as ready for review April 23, 2020 20:22

auto-assign bot assigned dsherry Apr 23, 2020

dsherry requested review from jeremyliweishih and angela97lin April 23, 2020 20:23

dsherry added 2 commits April 24, 2020 01:27

Update changelog with new PR number

87f2f2e

angela97lin approved these changes Apr 24, 2020

View reviewed changes

dsherry force-pushed the ds_639_clean_up_roc_confusion_take2 branch from 9a64d36 to 87f2f2e Compare April 24, 2020 14:09

dsherry merged commit b12e5fa into master Apr 24, 2020

dsherry deleted the ds_639_clean_up_roc_confusion_take2 branch April 24, 2020 14:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move ROC and confusion matrix to pipeline plot utils (#696) #704

Move ROC and confusion matrix to pipeline plot utils (#696) #704

dsherry commented Apr 23, 2020

codecov bot commented Apr 23, 2020 •

edited

dsherry commented Apr 23, 2020 •

edited

dsherry commented Apr 23, 2020

angela97lin left a comment

dsherry commented Apr 24, 2020

dsherry commented Apr 24, 2020

dsherry commented Apr 24, 2020

Move ROC and confusion matrix to pipeline plot utils (#696) #704

Move ROC and confusion matrix to pipeline plot utils (#696) #704

Conversation

dsherry commented Apr 23, 2020

codecov bot commented Apr 23, 2020 • edited

Codecov Report

dsherry commented Apr 23, 2020 • edited

dsherry commented Apr 23, 2020

angela97lin left a comment

Choose a reason for hiding this comment

dsherry commented Apr 24, 2020

dsherry commented Apr 24, 2020

dsherry commented Apr 24, 2020

codecov bot commented Apr 23, 2020 •

edited

dsherry commented Apr 23, 2020 •

edited