Add Plot for Prediction vs Actual for Regression Problems #1252

bchen1116 · 2020-10-01T17:58:07Z

Added outlier_threshold for simple outlier detection

Updated Model_Understanding doc here

Regular documentation here

codecov · 2020-10-01T18:04:10Z

Codecov Report

Merging #1252 into main will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##             main    #1252   +/-   ##
=======================================
  Coverage   99.93%   99.93%           
=======================================
  Files         207      207           
  Lines       13055    13142   +87     
=======================================
+ Hits        13046    13133   +87     
  Misses          9        9

Impacted Files	Coverage Δ
evalml/model_understanding/__init__.py	`100.00% <ø> (ø)`
evalml/model_understanding/graphs.py	`100.00% <100.00%> (ø)`
...lml/tests/model_understanding_tests/test_graphs.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4ae3207...68e882d. Read the comment docs.

bchen1116 · 2020-10-01T20:01:26Z

evalml/model_understanding/graphs.py

+    data = pd.concat([pd.Series(predictions),
+                      pd.Series(actual)], axis=1)
+    data.columns = ['prediction', 'actual']
+    data['outlier'] = np.where((abs(data['prediction'] - data['actual']) >= outlier_threshold), "#ffff00", "#0000ff")


needed to encode colors as hex values

angela97lin

Looking good but I have two immediate comments:

It would be helpful to post an image of what this looks like or add it to the model_understanding docs and link to that instead!
I see in the original issue (Add predicted vs actual plot (regression) #772) that we want to support regression and timeseries data. Not sure what the updated requirements were but if this PR adds just for regression (which is fine), could you please file a separate issue to make sure timeseries data doesn't get dropped?

evalml/model_understanding/graphs.py

freddyaboulton

@bchen1116 Looks good! I have two minor comments!

evalml/model_understanding/graphs.py

freddyaboulton

@bchen1116 This looks great!

evalml/tests/model_understanding_tests/test_graphs.py

changes addressed

bchen1116 · 2020-10-05T15:49:15Z

Submitted issue #1258 to handle adding plot for timeseries.

initial implementation

b0351be

bchen1116 self-assigned this Oct 1, 2020

bchen1116 added 2 commits October 1, 2020 13:58

update release notes

76d5f94

Merge branch 'main' into bc_772_plot

0929ed6

bchen1116 added 3 commits October 1, 2020 15:13

fix test

9a1ebd8

lint

1f5d9c7

update api refs

14963a9

bchen1116 marked this pull request as ready for review October 1, 2020 19:58

bchen1116 commented Oct 1, 2020

View reviewed changes

bchen1116 requested review from freddyaboulton, angela97lin, dsherry, christopherbunn, eccabay and jeremyliweishih and removed request for freddyaboulton October 1, 2020 20:01

angela97lin previously requested changes Oct 1, 2020

View reviewed changes

evalml/model_understanding/graphs.py Outdated Show resolved Hide resolved

bchen1116 added 3 commits October 1, 2020 16:39

update model understanding doc

0e3b72a

fix ipynb format

2ef3a1a

linting

7cc5aa0

bchen1116 requested a review from angela97lin October 1, 2020 21:29

freddyaboulton approved these changes Oct 2, 2020

View reviewed changes

evalml/model_understanding/graphs.py Show resolved Hide resolved

evalml/model_understanding/graphs.py Outdated Show resolved Hide resolved

bchen1116 and others added 5 commits October 2, 2020 13:16

default threshold to none

fadc77b

fix release notes

9d4b601

fix release notes

9205aea

fix model understanding notebook

2e86399

Merge branch 'main' into bc_772_plot

378dc5b

bchen1116 requested a review from freddyaboulton October 2, 2020 19:02

bchen1116 added 4 commits October 2, 2020 15:24

fix doc

5f03a5d

fix release notes

87e861c

Merge branch 'bc_772_plot' of github.com:alteryx/evalml into bc_772_plot

3010dd2

fix docs

1f1e1b1

freddyaboulton approved these changes Oct 5, 2020

View reviewed changes

evalml/tests/model_understanding_tests/test_graphs.py Show resolved Hide resolved

bchen1116 added 2 commits October 5, 2020 11:01

update test

bfe3c98

fix tests

f0338a8

Merge branch 'main' into bc_772_plot

68e882d

bchen1116 mentioned this pull request Oct 5, 2020

Add predicted-vs-actual plot for timeseries regression #1258

Closed

bchen1116 merged commit bd04b00 into main Oct 5, 2020

dsherry mentioned this pull request Oct 29, 2020

Release v0.15.0 #1370

Merged

freddyaboulton deleted the bc_772_plot branch May 13, 2022 14:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Plot for Prediction vs Actual for Regression Problems #1252

Add Plot for Prediction vs Actual for Regression Problems #1252

bchen1116 commented Oct 1, 2020 •

edited

Loading

codecov bot commented Oct 1, 2020 •

edited

Loading

bchen1116 Oct 1, 2020

angela97lin left a comment

freddyaboulton left a comment

freddyaboulton left a comment

bchen1116 commented Oct 5, 2020

Add Plot for Prediction vs Actual for Regression Problems #1252

Add Plot for Prediction vs Actual for Regression Problems #1252

Conversation

bchen1116 commented Oct 1, 2020 • edited Loading

codecov bot commented Oct 1, 2020 • edited Loading

Codecov Report

bchen1116 Oct 1, 2020

Choose a reason for hiding this comment

angela97lin left a comment

Choose a reason for hiding this comment

freddyaboulton left a comment

Choose a reason for hiding this comment

freddyaboulton left a comment

Choose a reason for hiding this comment

bchen1116 commented Oct 5, 2020

bchen1116 commented Oct 1, 2020 •

edited

Loading

codecov bot commented Oct 1, 2020 •

edited

Loading