-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Plot for Prediction vs Actual for Regression Problems #1252
Conversation
Codecov Report
@@ Coverage Diff @@
## main #1252 +/- ##
=======================================
Coverage 99.93% 99.93%
=======================================
Files 207 207
Lines 13055 13142 +87
=======================================
+ Hits 13046 13133 +87
Misses 9 9
Continue to review full report at Codecov.
|
evalml/model_understanding/graphs.py
Outdated
data = pd.concat([pd.Series(predictions), | ||
pd.Series(actual)], axis=1) | ||
data.columns = ['prediction', 'actual'] | ||
data['outlier'] = np.where((abs(data['prediction'] - data['actual']) >= outlier_threshold), "#ffff00", "#0000ff") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
needed to encode colors as hex values
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking good but I have two immediate comments:
- It would be helpful to post an image of what this looks like or add it to the model_understanding docs and link to that instead!
- I see in the original issue (Add predicted vs actual plot (regression) #772) that we want to support regression and timeseries data. Not sure what the updated requirements were but if this PR adds just for regression (which is fine), could you please file a separate issue to make sure timeseries data doesn't get dropped?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@bchen1116 Looks good! I have two minor comments!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@bchen1116 This looks great!
Submitted issue #1258 to handle adding plot for timeseries. |
fix #772
Added
outlier_threshold
for simple outlier detectionUpdated Model_Understanding doc here
Regular documentation here