Update User Guide with Decision Tree Visualization #1678

Merged
merged 11 commits into from Jan 25, 2021

Conversation

@ParthivNaresh ParthivNaresh commented Jan 12, 2021

Fixes #1541

@ParthivNaresh ParthivNaresh self-assigned this Jan 12, 2021
codecov bot commented Jan 12, 2021

Codecov Report

Merging #1678 (e5da81c) into main (1f8779b) will not change coverage.
The diff coverage is 100.0%.

@@           Coverage Diff           @@
##             main    #1678   +/-   ##
=======================================
  Coverage   100.0%   100.0%           
=======================================
  Files         242      242           
  Lines       19273    19273           
=======================================
  Hits        19265    19265           
  Misses          8        8           
Impacted Files | Coverage Δ
evalml/model_understanding/__init__.py | 100.0% <100.0%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 1f8779b...e5da81c.

@ParthivNaresh ParthivNaresh marked this pull request as ready for review January 22, 2021 16:26
roc_curve,
graph_roc_curve,
graph_confusion_matrix,
binary_objective_vs_threshold,
Contributor Author

Reordered imports alphabetically

Contributor

Did flake8 recommend this?

Contributor Author

@freddyaboulton Yeah, it weirdly did, and it came out of nowhere! One day it wasn't recommending it and the next it was. Correct me if I'm wrong, but it makes sense as far as alphabetization is concerned, right?

Contributor

Sorting alphabetically makes it look tidier! But your comment that it went from not recommending it one day to recommending it the next makes me think your version of flake8 got upgraded. Locally, my make lint passed before this change got merged.
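
For reference, the ordering question in this thread can be checked with a few lines of plain Python. This is only a minimal sketch: the four names are copied from the diff context quoted above, `is_alphabetized` is a hypothetical helper, and it is not the check the linter performs (the thread does not say which flake8 rule or plugin flagged the order).

```python
# Names copied from the diff context quoted in this thread.
names = [
    "roc_curve",
    "graph_roc_curve",
    "graph_confusion_matrix",
    "binary_objective_vs_threshold",
]


def is_alphabetized(items):
    """Return True when items are already in ascending alphabetical order.

    Hypothetical helper for illustration; not part of flake8 or evalml.
    """
    return list(items) == sorted(items)


# Plain lexicographic sort of the imported names.
sorted_names = sorted(names)

print(is_alphabetized(names))         # order as quoted in the diff context
print(is_alphabetized(sorted_names))  # order after sorting
print(sorted_names)
```

A plain lexicographic sort is enough here because every name is lowercase; mixed-case names would need an explicit sort key.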

"cell_type": "markdown",
"metadata": {},
"source": [
"Now let's make the pipeline more complex by replacing the Tree with a Forest."
Contributor Author

Instead of replacing the entire pipeline, which would have reduced performance and produced some uglier graphs, I chose to start with a DecisionTree and then change it.

@freddyaboulton freddyaboulton left a comment

@ParthivNaresh Looks great!!

@@ -433,6 +473,13 @@
" include_shap_values=True, output_format=\"dict\")\n",
"print(json.dumps(report, indent=2))"
]
},
{
"cell_type": "code",
Contributor

nit-pick: Maybe we should get rid of the empty cell?

@angela97lin angela97lin left a comment

"cell_type": "markdown",
"metadata": {},
"source": [
"Now let's make the pipeline more complex by replacing the Tree with a Forest."
Contributor

Bahaha nice! Can we update this to be more specific, e.g. replacing the Decision Tree pipeline with a Random Forest pipeline or something?

Contributor

Also... rather than just stating we'll make it more complex (because why?), maybe we should just introduce it in the next part? As a user, I don't understand what the point is :P

@angela97lin angela97lin self-requested a review January 22, 2021 19:50
@bchen1116 bchen1116 left a comment

Doc changes look good!

@ParthivNaresh ParthivNaresh merged commit 0577e42 into main Jan 25, 2021
@bchen1116 bchen1116 mentioned this pull request Jan 26, 2021
@freddyaboulton freddyaboulton deleted the 1541-Add-Decision-Tree-User-Guide branch May 13, 2022 15:17
Linked issue: Add decision tree visualization to model understanding user guide
5 participants