Third Party Estimator Wrapper #1103

bbengfort · 2020-10-03T11:17:11Z

Adds a wrapper for estimators that implement the scikit-learn API but do not extend BaseEstimator. If the estimator is missing required properties (generally the learned attributes) then a sensible error is raised. Includes documentation about how to use non-sklearn estimators with Yellowbrick.

Fixes #1098
Fixes #1099
Fixes #397
Related to #1066

I have made the following changes:

Added a ContribEstimator wrapper and helper functions to wrap cuML, catboost, neuraxle etc estimators
Wrapper allows access to estimator functions, raising sensible error if required attr is not available
Wrapper allows model type checking e.g. is_classifier
Adapted our is_estimator and get_model_name helpers to use the ContribEstimator
Removed recursive imports from yellowbrick/contrib/__init__.py
Wrote tests for wrapper library
Tested wrapper with cuML and catboost
Wrote documentation about how to use third party estimators with Yellowbrick

Still to do:

Official tests with cuML and catboost
Finish documentation

CHECKLIST

Is the commit message formatted correctly?
Have you noted the new functionality/bugfix in the release notes of the next release?

~~Included a sample plot to visually illustrate your changes?~~
Do all of your functions and methods have docstrings?
Have you added/updated unit tests where appropriate?
~~Have you updated the baseline images if necessary?~~
Have you run the unit tests using pytest?
Is your code style correct (are you using PEP8, pyflakes)?
Have you documented your new feature/functionality in the docs?

Have you built the docs using make html?

Adds a wrapper for estimators that implement the scikit-learn API but do not extend BaseEstimator. If the estimator is missing required properties (generally the learned attributes) then a sensible error is raised. Includes documentation about how to use non-sklearn estimators with Yellowbrick. Fixes #1098 Fixes #1099 Related to #397 Related to #1066

yellowbrick/contrib/__init__.py

yellowbrick/contrib/wrapper.py

rebeccabilbro

Looks great @bbengfort; thank you for taking this on (especially given it's been on our wishlist for >2 years now!) I've noted just a handful of minor comments inline.

One other thought — I think that this PR should close #397 as well as #1098 and #1099; if there are specific libraries that folks want to add support for, I propose opening those as separate issues (e.g. #1066), given the potential for variations in external API behaviors, which may be easier to tackle individually (that odd catboost thing you noticed, for instance).

Once we've merged this, I might do a small follow-on PR to update our FAQ page, since this would be a great new addition!

docs/api/contrib/wrapper.rst

tests/test_contrib/test_wrapper.py

yellowbrick/contrib/wrapper.py

rebeccabilbro · 2020-10-05T13:56:32Z

yellowbrick/utils/helpers.py

+    if isinstance(model, Pipeline):
+        return get_model_name(model.steps[-1][-1])
+    elif isinstance(model, ContribEstimator):
+        return model.estimator.__class__.__name__
    else:
-        if isinstance(model, Pipeline):
-            return get_model_name(model.steps[-1][-1])
-        else:
-            return model.__class__.__name__
+        return model.__class__.__name__


Do we think this change needs a new test?

Model and Pipeline are tested in tests/test_utils/test_helpers.py lines 51-86 and ContribEstimator in tests/test_contrib/test_wrapper.py line 96; so the function is completely covered - but would you prefer that all of these tests were in one place?

bbengfort · 2020-10-05T18:16:55Z

@rebeccabilbro adding this to the FAQ would be great, thanks! Closing #397 works for me- I'll add a couple of issues for libraries that I think we should cover; possibly as spike issues so that interested folks can take them on one by one. Let me know about the tests and I can merge it in; or feel free to merge when you're ready!

bbengfort · 2020-10-05T19:23:19Z

@rebeccabilbro Created #1107 #1106 and #1105 in response to your suggestion about closing #397

bbengfort commented Oct 3, 2020

View reviewed changes

yellowbrick/contrib/__init__.py Show resolved Hide resolved

bbengfort commented Oct 3, 2020

View reviewed changes

yellowbrick/contrib/wrapper.py Show resolved Hide resolved

bbengfort added 2 commits October 3, 2020 08:47

add xgboost and catboost tests

bc9e46a

documentation

1499a1f

bbengfort changed the title ~~[WIP] Third Party Estimator Wrapper~~ Third Party Estimator Wrapper Oct 3, 2020

bbengfort requested a review from rebeccabilbro October 3, 2020 14:15

This was referenced Oct 3, 2020

Not supporting model of Catboost #1099

Closed

Non-sklearn dependent estimator check #1098

Closed

rebeccabilbro approved these changes Oct 5, 2020

View reviewed changes

fix typos

cb23542

bbengfort mentioned this pull request Oct 5, 2020

Visual ATM Model Report #1105

Open

3 tasks

rebeccabilbro merged commit 1329281 into DistrictDataLabs:develop Oct 5, 2020

bbengfort mentioned this pull request Oct 5, 2020

Keras contrib module #1106

Open

3 tasks

bbengfort deleted the contrib-wrapper branch October 5, 2020 19:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Third Party Estimator Wrapper #1103

Third Party Estimator Wrapper #1103

bbengfort commented Oct 3, 2020 •

edited

rebeccabilbro left a comment

rebeccabilbro Oct 5, 2020

bbengfort Oct 5, 2020

bbengfort commented Oct 5, 2020

bbengfort commented Oct 5, 2020

Third Party Estimator Wrapper #1103

Third Party Estimator Wrapper #1103

Conversation

bbengfort commented Oct 3, 2020 • edited

CHECKLIST

rebeccabilbro left a comment

Choose a reason for hiding this comment

rebeccabilbro Oct 5, 2020

Choose a reason for hiding this comment

bbengfort Oct 5, 2020

Choose a reason for hiding this comment

bbengfort commented Oct 5, 2020

bbengfort commented Oct 5, 2020

bbengfort commented Oct 3, 2020 •

edited