Request for inclusion for sklearn-evaluation #13

Open
6 tasks done
edublancas opened this issue Jan 6, 2017 · 5 comments

Comments

@edublancas

edublancas commented Jan 6, 2017

Request for project inclusion in scikit-learn-contrib

  • Project name: sklearn-evaluation
  • Project description: Model evaluation (mostly plots for now) and HTML report generation
  • Authors: Eduardo Blancas Reyes
  • Current repository: https://github.com/edublancas/sklearn-evaluation
  • Requirements:
      • scikit-learn compatible (check_estimator passed) - does not apply
      • Documentation (guide, API reference, example gallery)
      • Unit tests (coverage: 94%)
      • Python3 compatible
      • PEP8 compliant
      • Continuous integration

Longer description: the motivation behind sklearn-evaluation is to provide an easy way to evaluate models. The package includes a set of plots for evaluating classifiers (many of them adapted from the sklearn docs), such as confusion matrix, ROC, and Precision-Recall. It also has some other useful plots, like one for evaluating grid search results.
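For readers unfamiliar with the first of those plots, here is a minimal sketch of the counts a confusion matrix plot is built from. It uses only the standard library and is not sklearn-evaluation's code; the function name `confusion_counts` and the sample labels are made up for illustration.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, labels):
    """Return a matrix where row i, column j counts samples whose true
    label is labels[i] and whose predicted label is labels[j].

    Hypothetical helper for illustration, not part of any library.
    """
    pair_counts = Counter(zip(y_true, y_pred))
    return [[pair_counts[(t, p)] for p in labels] for t in labels]


# Made-up labels, just to show the shape of the result
y_true = ["cat", "dog", "cat", "dog", "cat"]
y_pred = ["cat", "cat", "cat", "dog", "dog"]
matrix = confusion_counts(y_true, y_pred, labels=["cat", "dog"])
```

A plotting function would then render `matrix` as a heatmap, with true labels on one axis and predicted labels on the other.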

The second part is report generation, which is pretty basic at this point: it can generate the same plots, but in an HTML file with embedded images. The idea is not to develop something complex like knitr, but rather to provide a simple way to generate reports for models.
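As a rough illustration of what "an HTML file with embedded images" can mean, the sketch below inlines image bytes as base64 data URIs so the report is a single self-contained file. It is an assumption about the general technique, not the package's actual implementation; `embed_image`, `build_report`, and the placeholder bytes are hypothetical.

```python
import base64
from html import escape


def embed_image(png_bytes, alt):
    """Return an <img> tag whose source is a base64 data URI (hypothetical helper)."""
    encoded = base64.b64encode(png_bytes).decode("ascii")
    return f'<img alt="{escape(alt, quote=True)}" src="data:image/png;base64,{encoded}">'


def build_report(title, figures):
    """Build one HTML page from (caption, png_bytes) pairs (hypothetical helper)."""
    sections = [
        f"<h2>{escape(caption)}</h2>\n{embed_image(png_bytes, caption)}"
        for caption, png_bytes in figures
    ]
    body = "\n".join(sections)
    return (
        "<!DOCTYPE html>\n"
        f"<html><head><title>{escape(title)}</title></head>\n"
        f"<body><h1>{escape(title)}</h1>\n{body}\n</body></html>"
    )


# Placeholder bytes (the PNG file signature) stand in for a real rendered plot
placeholder_png = b"\x89PNG\r\n\x1a\n"
report = build_report("Model report", [("Confusion matrix", placeholder_png)])
```

In practice the bytes would come from saving a figure to an in-memory buffer, and `report` would be written to an `.html` file that can be shared without separate image files.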

@amueller
Member

amueller commented Jan 6, 2017

Can you maybe say a bit more about the content? I have a PR for a confusion matrix plot in sklearn.
That also allows some easy visualization of grid-search results, which I think is important.

I would actually like to have some more ready-made functions for PR curves and ROC curves and feature importances.
Getting them "upstream" will take longer and a bit of discussion, but I think it will be useful.

I'm not sure what the best route is. I'm pretty busy right now but I'd love to see this in the main repo.
We can include it here and see what we can merge upstream, maybe?

I haven't looked at the report generation, that's probably something we don't want upstream yet.

@edublancas
Author

Sure, I updated the description.

@wdm0006

wdm0006 commented Mar 19, 2017

+1 from me for inclusion. It looks to be a very useful library, in particular the report generation portion.

@jnothman
Member

This is a bit different from what tends to be in scikit-learn-contrib. It is more comparable to something like eli5. Do we want this kind of thing in contrib, or should we put it on the related projects page?

@glemaitre
Member

> This is a bit different from what tends to be in scikit-learn-contrib.

I tend to think that it could be useful, since it could provide some plotting functionality to scikit-learn. I also think that having a tool similar to https://github.com/pandas-profiling/pandas-profiling could be good for scikit-learn-contrib.
