-
Notifications
You must be signed in to change notification settings - Fork 331
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
improved LIME for text data #110
Conversation
Current coverage is 97.27% (diff: 98.78%)@@ master #110 diff @@
==========================================
Files 35 37 +2
Lines 1894 2090 +196
Methods 0 0
Messages 0 0
Branches 362 390 +28
==========================================
+ Hits 1843 2033 +190
- Misses 24 28 +4
- Partials 27 29 +2
|
24e79b4
to
b73b393
Compare
…a has incorrect shape
…ral MaskingTextSamplerUnion.
…es for some of the class
* add decision tree example; * add some docs for sampling; * other documentation improvements.
//cc @lopuhin I think this is ready. Do you have any comments? Tutorial doesn't explain all options - most notably, it doesn't use |
|
||
If a library is not supported by eli5 directly, or the text processing | ||
pipeline is too complex for eli5, eli5 can still help - it provides an | ||
implementation of LIME (Ribeiro et al., 2016) algorithm which allows to |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think it would be nice to link to the paper (https://arxiv.org/abs/1602.04938)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, makes sense.
Hey @kmike , I just finished reading the notebook, it looks great - I like that most important stuff is at the start, and you show how it can break. KL divergence and score of the white-box classifier predictions of the black-box classifier seem to be important to know if explanation should be trusted - does it make sense to include them in explanation output by default? |
Hm, but I see it's not really convenient to implement, and it's not obvious that they are needed in the explanation. The PR looks great! I didn't know about |
I was also thinking about adding scores to the output by default, but it was not straightforward to implement indeed. Adding a custom field just for scores looks like a bit too much; putting it to description is not enough because description is hidden by default. |
This is a work-in-progress.
TODO:
This PR also fixes #102 and #39.