explain_predictions_ functions should return top_k features by magnitude of SHAP value #1360

Closed
freddyaboulton opened this issue Oct 28, 2020 · 0 comments · Fixed by #1374
@freddyaboulton (Contributor)

Currently, the explain_predictions_ functions accept a top_k_features argument. By design, this returns the features with the top_k highest and the top_k lowest SHAP values, for a total of 2 * top_k features.

The idea was to give users a sense of what drives a particular prediction up vs. down, but this approach can include irrelevant information. For example, consider a prediction where every feature has a positive SHAP value: 10 features have very high SHAP values, and the smallest 5 have SHAP values close to 0. Passing top_k=5 would include the 5 features with near-zero SHAP values. It would be better to just return the 10 features with the highest SHAP values by magnitude.
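A minimal sketch of the proposed selection rule (a hypothetical helper, not the actual evalml implementation): rank features by the absolute value of their SHAP values and keep only the top_k.

```python
import numpy as np

def top_k_by_magnitude(shap_values, feature_names, top_k):
    """Return the top_k (feature, shap_value) pairs ranked by |SHAP value|.

    Hypothetical helper illustrating the proposed behavior; names and
    signature are assumptions for this sketch.
    """
    shap_values = np.asarray(shap_values, dtype=float)
    # Sort indices by absolute SHAP value, largest first, and keep top_k.
    order = np.argsort(np.abs(shap_values))[::-1][:top_k]
    return [(feature_names[i], float(shap_values[i])) for i in order]

# Scenario from the issue: every SHAP value is positive, and the
# smallest ones are near zero. Ranking by magnitude skips them.
values = [3.2, 2.9, 2.5, 2.1, 1.8, 0.02, 0.01, 0.005]
names = [f"f{i}" for i in range(len(values))]
print(top_k_by_magnitude(values, names, top_k=5))
```

With mixed-sign SHAP values this still surfaces the most influential features in either direction, since a large negative value has a large magnitude.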

FYI @kmax12
