ExpO

This is the code to reproduce the results from Regularizing Black-box Models for Improved Interpretability which was published at NeurIPS 2020. A summary of this paper is as follows:

''Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, whose explanation quality can be unpredictable. Our method, ExpO, is a hybridization of these approaches that regularizes a model for explanation quality at training time. Importantly, these regularizers are differentiable, model agnostic, and require no domain knowledge to define. We demonstrate that post-hoc explanations for ExpO-regularized models have better explanation quality, as measured by the common fidelity and stability metrics. We verify that improving these metrics leads to significantly more useful explanations with a user study on a realistic task.''

Name		Name	Last commit message	Last commit date
Latest commit History 131 Commits
Cancer-compare		Cancer-compare
Cancer-normal		Cancer-normal
Cancer-over		Cancer-over
Cancer-senn		Cancer-senn
Code		Code
Datasets		Datasets
Docker-senn		Docker-senn
Figures		Figures
MNIST		MNIST
MSD-LF		MSD-LF
MSD-LF1D		MSD-LF1D
MSD-None		MSD-None
MSD-compare		MSD-compare
MSD-time		MSD-time
Med-LF		Med-LF
Med-LF1D		Med-LF1D
Med-None		Med-None
Med-compare		Med-compare
UCI-HCI		UCI-HCI
UCI-L1		UCI-L1
UCI-L2		UCI-L2
UCI-LF		UCI-LF
UCI-LF1D		UCI-LF1D
UCI-None		UCI-None
UCI-compare		UCI-compare
UCI-demo		UCI-demo
UCI-demo2		UCI-demo2
UCI-demo3		UCI-demo3
notebooks		notebooks
.gitignore		.gitignore
Demo.mov		Demo.mov
README.md		README.md
run_med.sh		run_med.sh
run_uci.sh		run_uci.sh

GDPlumb/ExpO

Folders and files

Latest commit

History

Repository files navigation

ExpO

About

Resources

Stars

Watchers

Forks

Languages