PyTorch Implementation of Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models [ICDM 2023]
We present Learning to Explain (LTX), a model-agnostic framework designed for providing post-hoc explanations for vision models. The LTX framework introduces an "explainer" model that generates explanation maps, highlighting the crucial regions that justify the predictions made by the model being explained. To train the explainer, we employ a two-stage process consisting of initial pretraining followed by per-instance finetuning. During both stages of training, we utilize a unique configuration where we compare the explained model's prediction for a masked input with its original prediction for the unmasked input. This approach enables the use of a novel counterfactual objective, which aims to anticipate the model's output using masked versions of the input image. Importantly, the LTX framework is not restricted to a specific model architecture and can provide explanations for both Transformer-based and convolutional models. Through our evaluations, we demonstrate that LTX significantly outperforms the current state-of-the-art in explainability across various metrics.
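As a rough illustration of the counterfactual objective described above, here is a minimal PyTorch sketch. The `ltx_loss` helper, the sigmoid mask parameterization, and the toy stand-in models are assumptions for illustration only, not the repository's actual code; the sparsity weight plays the role of the `--mask-loss-mul` flag used below.

```python
import torch
import torch.nn.functional as F

def ltx_loss(explainee, explainer, x, mask_loss_mul=50.0):
    """Sketch of the counterfactual objective (hypothetical helper):
    the explainer predicts a saliency mask, the explainee scores the
    masked input, and the loss pushes the masked prediction toward the
    original unmasked prediction while keeping the mask sparse."""
    with torch.no_grad():
        target = F.softmax(explainee(x), dim=-1)       # original prediction
    mask = torch.sigmoid(explainer(x))                 # explanation map in [0, 1]
    masked_pred = F.log_softmax(explainee(x * mask), dim=-1)
    pred_loss = F.kl_div(masked_pred, target, reduction="batchmean")
    mask_loss = mask.mean()                            # sparsity regularizer
    return pred_loss + mask_loss_mul * mask_loss

# Toy demo with cheap stand-ins for the ViT explainee/explainer.
torch.manual_seed(0)
explainee = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 8 * 8, 10))
explainer = torch.nn.Conv2d(3, 3, kernel_size=3, padding=1)  # per-pixel logit map
x = torch.randn(2, 3, 8, 8)
loss = ltx_loss(explainee, explainer, x)
print(loss.item())
```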
- Download `checkpoints.zip` from https://drive.google.com/file/d/1syOvmnXFgMsIgu-10LNhm0pHDs2oo1gm/
- Run `unzip checkpoints.zip -d ./checkpoints/` (after unzipping, the checkpoints should sit in the folders corresponding to the backbone type, e.g. `vit_base`)
These checkpoints are essential for reproducing the results. All explanation metrics can be calculated using the mask files created during the LTX procedure.
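For context, explanation metrics of this kind are typically perturbation curves computed from the saliency masks. Below is a minimal NumPy sketch of a deletion-style metric; the `deletion_score` helper and the step-averaged score are illustrative assumptions, not the repository's implementation.

```python
import numpy as np

def deletion_score(scores_fn, image, mask, steps=10):
    """Illustrative sketch: zero out pixels from most to least salient
    (according to the explanation mask) and average the model's score
    over the resulting curve. A lower average means the mask identified
    the evidence the model relies on."""
    order = np.argsort(mask.ravel())[::-1]         # most salient pixels first
    flat = image.copy().ravel()                    # work on a flat copy
    curve = [scores_fn(flat.reshape(image.shape))]
    chunk = max(1, flat.size // steps)
    for i in range(0, flat.size, chunk):
        flat[order[i:i + chunk]] = 0.0
        curve.append(scores_fn(flat.reshape(image.shape)))
    return float(np.mean(curve))                   # step-averaged score

# Toy usage: score = total intensity, saliency = the intensity itself.
rng = np.random.default_rng(0)
image = rng.random((8, 8))
score = deletion_score(lambda im: float(im.sum()), image, image)
print(score)
```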
```shell
# Per-instance finetuning starting from the finetuned explainer
CUDA_VISIBLE_DEVICES=0 PYTHONPATH=./:$PYTHONPATH nohup python main/seg_classification/run_seg_cls_opt.py --RUN-BASE-MODEL False --explainer-model-name vit_base_224 --explainee-model-name vit_base_224 --train-model-by-target-gt-class True

# Per-instance finetuning starting from the base (pretrained-only) explainer
CUDA_VISIBLE_DEVICES=0 PYTHONPATH=./:$PYTHONPATH nohup python main/seg_classification/run_seg_cls_opt.py --RUN-BASE-MODEL True --explainer-model-name vit_base_224 --explainee-model-name vit_base_224 --train-model-by-target-gt-class True

# Pretraining the explainer for 30 epochs
CUDA_VISIBLE_DEVICES=0 PYTHONPATH=./:$PYTHONPATH nohup python main/seg_classification/run_seg_cls.py --enable-checkpointing True --explainer-model-name vit_base_224 --explainee-model-name vit_base_224 --mask-loss-mul 50 --train-model-by-target-gt-class True --n-epochs 30 --train-n-label-sample 1
```
- Download the ImageNet dataset
- Download the COCO Val2017 dataset
- Download the Pascal VOC 2012 validation dataset
- Move all datasets to `./data/`
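After the downloads, a layout along these lines is expected (the folder names are assumptions inferred from the dataset names above; match them to what the evaluation scripts actually look for):

```
data/
├── imagenet_dataset/
├── COCO_Val2017/
└── Pascal_val_2012/
```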
```shell
# Segmentation evaluation, stage A
CUDA_VISIBLE_DEVICES=0 PYTHONPATH=./:$PYTHONPATH nohup python main/segmentation_eval/seg_stage_a.py --explainer-model-name vit_base_224 --explainee-model-name vit_base_224 --dataset-type imagenet

# Segmentation evaluation, stage B
CUDA_VISIBLE_DEVICES=0 PYTHONPATH=./:$PYTHONPATH nohup python main/segmentation_eval/seg_stage_b.py --explainer-model-name vit_base_224 --explainee-model-name vit_base_224 --dataset-type imagenet
```
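The segmentation scripts score explanation maps against ground-truth object masks. Below is a minimal NumPy sketch of the standard pixel-accuracy and IoU measures used in such evaluations; the `segmentation_metrics` helper is illustrative, not the repository's code.

```python
import numpy as np

def segmentation_metrics(pred_mask, gt_mask, threshold=0.5):
    """Illustrative sketch: threshold the explanation map and compare it
    with the ground-truth object mask via pixel accuracy and IoU."""
    pred = pred_mask >= threshold
    gt = gt_mask.astype(bool)
    pixel_acc = (pred == gt).mean()                  # fraction of agreeing pixels
    inter = np.logical_and(pred, gt).sum()
    union = np.logical_or(pred, gt).sum()
    iou = inter / union if union else 1.0            # intersection over union
    return float(pixel_acc), float(iou)

# Toy usage with a 2x2 map that matches the ground truth exactly.
pred = np.array([[0.9, 0.2], [0.8, 0.1]])
gt = np.array([[1, 0], [1, 0]])
print(segmentation_metrics(pred, gt))  # (1.0, 1.0)
```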
The dataset is selected via the `--dataset-type` parameter: `imagenet`, `coco`, or `voc`.
If you make use of our work, please cite our paper:
```bibtex
@inproceedings{barkan2023learning,
  title={Learning to explain: A model-agnostic framework for explaining black box models},
  author={Barkan, Oren and Asher, Yuval and Eshel, Amit and Elisha, Yehonatan and Koenigstein, Noam},
  booktitle={2023 IEEE International Conference on Data Mining (ICDM)},
  pages={944--949},
  year={2023},
  organization={IEEE}
}
```