Explainability-of-Deep-Learning-Algorithms

What factors influence the predictions of Deep learning Algorithms?

Overview

The goal of an attribution method is to determine which parts of an input image a trained model uses to predict the class of the input image. If those parts of the image is perturbed then the model will fail to correctly classify the image.

In this project, I have implemented a VGG 16 model and the following attribution methods:

Saliency map [1] (hw3_attribution.py),
Integrated Gradients [2] (hw3_attribution.py),
Influence-directed Explanations [3] (hw3_infl.py)
Representer Points [4] (hw4_part3.py)

Then, I have compared these methods using visual comparisons as well as quantitative metrics like Average % Drop and Necessity Ordering [5] (hw3_evaluation.py)

Results

Saliency Map

Integrated Gradients

Influence-directed Explanations

Representer Points

References

[1] David Baehrens, Timon Schroeter, Stefan Harmeling, Motoaki Kawanabe, Katja Hansen, and Klaus- Robert Mueller. How to explain individual classification decisions, 2009.

[2] Mukund Sundararajan, Ankur Taly, and Qiqi Yan. Axiomatic attribution for deep networks. arXiv preprint arXiv:1703.01365, 2017.

[3] Klas Leino, Linyi Li, Shayak Sen, Anupam Datta, and Matt Fredrikson. Inuence-directed explanations for deep convolutional networks. arXiv preprint arXiv:1802.03788, 2018.

[4] Chih-Kuan Yeh, Joon Kim, Ian En-Hsu Yen, and Pradeep K Ravikumar. Representer point selection for explaining deep neural networks. In Advances in Neural Information Processing Systems, pages 9291–9301, 2018.

[5] Zifan Wang, PiotrPiotr Mardziel, Anupam Datta, and Matt Fredrikson. Interpreting interpretations: Organizing attribution methods by criteria, 2020.

Note: This project is part of my Homeworks. Current CMU students please refrain from going through the codes.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
class.PNG		class.PNG
fp.PNG		fp.PNG
homework4.pdf		homework4.pdf
hw3_attribution.py		hw3_attribution.py
hw3_evaluation.py		hw3_evaluation.py
hw3_infl.py		hw3_infl.py
hw3_part_one.pdf		hw3_part_one.pdf
hw3_part_two.pdf		hw3_part_two.pdf
hw3_utils.py		hw3_utils.py
hw3_vc.py		hw3_vc.py
hw4_mnist.py		hw4_mnist.py
hw4_part3.py		hw4_part3.py
hw4_utils.py		hw4_utils.py
ide.PNG		ide.PNG
ig.PNG		ig.PNG
ip.PNG		ip.PNG
sm.PNG		sm.PNG
vgg16.py		vgg16.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Explainability-of-Deep-Learning-Algorithms

Overview

Results

Saliency Map

Integrated Gradients

Influence-directed Explanations

Representer Points

References

About

Releases

Packages

Languages

manashpratim/Explainability-of-Deep-Learning-Algorithms

Folders and files

Latest commit

History

Repository files navigation

Explainability-of-Deep-Learning-Algorithms

Overview

Results

Saliency Map

Integrated Gradients

Influence-directed Explanations

Representer Points

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages