GitHub - soumyadip1995/TCAV: ⚙📲Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV)

Testing with Concept Activation Vectors (TCAV)

Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV)

Been Kim, Martin Wattenberg, Justin Gilmer, Carrie Cai, James Wexler, Fernanda Viegas, Rory Sayres

Paper link: https://arxiv.org/abs/1711.11279

What is TCAV?

Testing with Concept Activation Vectors (TCAV) is a new interpretability method for understanding which signals your neural network model uses for prediction.

Read my full blog post here.

See the full Jupyter notebook here.

Credit goes to this academic team here. If you wish to run this, clone this repo.

What's special about TCAV compared to other methods?

Typical interpretability methods show importance weights for each input feature (e.g., pixel). TCAV instead shows the importance of high-level concepts (e.g., color, gender, race) for a prediction class, which is how humans communicate!

Typical interpretability methods require you to have one particular image that you are interested in understanding. TCAV gives an explanation that is generally true for a class of interest, beyond one image (a global explanation). The key idea is to view the high-dimensional internal state of a neural net as an aid, not an obstacle. Concept Activation Vectors (CAVs) are used as part of a technique, Testing with CAVs (TCAV), that uses directional derivatives to quantify the degree to which a user-defined concept is important to a classification result; for example, how sensitive a prediction of "zebra" is to the presence of stripes.

For example, for a given class, we can show how much race or gender mattered for classifications in a pretrained model, even though neither race nor gender labels were part of the training input!
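To make the idea of quantifying concept sensitivity more concrete, here is a minimal NumPy sketch under stated assumptions. It is not the code from this repo or from the official TCAV library: the function names `train_cav` and `tcav_score`, the synthetic activations, and the toy `tanh` class head are all hypothetical and chosen only for illustration. The sketch fits a linear classifier that separates "concept" activations from random ones, takes its unit normal as the CAV, and reports the fraction of class examples whose logit increases along that direction. The paper additionally tests statistical significance over many random sets, which is omitted here.

```python
import numpy as np


def train_cav(concept_acts, random_acts, steps=500, lr=0.1):
    """Fit a logistic-regression classifier (plain gradient descent) that
    separates concept activations from random ones; return its unit normal."""
    X = np.vstack([concept_acts, random_acts])
    y = np.concatenate([np.ones(len(concept_acts)), np.zeros(len(random_acts))])
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))  # predicted P(concept)
        w -= lr * (X.T @ (p - y)) / len(y)      # gradient of mean log-loss
        b -= lr * np.mean(p - y)
    return w / np.linalg.norm(w)


def tcav_score(logit_grads, cav):
    """Fraction of examples whose directional derivative along the CAV is
    positive, i.e. moving toward the concept raises the class logit."""
    return float(np.mean(logit_grads @ cav > 0))


# Synthetic layer activations (assumption): the concept shifts axis 0.
rng = np.random.default_rng(0)
dim = 8
concept_direction = np.zeros(dim)
concept_direction[0] = 1.0
concept_acts = rng.normal(size=(100, dim)) + 3.0 * concept_direction
random_acts = rng.normal(size=(100, dim))
cav = train_cav(concept_acts, random_acts)

# Toy class head (assumption): logit(a) = tanh(a[0]), so the gradient of the
# logit w.r.t. the activations is nonzero only on axis 0. With a real network
# these gradients would come from automatic differentiation.
class_acts = rng.normal(size=(50, dim))
logit_grads = np.zeros_like(class_acts)
logit_grads[:, 0] = 1.0 - np.tanh(class_acts[:, 0]) ** 2

score = tcav_score(logit_grads, cav)
print(f"TCAV score for the toy class: {score:.2f}")
```

A score near 1 means the class logit rises along the concept direction for nearly all examples, and a score near 0 means it falls. The linear classifier is a design choice that keeps the concept representable as a single direction in activation space.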

Why use high level concepts instead of input features?

Humans think and communicate using concepts, not numbers (e.g., weights for each feature). When there are many numbers to combine and reason about (many features), it becomes increasingly hard for humans to make sense of the information. TCAV instead delivers explanations in the way humans communicate with each other.

