Input Switched Affine Networks: An RNN Architecture Designed for Interpretability (ICML 2017)

There exist many problem domains where the interpretability of neural network models is essential for deployment. Here we introduce a recurrent architecture composed of input-switched affine transformations – in other words an RNN without any explicit nonlinearities, but with inputdependent recurrent weights. This simple form allows the RNN to be analyzed via straightforward linear methods: we can exactly characterize the linear contribution of each input to the model predictions; we can use a change-of-basis to disentangle input, output, and computational hidden unit subspaces; we can fully reverse-engineer the architecture’s solution to a simple task. Despite this ease of interpretation, the input switched affine network achieves reasonable performance on a text modeling tasks, and allows greater computational efficiency than networks with standard nonlinearities. --Abstract

Parenthesis Task

The implementation was trained on the Parenthesis task. Here is the result:

Loss as a function of the number of steps

Resources

The paper is available at http://proceedings.mlr.press/v70/foerster17a/foerster17a.pdf

Contributing

Thanks to Justin Gilmer, one of the authors of the paper for providing some source code under the Apache license.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
.gitignore		.gitignore
README.md		README.md
isan_cell.py		isan_cell.py
task.py		task.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assets

assets

.gitignore

.gitignore

README.md

README.md

isan_cell.py

isan_cell.py

task.py

task.py

Repository files navigation

Input Switched Affine Networks: An RNN Architecture Designed for Interpretability (ICML 2017)

Parenthesis Task

Resources

Contributing

About

Releases

Packages

Languages

afcarl/tensorflow-isan-rnn

Folders and files

Latest commit

History

Repository files navigation

Input Switched Affine Networks: An RNN Architecture Designed for Interpretability (ICML 2017)

Parenthesis Task

Resources

Contributing

About

Resources

Stars

Watchers

Forks

Languages