Capsule Network

Readable implementation of a Capsule Network as described in "Dynamic Routing Between Capsules" [Hinton et. al.]

In this notebook, I'll be building a simple Capsule Network that aims to classify MNIST images. This is an implementation in PyTorch and this notebook assumes that you are already familiar with convolutional and fully-connected layers.

What are Capsules?

Capsules are a small group of neurons that have a few key traits:

Each neuron in a capsule represents various properties of a particular image part; properties like a parts color, width, etc.
Every capsule outputs a vector, which has some magnitude and orientation.
Capsules have a hierarchy between child and parent capsules and use dynamic routing to find the strongest connections between the output of one capsule and the inputs of the next layer of capsules.

You can read more about all of these traits in my blog post about capsules and dynamic routing.

Representing Relationships Between Parts

All of these traits allow capsules to communicate with each other and determine how data moves through them. Using dynamic communication, during the training process, a capsule network learns the spatial relationships between visual parts and their wholes (ex. between eyes, a nose, and a mouth on a face). When compared to a vanilla CNN, this knowledge about spatial relationships makes it easier for a capsule network to identify an object no matter what orientation it is in. These networks are also, generally, better able to identify multiple, overlapping objects, and to learn from smaller sets of training data!

Model Architecture

The Capsule Network that I'll define is made of two main parts:

A convolutional encoder
A fully-connected, linear decoder

The above image was taken from the original Capsule Network paper (Hinton et. al.). The notebook follows the architecture described in that paper and tries to replicate some of the experiments, such as feature visualization, that the authors pursued.

Running Code Locally

If you're interested in running this code on your own computer, there are thorough instructions on setting up anaconda, and downloading PyTorch and the necessary libraries in the readme of Udacity's deep learning repo. After downloading the necessary libraries, you can proceed with cloning and running this code, as usual.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
assets		assets
.gitignore		.gitignore
Capsule_Network.ipynb		Capsule_Network.ipynb
LICENSE		LICENSE
README.md		README.md
helpers.py		helpers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assets

assets

.gitignore

.gitignore

Capsule_Network.ipynb

Capsule_Network.ipynb

LICENSE

LICENSE

README.md

README.md

helpers.py

helpers.py

Repository files navigation

Capsule Network

What are Capsules?

Representing Relationships Between Parts

Model Architecture

Running Code Locally

About

Releases

Packages

Languages

License

Taaccoo-beta/capsule_net_pytorch

Folders and files

Latest commit

History

Repository files navigation

Capsule Network

What are Capsules?

Representing Relationships Between Parts

Model Architecture

Running Code Locally

About

Resources

License

Stars

Watchers

Forks

Languages