Recurrent Attention Model (RAM) implemented with Chainer, based on the following paper:
arXiv:1406.6247: Recurrent Models of Visual Attention [Volodymyr Mnih+ 2014]
- RAM model definition file (Chainer)
- script for training the model on MNIST
- script to run the model on MNIST
- hyper-parameters reported to give the best accuracy in the paper
- multi-scale glimpse
- models to solve "Translated MNIST" task
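The multi-scale glimpse listed above can be sketched as follows. This is a minimal NumPy illustration of the glimpse sensor from the RAM paper (square patches at increasing scales, all pooled down to the same resolution); `extract_glimpse` and its parameters are hypothetical helpers for illustration, not this repository's API:

```python
import numpy as np

def extract_glimpse(image, loc, size=8, n_scales=3):
    """Sketch of a multi-scale glimpse sensor (hypothetical, not the repo's API).

    Extracts n_scales square patches centered at loc, each twice as large
    as the previous one, and average-pools each down to size x size.
    image: 2-D array (H, W); loc: (y, x) in [-1, 1] as in the RAM paper.
    """
    h, w = image.shape
    # map loc from [-1, 1] to pixel coordinates
    cy = int((loc[0] + 1.0) / 2.0 * h)
    cx = int((loc[1] + 1.0) / 2.0 * w)
    patches = []
    for s in range(n_scales):
        half = size * (2 ** s) // 2
        # zero-pad so patches near the image border stay square
        padded = np.pad(image, half, mode="constant")
        patch = padded[cy:cy + 2 * half, cx:cx + 2 * half]
        k = patch.shape[0] // size  # average-pool factor back to size x size
        pooled = patch.reshape(size, k, size, k).mean(axis=(1, 3))
        patches.append(pooled)
    return np.stack(patches)  # shape: (n_scales, size, size)
```

The stacked patches are what the glimpse network would encode together with the location before feeding the core RNN.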
Training the model without LSTM takes about a day on a CPU (reaching 96% accuracy).
Training the model with LSTM takes ??? on a CPU
(still searching for the hyper-parameters that reproduce the best accuracy in the paper...)
Python (2 or 3), Chainer, scikit-learn, PIL, tqdm
➜ python train.py
If you use a GPU, add the option "-g deviceID".
To use LSTM units in the core RNN layer, add the option "--lstm"
(better performance, but training takes somewhat longer with LSTMs).
➜ python train.py -g 0 --lstm
After training, you can get predictions from the trained model.
➜ python predict.py -m ram_wolstm.chainermodel