Using Fast Weights to Attend to the Recent Past

This repo is a TensorFlow implementation of the paper:

Using Fast Weights to Attend to the Recent Past
Jimmy Ba, Geoffrey Hinton, Volodymyr Mnih, Joel Z. Leibo, Catalin Ionescu
NIPS 2016, https://arxiv.org/abs/1610.06258
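
For orientation, the recurrence at the heart of the paper can be sketched in a few lines of plain Python. This is a minimal illustration, not the code in FastWeightsRNN.py: the function name `fast_weights_step`, its arguments, and the default values for `decay`, `lr`, and `inner_steps` are assumptions chosen for the example, and layer normalization is left out for brevity. The fast weight matrix A decays over time and is reinforced by the outer product of the current hidden state; the next hidden state is then refined by an inner loop that adds A times the intermediate state to the fixed slow-weight input.

```python
def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector (list)."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def relu(v):
    """Element-wise ReLU."""
    return [max(0.0, a) for a in v]


def fast_weights_step(A, h, x, W, C, decay=0.9, lr=0.5, inner_steps=1):
    """One time step of a fast-weights RNN (sketch, no layer normalization).

    A: fast weight matrix, h: current hidden state, x: current input,
    W: slow hidden-to-hidden weights, C: slow input-to-hidden weights.
    decay, lr and inner_steps are illustrative values, not the repo's settings.
    """
    n = len(h)
    # A(t) = decay * A(t-1) + lr * h(t) h(t)^T
    A = [[decay * A[i][j] + lr * h[i] * h[j] for j in range(n)]
         for i in range(n)]
    # Slow-weight contribution, held fixed during the inner loop.
    slow = [a + b for a, b in zip(matvec(W, h), matvec(C, x))]
    h_s = relu(slow)
    for _ in range(inner_steps):
        # h_{s+1} = f(W h + C x + A h_s)
        fast = matvec(A, h_s)
        h_s = relu([a + b for a, b in zip(slow, fast)])
    return A, h_s
```

Calling `fast_weights_step` once per input token, carrying `A` and the returned state forward, gives the full sequence model in this simplified form.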

Specifically, we follow the experiments in Sec. 4.1 (Associative retrieval) and try to reproduce the results in Table 1 and Figure 2. With R=50 (50 recurrent hidden units), the fast weights model reaches 100% accuracy (0% error rate) in ~30K iterations.
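
As a reminder of what the associative retrieval task looks like, the sketch below generates one sample in the style described in the paper: several letter-digit pairs, the separator `??`, and a query letter, with the digit paired to that letter as the target. The function name `make_sample` and its parameters are hypothetical, and the files in the data folder of this repo may be stored in a different format.

```python
import random
import string


def make_sample(num_pairs=4, rng=random):
    """Build one associative retrieval example as (sequence, target).

    num_pairs is the number of letter-digit pairs; rng can be the random
    module or a random.Random instance for reproducibility.
    """
    # Keys are distinct letters so the query has exactly one answer.
    keys = rng.sample(string.ascii_lowercase, num_pairs)
    values = [rng.choice(string.digits) for _ in keys]
    query = rng.choice(keys)
    target = values[keys.index(query)]
    sequence = "".join(k + v for k, v in zip(keys, values)) + "??" + query
    return sequence, target
```

A sample has the same shape as the paper's example input `c9k8j3f1??c`, whose target is `9`.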

Results:

Fast Weights (with layernorm): [figure not included]

Fast Weights (without layernorm): [figure not included]

LSTM: [figure not included]

All models were trained on a GTX 980 Ti with TensorFlow 0.11rc1.

All runs use the R=50 setting and the Adam optimizer with default parameters.
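
To make "default parameters" concrete: the defaults of TensorFlow's `tf.train.AdamOptimizer` are a learning rate of 0.001, beta1 of 0.9, beta2 of 0.999, and epsilon of 1e-8. The sketch below applies one Adam update to a single scalar parameter using the update rule from the Adam paper. The function name `adam_step` is hypothetical, and TensorFlow's internal formulation differs slightly in how it applies epsilon, so treat this as an illustration rather than the exact computation performed during training.

```python
import math


def adam_step(param, grad, m, v, t,
              lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one Adam update to a scalar parameter.

    m and v are the running first and second moment estimates,
    t is the 1-based step count used for bias correction.
    """
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad * grad
    # Bias-corrected moment estimates.
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v
```

On the very first step the bias correction cancels the moment scaling, so the parameter moves by roughly the learning rate in the direction opposite to the gradient, regardless of the gradient's magnitude.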

Train the fast weights model

python FW_train.py

Evaluate the fast weights model

python FW_eval.py

The LSTM baseline is trained and evaluated the same way, using LSTM_train.py and LSTM_eval.py.

Author

Fan Wu (jxwufan@gmail.com)

