

Source code for Stacked attention networks for image question answering.

Joint collaboration between CMU and MSR.


The code is written in Python and uses the Theano package.

  • Python 2.7
  • Theano
  • Numpy
  • h5py
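
Before training, it can help to confirm that the packages listed above are importable. The following is a minimal sketch, not part of the repository: the helper name `find_missing` is hypothetical, and the import names `theano`, `numpy`, and `h5py` are assumed to match the packages in the list.

```python
import importlib

# Import names assumed from the requirements list in this README.
REQUIRED_MODULES = ["theano", "numpy", "h5py"]


def find_missing(modules):
    """Return the names in `modules` that raise ImportError on import."""
    missing = []
    for name in modules:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing
```

Calling `find_missing(REQUIRED_MODULES)` returns an empty list when every package can be imported, and otherwise the names that still need to be installed.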


Download the data from here and extract it into the data_vqa folder.
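
To check that the extraction step worked, a short script can list what ended up in the data folder. This is only a sketch: the helper name `list_data_files` is hypothetical, and the location of `data_vqa` relative to the working directory is an assumption, so adjust the path to wherever the archive was extracted.

```python
import os


def list_data_files(data_dir="data_vqa"):
    """Return the sorted entries of `data_dir`, or raise if it is missing.

    The default path is an assumption about where the data was extracted.
    """
    if not os.path.isdir(data_dir):
        raise IOError("data folder not found: " + data_dir)
    return sorted(os.listdir(data_dir))
```

An empty list means the folder exists but nothing was extracted into it.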

cd src; python

to start training.


If you use this code as part of your research, please cite our paper

''Stacked Attention Networks for Image Question Answering'', Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng and Alex Smola. To appear in CVPR 2016.

author    = {Zichao Yang and
Xiaodong He and
Jianfeng Gao and
Li Deng and
Alexander J. Smola},
title     = {Stacked Attention Networks for Image Question Answering},
journal   = {CoRR},
volume    = {abs/1511.02274},
year      = {2015},
url       = {},