Train a deeper LSTM and normalized CNN Visual Question Answering model. The current code reaches 58.16 on OpenEnded and 63.09 on Multiple-Choice on test-standard.
Utility functions for neural network implementations in Torch.
The VQA dataset browser back-end code, using nginx, Django, and PostgreSQL (running in Docker containers).
The second version of the interface for the Abstract Scenes research project.