My TensorFlow implementation of Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering - Vahid Kazemi and Ali Elqursh.
I wanted to try implementing a multimodal model in TF and this seemed like a good candidate. Lots of code needs to be added though