Visual-Question-Answering

This is RNN+CNN Visual Question Answering Model. It uses VGG16 for image feature extraction. VQA Dataset is used for training the model.

Keras version 2.0+
Tensorflow 1.2+
Spacy version 2.0+
- To upgrade & install Glove Vectors
  - python -m spacy download en_vectors_web_lg
OpenCV

Download my pretrained model from here

For running pretrained model in Google Colab Click Here

For training the model run:

$ python train.py

Currently in intitial stages. You have to rename the image with the question you want to ask. For running:

$ set FLASK_APP=hello_app.py
$ flask run

https://github.com/VT-vision-lab/VQA_LSTM_CNN

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
DATA		DATA
VQA		VQA
static		static
LICENSE		LICENSE
README.md		README.md
Untitled.png		Untitled.png
VQA_Appplication.ipynb		VQA_Appplication.ipynb
VQA_Appplication_v1.py		VQA_Appplication_v1.py
VQA_Appplication_v2.py		VQA_Appplication_v2.py
VQA_Appplication_v3.py		VQA_Appplication_v3.py
hello_app.py		hello_app.py
hello_app_v2.py		hello_app_v2.py
hello_app_v3.py		hello_app_v3.py
train.py		train.py

Provide feedback