We used a small part of the dataset download and preprocessing code released by the official VQA team: https://github.com/GT-Vision-Lab/VQA_LSTM_CNN/blob/master/data/vqa_preprocessing.py. That script appears in our codebase as vqa_preprocess.py.
Apart from the preprocessing file noted above, every part of the codebase was developed by our team.
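If the VQA data still needs to be fetched and converted, the preprocessing script can be run on its own. The invocation below is a sketch: the --download and --split flags are assumptions carried over from the upstream vqa_preprocessing.py, so confirm them with --help before running.

python vqa_preprocess.py --download 1 --split 1    # flag names assumed from the upstream script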
Given the current configuration, the models can be trained with the following commands:
- baseline model: run
python main_baseline.py
- single-word model: run
python main_one_word.py
- multi-word model: run
python main_enc_dec.py
For a demo of our models, run:
python demo.py
Every script takes command-line arguments with default values already set; these parameters can be tweaked as the user likes (see the sketch below).
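As a hypothetical illustration of that pattern, the scripts' argument handling looks roughly like the argparse setup below. The flags --epochs, --batch_size, and --lr are placeholders, not guaranteed flag names; the real names and defaults live in each main_*.py file.

import argparse

# Placeholder flags for illustration only; actual names and defaults
# are defined in each main_*.py script.
parser = argparse.ArgumentParser(description="Train a VQA model")
parser.add_argument("--epochs", type=int, default=10, help="number of training epochs")
parser.add_argument("--batch_size", type=int, default=64, help="mini-batch size")
parser.add_argument("--lr", type=float, default=1e-3, help="learning rate")
args = parser.parse_args()

With such a setup, a run that overrides the defaults would look like python main_baseline.py --epochs 20 --lr 5e-4 (again, hypothetical flag names).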
Machine and software requirements
- GPU access (required for the demo)
- PyTorch v0.4
- torchtext (from the PyTorch project)
- Python 3.7+
- spaCy
- CUDA Toolkit 9.0
- cuDNN
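A quick sanity check for the environment is the short snippet below; it only uses standard PyTorch and spaCy calls. The spaCy model name "en" is an assumption (install it with python -m spacy download en if it is missing).

import torch
import torchtext  # import alone verifies torchtext is installed
import spacy

print("PyTorch:", torch.__version__)                  # expect 0.4.x
print("CUDA available:", torch.cuda.is_available())   # must be True to run the demo
print("cuDNN enabled:", torch.backends.cudnn.enabled)
nlp = spacy.load("en")  # "en" model assumed; download via: python -m spacy download en
print("spaCy model loaded OK")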