Skip to content

This repo focuses on improving current Visual Question Answering approaches by taking a reversed approach: increase the robustness of current models by providing a wider variety of data of training.

andreiluca96/reversed-vqa

Repository files navigation

reversed-vqa

Overview

In this repository we're planning to enhance the current Visual Question Answering (VQA) approaches by improving the robustness by enriching the VQA datasets variety.

Many articles of VQA emphasize the fact the the roubstness of VQA models is very weak due to the fact that small changes in the image/question repharsing might result in unpredictable outputs.

Here we'll try to explore the current VQA datasets and enhance them with more data that can be used for training.

Approaches

  • reversed VQA: target question generation based on image and answers

  • question rephrase through AutoEncoders, Clustering and Generation models

  • unsupervised question generation by feeding images and targeting questions

Further development:

  • using the reversed VQA model and classical VQA models we can train Generative Adversarial Networks architecture

Related articles

About

This repo focuses on improving current Visual Question Answering approaches by taking a reversed approach: increase the robustness of current models by providing a wider variety of data of training.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published