OpenVQA is a general platform for visual question answering (VQA) research. It implements state-of-the-art approaches (e.g., BUTD, MFH, BAN and MCAN) on benchmark datasets such as VQA-v2, GQA and CLEVR. Support for more methods and datasets will be added continuously.
This project is released under the Apache 2.0 license.
Supported methods and benchmark datasets are shown in the table below. Results and models are available in MODEL_ZOO.md.
| Method | VQA-v2 | GQA | CLEVR |
|---|---|---|---|
| BUTD | ✓ | | |
| MFB | ✓ | | |
| MFH | ✓ | | |
| BAN | ✓ | | |
| MCAN | ✓ | | |
Please refer to INSTALL.md for installation and dataset preparation.
Please see GETTING_STARTED.md for the basic usage of OpenVQA.
We appreciate all contributions that improve OpenVQA. Please refer to CONTRIBUTING.md for the contribution guidelines.
This repo is currently maintained by Zhou Yu (@yuzcccc) and Yuhao Cui (@cuiyuhao1996).