Visual Question Answering datasets are available in multimodal. Annotations data are automatically downloaded and processed when the class is instanciated. Note that the pre-processing can take several minutes.
.. autoclass:: multimodal.datasets.VQA :inherited-members:
.. autoclass:: multimodal.datasets.VQA2 :inherited-members:
.. autoclass:: multimodal.datasets.VQACP :inherited-members:
.. autoclass:: multimodal.datasets.VQACP2 :inherited-members:
https://cs.stanford.edu/people/jcjohns/clevr/
.. autoclass:: multimodal.datasets.CLEVR :inherited-members: