Skip to content

nnaik39/commvqa

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 

Repository files navigation

CommVQA: Situating Visual Question Answering in Communicative Contexts

This is the official Github repository for our paper CommVQA: Situating Visual Question Answering in Communicative Contexts (Arxiv, 2024). We provide the code and data necessary to replicate our results. If you experience any issues, please email nanditan (at) cs.stanford.edu.

Downloading the CommVQA Dataset

CommVQA, the VQA dataset introduced in our paper, consists of images, descriptions, contexts, questions, and a set of answers.

For details on downloading CommVQA, navigate to CommVQA_dataset/.

Reproducing Section 4: Model Experiments

To reproduce the model experiments within our paper, please navigate to models/ for more details.

Citation

If you find this repo or the paper useful in your research, please feel free to cite our paper:

@unpublished{naik2024commvqa,
	author = {Naik, Nandita Shankar and Potts, Christopher and Kreiss, Elisa},
	note = {arXiv:2402.15002},
	title = {{CommVQA}: Situating Visual Question Answering in Communicative Contexts},
	url = {https://arxiv.org/abs/2402.15002},
	year = {2024}}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published