C-GQA dataset: Some pairs in train_pairs.txt don't have image examples in the training set #3

Open
vkhoi opened this issue May 13, 2021 · 11 comments

@vkhoi

vkhoi commented May 13, 2021

Thank you for the new dataset. I have one question about it. I found 133 pairs in train_pairs.txt that don't have any image examples in the training set. Some examples are "angry person", "long_sleeved shirt", "open drawer", and "small duck". Could you look into this?
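
A check of this kind can be sketched as follows. This is a minimal sketch under stated assumptions, not the script used to produce the count above: it assumes each line of train_pairs.txt holds one `attribute object` pair separated by whitespace (consistent with the examples quoted), and that the training annotations can be reduced to a list of `(attribute, object)` labels, one per training image. The helper names and the toy data (including the `red car` pair) are hypothetical.

```python
from collections import Counter


def parse_pairs(lines):
    """Parse 'attribute object' lines into (attribute, object) tuples.

    Assumption: one pair per line, two whitespace-separated tokens.
    Lines that do not match this shape are skipped.
    """
    pairs = []
    for line in lines:
        parts = line.split()
        if len(parts) == 2:
            pairs.append((parts[0], parts[1]))
    return pairs


def pairs_without_images(train_pairs, train_labels):
    """Return the listed pairs that no training image is labeled with."""
    counts = Counter(train_labels)
    return [pair for pair in train_pairs if counts[pair] == 0]


# Toy data for illustration only; real labels would come from the
# dataset's own annotation files.
pair_lines = ["angry person", "small duck", "red car"]
train_labels = [("red", "car"), ("red", "car")]

missing = pairs_without_images(parse_pairs(pair_lines), train_labels)
print(len(missing), missing)
```

On the real dataset, `pair_lines` would be read from train_pairs.txt and `train_labels` from the training-split annotations; how those annotations are stored is not stated in this thread.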

@ferjad
Collaborator

ferjad commented Nov 14, 2021

@vkhoi @HeimingX
Apologies for the late update on this. We found this issue and have corrected it in an updated version of the dataset. You can download the corrected c-gqa from this link.
The dataset stats and the new numbers for cge can be found in this arXiv preprint.
All of the paper's conclusions and the ranking with respect to baseline methods still hold. We have thoroughly checked this updated version for possible issues, but please let us know if you find any other problems.

@ToneLi

ToneLi commented Dec 2, 2021

Are you sure this link can be opened? https://s3.mlcloud.uni-tuebingen.de/czsl/cgqa-updated.zip

@mancinimassimiliano
Collaborator

Hello @ToneLi! Our servers are currently under maintenance (see #12). I contacted the admins, who said the servers should be back online on Monday. I will let you know as soon as they are.

@mancinimassimiliano
Collaborator

Update @ToneLi: the server is up again, so the dataset can be downloaded. Let us know if you encounter any issues.

@IshanManchanda

Hi @mancinimassimiliano, I think the server is currently unreachable as I am unable to download the dataset. Can you confirm?

@mancinimassimiliano
Collaborator

Hello @IshanManchanda! Yes, I confirm this. There was maintenance scheduled for today. I will check with the admins in case the problem is not fixed by tomorrow.

@IshanManchanda

Hi @mancinimassimiliano, the file still seems to be down. Can you please check?

@mancinimassimiliano
Collaborator

@IshanManchanda The dataset should be back online now (sorry, it took longer than expected).

@yzou2

yzou2 commented Sep 19, 2022

Thanks for providing the great dataset. What's the license of the c-gqa dataset?

@mancinimassimiliano
Collaborator

Hello @yzou2! For the dataset license, @ferjad can provide a more precise answer.

@ferjad
Collaborator

ferjad commented Sep 30, 2022

@yzou2 We follow the same license as the original GQA dataset: https://creativecommons.org/licenses/by/4.0/

6 participants