Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about data overlap #16

Open
lichengshen opened this issue Aug 26, 2024 · 2 comments
Open

Question about data overlap #16

lichengshen opened this issue Aug 26, 2024 · 2 comments

Comments

@lichengshen
Copy link

Hello, thanks for the great work.

I noticed that the RefCOCO val/test sets use images from the COCO training set. When doing joint training, I think this could cause a data leak, that the testing images and masks for RefCOCO are seen when training on COCO-Panoptic. Is this true, or have you handled this somewhere?

@Z-MU-Z
Copy link

Z-MU-Z commented Sep 2, 2024

Hello, I noticed that LISA mentioned in the paper 'we exclude the COCO samples whose images are present in the refCOCO(+/g) validation sets during training.' However, it seems that I didn't find this implemented in the code. Did you notice anything regarding this?"

@zamling
Copy link
Owner

zamling commented Sep 2, 2024

Hi, all

I did not notice this problem. We built our dataset based on LISA and UNINEXT. RefCOCO/+/g and COCO train2017 are built from LISA, so I do not know whether such data leak has been dealed well. So can you provide me some codes in LISA or UNINEXT about this processing? So that I can check wheter I follow it correctly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants