Question about data overlap #16

lichengshen · 2024-08-26T19:03:18Z

Hello, thanks for the great work.

I noticed that the RefCOCO val/test sets use images from the COCO training set. When doing joint training, I think this could cause a data leak, that the testing images and masks for RefCOCO are seen when training on COCO-Panoptic. Is this true, or have you handled this somewhere?

Z-MU-Z · 2024-09-02T07:34:43Z

Hello, I noticed that LISA mentioned in the paper 'we exclude the COCO samples whose images are present in the refCOCO(+/g) validation sets during training.' However, it seems that I didn't find this implemented in the code. Did you notice anything regarding this?"

zamling · 2024-09-02T13:31:07Z

Hi, all

I did not notice this problem. We built our dataset based on LISA and UNINEXT. RefCOCO/+/g and COCO train2017 are built from LISA, so I do not know whether such data leak has been dealed well. So can you provide me some codes in LISA or UNINEXT about this processing? So that I can check wheter I follow it correctly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about data overlap #16

Question about data overlap #16

lichengshen commented Aug 26, 2024

Z-MU-Z commented Sep 2, 2024

zamling commented Sep 2, 2024 •

edited

Loading

Question about data overlap #16

Question about data overlap #16

Comments

lichengshen commented Aug 26, 2024

Z-MU-Z commented Sep 2, 2024

zamling commented Sep 2, 2024 • edited Loading

zamling commented Sep 2, 2024 •

edited

Loading