Data leak #24

kimihailv · 2022-12-02T09:52:50Z

Hello! According to the XTD-10 repo, the test set contains 800 images from the MSCOCO train set. During training you also use the MSCOCO train set, so it seems you have a data leak. Or maybe I don't understand something.

FreddeFrallan · 2022-12-02T12:43:14Z

Hey,

Now that you mention it, it looks like XTD includes train images in their translated captions. Which, in my humble opinion, is a rather weird decision... At least when there's still data from val+test that they have not used... ?
So yes, there seems to be data leakage in our evaluation.

We're currently working on creating a better evaluation system at CLIP_BENCHMARK, and we are working towards creating some multilingual evaluations.

The evaluations at this repo should be updated when such evaluations are available.

guillemram97 · 2023-02-11T14:11:59Z

How did you evaluate Table 1 in the original paper ('Cross-lingual and Multilingual CLIP')? Was the space of retrievable images the 1k images from the XTD-10 dataset? I ask because there is no intersection between the images of that dataset and the MSCOCO 2014 test set.
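
For anyone who wants to check the overlap themselves, here is a minimal sketch. It assumes the evaluation images are listed by their MSCOCO 2014 file names (which embed the split, e.g. `COCO_train2014_000000000009.jpg`) and that you have a list of the file names used for training. The helper names `split_counts` and `leaked_names` and the toy lists are hypothetical, not taken from this repo or from XTD-10.

```python
import re
from collections import Counter

# MSCOCO 2014 file names embed the split, e.g. "COCO_train2014_000000000009.jpg".
_COCO_NAME = re.compile(r"COCO_(train|val|test)2014_(\d+)\.jpg$")


def split_counts(image_names):
    """Count how many file names belong to each MSCOCO 2014 split."""
    counts = Counter()
    for name in image_names:
        match = _COCO_NAME.search(name.strip())
        counts[match.group(1) if match else "unknown"] += 1
    return counts


def leaked_names(eval_names, train_names):
    """Return the evaluation file names that also appear in the training list."""
    train = {name.strip() for name in train_names}
    return sorted(name.strip() for name in eval_names if name.strip() in train)


# Toy lists for illustration only (hypothetical, not the real XTD-10 contents).
eval_names = [
    "COCO_train2014_000000000009.jpg",
    "COCO_val2014_000000000042.jpg",
    "COCO_train2014_000000000025.jpg",
]
train_names = [
    "COCO_train2014_000000000009.jpg",
    "COCO_train2014_000000000025.jpg",
    "COCO_train2014_000000000030.jpg",
]

print(split_counts(eval_names))
print(leaked_names(eval_names, train_names))
```

With real data, you would read both lists from the files that define the evaluation set and the training set, then treat any non-empty result from `leaked_names` as a sign of leakage.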
