Dataset information #4

jaeseokbyun · 2023-03-15T08:57:54Z

Hello, first of all, thank you for sharing great work!

I try to use your work, but I have some uncertainties in dataset.
In your Dataset.md, you point out that 4M dataset is cleaned followed by BLIP.

Does it mean that your 4M dataset is filtered and synthetically generated as BLIP did? (
Moreover, In Table 2 and 6, it seems that PTP-BLIP scores are different.
What is the difference between these two scores?

Thank you

FingerRec · 2023-03-22T06:28:39Z

Hi jaeseokbyun:

Thanks for you attention in our work and carefully check!

Yes, the dataset corpus is from OSCAR and BLIP.
I update the dataset.md, reference for this file for more details.

Thanks for point out the problem in Table6, previous we only use coco image for one times during pertaining.
Later we follow OSCAR, ViLT and use each coco image for five times (each image 5 captions) during pre-training which outperform former significantly. We have alignment this table in camera ready version.

Thanks again.

jaeseokbyun · 2023-04-24T09:57:27Z

Thanks for sharing the code and pre-training corpus!
In DATASET.md, there are two corpuses

If I want to reproduce your result in the paper,
could I just use the 4M dataset corpus (not 2M dataset) without changing any other corresponding codes or dataset?
(Except path for dataset)

Thanks,

FingerRec closed this as completed Mar 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dataset information #4

Dataset information #4

jaeseokbyun commented Mar 15, 2023

FingerRec commented Mar 22, 2023

jaeseokbyun commented Apr 24, 2023

Dataset information #4

Dataset information #4

Comments

jaeseokbyun commented Mar 15, 2023

FingerRec commented Mar 22, 2023

jaeseokbyun commented Apr 24, 2023