
about evaluation #38

Closed
SUDA-HLT-ywfang opened this issue Oct 20, 2021 · 11 comments

Comments

@SUDA-HLT-ywfang

Hi,
How did you get the "26.0" FID on MSCOCO using DM-GAN? The official result reported in https://github.com/MinfengZhu/DM-GAN is 26.55.
I ran DM-GAN myself and got a similar result (26.54), not "26.0".

@Sleepychord
Contributor

We actually follow the evaluation protocol of DALL-E. Since the 30,000 captions are sampled at random, I think the difference is normal; it is possible that DM-GAN performs better on our sampled subset. Maybe I should change the reported number to the official one. Thank you.

@SUDA-HLT-ywfang
Author

SUDA-HLT-ywfang commented Oct 20, 2021

Thank you for your quick reply.
Could you please share more details about the sampling procedure? For example, MSCOCO val has about 5 captions per image, so do you sample 30,000 from the 5 × 40,504 captions?

@Sleepychord
Contributor

@FrankCast1e Hi, after comparing details with previous works, we found that our sampling is slightly different from theirs, but it should be equivalent from the standpoint of evaluation:
we remove the duplicated captions from COCO, but sample from the merged set of train and validation (~120,000). Since CogView is never trained on COCO and the two sets are split at random, the expectation should be the same.
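
For concreteness, a minimal sketch of that sampling as I understand it from this comment (illustrative only, not the actual CogView evaluation script; the file names, seed, and helper name are assumptions, and the dedup granularity follows the description above):

```python
import json
import random

# Illustrative sketch of the sampling described above, NOT the actual
# CogView evaluation code. Assumes standard COCO captions_*.json files.
def sample_captions(train_json, val_json, n=30000, seed=0):
    captions = []
    for path in (train_json, val_json):        # merge train + validation
        with open(path) as f:
            captions += [a["caption"].strip()
                         for a in json.load(f)["annotations"]]
    unique = sorted(set(captions))             # remove duplicated captions
    random.Random(seed).shuffle(unique)
    return unique[:n]                          # sample 30,000 captions

prompts = sample_captions("captions_train2014.json", "captions_val2014.json")
```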

@SUDA-HLT-ywfang
Author

SUDA-HLT-ywfang commented Oct 21, 2021

1. So the sampling process you describe should be as follows:
   a. mix all the captions from the MSCOCO train and validation sets
   b. remove the duplicated ones
   c. sample 30,000 captions
   Am I correct?
2. In that case, the sample may contain multiple captions that belong to the same image. Am I correct?
3. For FID-(1,2,4,8), one set is the 30,000 blurred images that the model generates; what is the other set? (train + val) or val only?

@Sleepychord
Contributor

@FrankCast1e

1. Yes.
2. No. After the removal there will not be duplicated ones.
3. I think you misunderstand the evaluation process: the blurred images are the blurred original images, and the other set is the generated images. Their captions are the same.

@SUDA-HLT-ywfang
Author

Sorry, I'm confused.
2. For example, suppose there are two images, A and B. A has 3 captions (no.1, no.2, no.3) and B has 3 captions (no.4, no.5, no.6). If we sample two captions from the 6 (3 + 3) captions, the sampled set may be (no.1, no.2), which both belong to the same image A.
3. Following DALL-E, shouldn't the generated images also have a Gaussian filter applied?

@Sleepychord
Contributor

@FrankCast1e Hi,
2. As discussed above, we removed the duplicated images, which means only no.1 & no.4 remain.
3. Yes, the generated samples are blurred too; sorry for forgetting to mention it in the last reply.
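
To make the protocol explicit, a minimal sketch of the FID-k setup as described in this thread (illustrative; it assumes PIL for the blurring and the pytorch-fid package for the metric, neither of which is confirmed to be what the authors actually used):

```python
from pathlib import Path
from PIL import Image, ImageFilter

# Illustrative sketch of FID-k: blur BOTH the original and the generated
# images with a Gaussian filter of radius k, then compute FID between the
# two blurred sets with any standard implementation.
def blur_folder(src_dir, dst_dir, radius):
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    for p in Path(src_dir).glob("*.png"):      # assumes .png images
        img = Image.open(p).convert("RGB")
        img.filter(ImageFilter.GaussianBlur(radius)).save(dst / p.name)

for k in (1, 2, 4, 8):                         # FID-1, FID-2, FID-4, FID-8
    blur_folder("originals", f"originals_blur{k}", k)
    blur_folder("generated", f"generated_blur{k}", k)
    # then e.g.: python -m pytorch_fid originals_blur1 generated_blur1
```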

@SUDA-HLT-ywfang
Author

SUDA-HLT-ywfang commented Oct 25, 2021

Hi, thanks a lot.
But I still can't reproduce the DM-GAN results reported in your paper, and I don't know what I'm doing wrong. Could you please share your DM-GAN test code and the sampled data?

@Sleepychord
Contributor

@FrankCast1e, you can email me at the address given in the paper.

@SUDA-HLT-ywfang
Author

OK, an email has been sent.

@SUDA-HLT-ywfang
Author

Hi, sorry to bother you again. Have you received my email? Looking forward to hearing from you.
