Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FID increases for larger training datasets #55

Closed
slala2121 opened this issue Mar 12, 2021 · 9 comments
Closed

FID increases for larger training datasets #55

slala2121 opened this issue Mar 12, 2021 · 9 comments

Comments

@slala2121
Copy link

I trained a generator using 3.8K images on 256x256 following the settings described and trained another generator using 750 images with the same settings but with regularization increased from 1 to 10 as in the paper.

While the training FID-5K score is lower when training on 3.8K vs 750 (both training FID curves are stable), I find that the the FID score computed using the generator trained on 3.8K gives much higher FID scores compared to that attained under 750.

I am using the same code to compute the FID scores so it is unclear to me why this is the case. Have you encountered this issue?

@zsyzzsoft
Copy link
Collaborator

"While the training FID-5K score is lower when training on 3.8K vs 750" What does this sentence mean? Could you give more details?

@slala2121
Copy link
Author

slala2121 commented Mar 13, 2021 via email

@zsyzzsoft
Copy link
Collaborator

zsyzzsoft commented Mar 13, 2021

This is probably caused by the inconsistency between training and testing. The FID-50k score of a model should be smaller than FID-5k of the same model. Please check whether your data is properly processed during testing.

@slala2121
Copy link
Author

slala2121 commented Mar 13, 2021 via email

@slala2121
Copy link
Author

slala2121 commented Mar 13, 2021 via email

@zsyzzsoft
Copy link
Collaborator

zsyzzsoft commented Mar 13, 2021

Most likely due to image preprocessing.

@zsyzzsoft
Copy link
Collaborator

Another guess - maybe you computed the FID statistics of real images with 750 images, but when you evaluate the model trained with 3.8k images, it still uses the cached statictics with 750 images. Maybe you can clear the cached files or rename the dataset, and then re-evaluate the model trained with 3.8k images.

@slala2121
Copy link
Author

slala2121 commented Mar 14, 2021 via email

@slala2121
Copy link
Author

I made some fixes so now the results match. Thanks for your thoughts!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants