Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reproducing FID #15

Closed
Godkimchiy opened this issue Sep 30, 2022 · 4 comments
Closed

Reproducing FID #15

Godkimchiy opened this issue Sep 30, 2022 · 4 comments

Comments

@Godkimchiy
Copy link

Hi authors,

Thanks for sharing this great work:)
I cannot reproduce 6.11(t=.9) FID on FFHQ in Table 2.
I used pretrained VQGAN and trained Absorbing diffusion sampler for 2M steps
under the same conditons, but the FID was only 8.10.

Is there sth important I need to check?

@peterhessey
Copy link
Collaborator

Hi, we're glad you're managing to successfully use the code!

How many samples are you using to evaluate the FID (--n_samples flag?) I believe we used 50k for those experiments so if you're not using that many that will definitely give an improved FID

@Godkimchiy
Copy link
Author

Thank you for reply!
I generated 50,000 samples following README.
Although I still didn't get the reported value, I found that using the trained model
for shorter steps than 2M shows better FID.

and I have one more question.
Is the pretrained VQGAN model you provided trained the same as the one provided by Taming Transformers?

@peterhessey
Copy link
Collaborator

peterhessey commented Oct 10, 2022

Interesting 🤔 What value are you getting for the <2M steps training? We'll take a look into our training logs to get the exact details of the training run used for our 6.11 value.

Is the pretrained VQGAN model you provided trained the same as the one provided by Taming Transformers?

Nope! We trained our own VQGANs. The architecture is almost identical to Taming Transformers' but adapted to allow us to use DiffAug and control other aspects of the model :)

@Godkimchiy
Copy link
Author

The only difference is that training was conducted at NVIDIA GeForce RTX 3090, and
other than that, the same commands in README.md were used.

Thanks a lot :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants