Round operation for discrete models #2

SANCHES-Pedro · 2021-05-06T10:18:10Z

Hello,

Firstly, congratulations on the amazing work. The ICLR award was well deserved!

I don't want to be pedantic but I realized that the get_score_fn for discrete models doesn't have a torch.round() operation even though the t at training time is an int. Therefore, the sampling is being done with slightly different values than the training (e.g. 500.1 instead of 500). I'm not sure if this really affects performance, it's just an observation.

I would add labels = torch.round(labels) after line 155 of the models/utils.py file.

Many thanks,
Pedro

The text was updated successfully, but these errors were encountered:

yang-song · 2021-05-09T06:48:09Z

Thanks for the comment! You are right it is better to add an additional torch.round operation for discrete models, though I don't think results would change much. The current code has an additional benefit: you can use the continuous SDE framework even for models pre-trained with discrete losses (such as DDPM and NCSN models provided by previous work), which allows you to compute log-likelihoods, for example.

SANCHES-Pedro · 2021-05-11T09:24:29Z

I see, I hadn't understood the motivation of not adding that. Thanks for clarifying!

SANCHES-Pedro closed this as completed May 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Round operation for discrete models #2

Round operation for discrete models #2

SANCHES-Pedro commented May 6, 2021

yang-song commented May 9, 2021

SANCHES-Pedro commented May 11, 2021

Round operation for discrete models #2

Round operation for discrete models #2

Comments

SANCHES-Pedro commented May 6, 2021

yang-song commented May 9, 2021

SANCHES-Pedro commented May 11, 2021