About the consistency loss #2

yakexee · 2021-08-18T10:33:06Z

Hi, thanks for sharing your code. For the function F.kl_div(), the first parameter is input and the second is target. I am confused why the target is not p_mixture on L401-L403?
Thanks.

wildphoton · 2021-08-23T05:45:49Z

Hi @yakexee, I think F.kl_div() defines inputs and targets in the way how NLL loss is used. Since NLL(input, target) = cross_entropy(target, input). I believe F.kl_div(input, target) = KL(target || input). You can check this with an easy example by comparing the results with a KL div function implemented by yourself. Since we are computing KL(p_aug || p_mixture), we called F.kl_div(p_mixture, p_aug). FYI, this loss is adapted from the Augmix's implementation. I hope it helps

yakexee · 2021-08-24T02:17:52Z

Thanks for the kind reply. That helps!

yakexee closed this as completed Aug 24, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About the consistency loss #2

About the consistency loss #2

yakexee commented Aug 18, 2021 •

edited

Loading

wildphoton commented Aug 23, 2021 •

edited

Loading

yakexee commented Aug 24, 2021

About the consistency loss #2

About the consistency loss #2

Comments

yakexee commented Aug 18, 2021 • edited Loading

wildphoton commented Aug 23, 2021 • edited Loading

yakexee commented Aug 24, 2021

yakexee commented Aug 18, 2021 •

edited

Loading

wildphoton commented Aug 23, 2021 •

edited

Loading