Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mse loss is way much higher than the kl loss in eq (3), how do you deal with the imbalance? #3

Open
kailigo opened this issue Oct 8, 2021 · 1 comment

Comments

@kailigo
Copy link

kailigo commented Oct 8, 2021

Hi I am trying to reimplement the code. My current implementation shows that
in eq (3), the mse loss is way much higher than the kl loss in eq (3) (with 4 magnitude difference)
And you use the same weight for the two losses, which results in nan values for other losses, as the mse loss dominant the overall learning objective.
Thanks for the answer in advance.

@lryta
Copy link
Owner

lryta commented Oct 20, 2021

The camera-ready version has changed both losses to kld losses. MSE loss is basically an implementation error that was found too late which got into the first submitted version.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants