You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi I am trying to reimplement the code. My current implementation shows that
in eq (3), the mse loss is way much higher than the kl loss in eq (3) (with 4 magnitude difference)
And you use the same weight for the two losses, which results in nan values for other losses, as the mse loss dominant the overall learning objective.
Thanks for the answer in advance.
The text was updated successfully, but these errors were encountered:
The camera-ready version has changed both losses to kld losses. MSE loss is basically an implementation error that was found too late which got into the first submitted version.
Hi I am trying to reimplement the code. My current implementation shows that
in eq (3), the mse loss is way much higher than the kl loss in eq (3) (with 4 magnitude difference)
And you use the same weight for the two losses, which results in nan values for other losses, as the mse loss dominant the overall learning objective.
Thanks for the answer in advance.
The text was updated successfully, but these errors were encountered: