I ran the baseline model with the default settings on CIFAR-10, but it does not seem to converge: the loss is huge, around 6600000. Is this normal, or is it only happening to me? If it is normal, why? As far as I know, traditional Langevin dynamics can converge easily within a limited number of steps, such as 50. I'm curious why the author set the number of steps in this sampling process to 1000, with a relatively small learning rate.
Thanks for your excellent work. The anneal model converges well, by the way. I am just trying to figure out why this method works. Could you please explain?
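To make the question about step count and step size concrete, here is a minimal sketch of plain (unadjusted) Langevin dynamics on a toy 1D standard Gaussian, whose score is known in closed form. It is not the repository's sampler or its CIFAR-10 setup: the names `langevin_sample` and `gaussian_score`, the step sizes, and the starting point are all assumptions chosen for illustration.

```python
import numpy as np

def langevin_sample(score_fn, x0, step_size, n_steps, rng):
    """Run unadjusted Langevin dynamics on a batch of independent chains.

    Update rule: x <- x + (step_size / 2) * score(x) + sqrt(step_size) * z,
    with z drawn from a standard normal at every step.
    """
    x = np.array(x0, dtype=float)
    for _ in range(n_steps):
        noise = rng.standard_normal(x.shape)
        x = x + 0.5 * step_size * score_fn(x) + np.sqrt(step_size) * noise
    return x

def gaussian_score(x):
    # Score (gradient of the log density) of a standard normal N(0, 1).
    return -x

rng = np.random.default_rng(0)
x0 = np.full(5000, 10.0)  # all chains start far from the mode (toy assumption)

# Small step size with few steps, small step size with many steps,
# and a larger step size with few steps.
short_small = langevin_sample(gaussian_score, x0, 0.01, 50, rng)
long_small = langevin_sample(gaussian_score, x0, 0.01, 1000, rng)
short_large = langevin_sample(gaussian_score, x0, 0.2, 50, rng)

print("step=0.01, 50 steps:   mean", short_small.mean())
print("step=0.01, 1000 steps: mean", long_small.mean())
print("step=0.2,  50 steps:   mean", short_large.mean())
```

In this toy case the mean of the chains shrinks by a factor of `1 - step_size / 2` per step, so a small step size needs many steps to move away from a distant starting point, while a larger step size gets there in about 50. Whether the same trade-off explains the 1000-step setting in the baseline is a question for the author.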