-
Notifications
You must be signed in to change notification settings - Fork 256
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Surprising results with no convergence #25
Comments
Thanks @dataislife - mind emailing me a description of the model and sentence embedding data you are using? This is an interesting application and probably easiest over email :) |
Hi, did you ever resolve this? I am facing a similar issue and cannot get the it to converge. |
Thanks @garenkwan - can you email me a description of the problem + data? I'm at j@jaan.io. (This is not designed for training on sentence embeddings, but I can try to help!) |
Thanks for the comment @altosaar . I realized that my inputs and sentence embeddings use both positive and negative values in the input, however, I overlooked that fact that you have designed this for working on images which have inputs that are always positive and between 0 and 1. To avoid the loss from blowing up, I rescaled the input values to be between 0 and 1 and it worked great! |
Hi there,
I am training this VAE on top of an input space of size 2048 (Sentence embedding space), I am trying to tweak the parameters in order to reconstruct the input space correctly but I cannot make it converge.
With config as follows:
It have the following results:
I thought about reducing the learning rate, changing the size of the latent space (I don't know whether increasing it or decreasing it is better in that case where input_dim = 2048), changing batch size, but nothing seems to be conclusive.
Also, why is the log-likelihood getting such high values? (p(x) should be in [0,1] ...)
Any idea on this matter? :)
Thanks for the great work.
The text was updated successfully, but these errors were encountered: