New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Doubt regarding the Hamiltonian calculations for RHVAE model. #10
Comments
Hi @shikhar2333 , Thank you for opening this issue. Actually, the Hamiltonian you are referring to is the one used to sample from the target distribution given by the inverse of the metric volume element post training and so it is used in the I hope this helps. |
Thanks for your response. |
Oops, sorry for that, this one should work https://arxiv.org/abs/2010.11518 |
How do we ensure that \rho gets sampled from N(0, G(z))?
In the code I saw this particular initialization for rho. Is this done because L is the cholesky decomposition for the matrix G? |
We use the Cholesky decomposition of the metric G = LL^T allowing to sample from multivariate Gaussian distributions. Note that G(z) is a symmetric definite positive matrix |
So are M and G equivalent, since M is also defined as LL^T. |
Is E[P(x|z)] (where z~q(z|x) and E denotes the expectation value) approximated by MSE/BCE loss in the implementation? |
Yes exactly. The MSE (resp. BCE loss) actually represents the opposite log-likelihood of a constant variance Gaussian distribution (resp. Bernoulli distribution) that are usually chosen for p(x|z) depending on the modeling of your data. |
Thanks for the confirmation. |
You're welcome :). Not really, you can down-sample a bit your input data to deal with memory issues for instance but, in theory, the VAE can handle any kind of data regardless of the dimension. If you are referring to the paper, we only down-sampled the data by a factor 2 in each dimension when dealing with 3D MRIs. |
What's the intuition for updating the metric at the end of each epoch? |
The updated metric is needed during validation. |
In the paper, Hamiltonian is defined as follows:
But in the code, I see extra terms like addition of a joint probability term and a G inverse multiplied in the term for kinetic energy.
Are these 2 equations equivalent?
The text was updated successfully, but these errors were encountered: