Feature/nitpicks #35
base: master
Conversation
* Use `logging.getLogger` to get a logger to write to instead of calling `logging` directly. Also set the matplotlib logger to only print ERROR-level messages, so we don't get extra output when plotting (see the sketch after this list).
* Create a generic function to plot one or more images using a specified number of columns. This allows the user to easily plot more samples.
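A minimal sketch of the logging change from the first bullet; the calls are all standard library, but the exact placement in the notebook is an assumption:

```python
import logging

logging.basicConfig(level=logging.INFO)

# Module-level logger instead of calling logging.info() etc. on the root logger.
logger = logging.getLogger(__name__)

# Silence matplotlib's chatty DEBUG/INFO messages during plotting;
# only ERROR and above will be shown.
logging.getLogger("matplotlib").setLevel(logging.ERROR)

logger.info("training started")  # goes through the named logger
```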
Passing in the class instead of a string makes `construct_vae` generic enough that it doesn't need changes when you want to play with different implementations.
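As a sketch of what that looks like (the `construct_vae` signature and the stub classes here are assumptions, not the notebook's actual code):

```python
from typing import Type

class Generator: ...
class InferenceNetwork: ...

class GaussianInferenceNetwork(InferenceNetwork):
    def __init__(self, latent_dim: int) -> None:
        self.latent_dim = latent_dim

class VAE:
    def __init__(self, generator: Generator, inference_net: InferenceNetwork) -> None:
        self.generator = generator
        self.inference_net = inference_net

def construct_vae(inference_net_cls: Type[InferenceNetwork], **kwargs) -> VAE:
    # Passing the class keeps construct_vae generic: no string-to-class
    # mapping to update when a new inference network is added.
    return VAE(generator=Generator(), inference_net=inference_net_cls(**kwargs))

vae = construct_vae(GaussianInferenceNetwork, latent_dim=32)
```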
The `kl_divergence` parameter in the `GaussianVAE` was never used, and it seems `diagonal_gaussian_kl` should be private to `GaussianInferenceNetwork`, so remove it.
This is a nice clean-up of the notebook. Thanks for doing this. I only have one remark on the changes, which I left inline. Sorry for taking so long to get back to you.
@@ -525,11 +549,8 @@
 "\n",
 " def __init__(self,\n",
 " generator: Generator,\n",
 " inference_net: GaussianInferenceNetwork,\n",
-" kl_divergence: Callable) -> None:\n",
Why are you removing the KL divergence? It is needed to properly construct the ELBO.
I have thought about this a lot before, and one alternative is to rewrite the ELBO as E_q[log p(x, z)] + H(q(z)). Then we could provide the entropy as a method of the inference net. However, it's out of whack with the slides in that case. That's why, for the time being, I would just pass in the `kl_divergence` as an argument.
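For reference, the identity behind that rewrite (standard, not specific to this repo):

$$
\mathrm{ELBO}
= \mathbb{E}_{q(z)}[\log p(x \mid z)] - \mathrm{KL}\big(q(z) \,\|\, p(z)\big)
= \mathbb{E}_{q(z)}[\log p(x, z)] + \mathrm{H}\big(q(z)\big),
$$

which follows because $\mathrm{KL}(q \,\|\, p) = \mathbb{E}_q[\log q(z)] - \mathbb{E}_q[\log p(z)]$ and $\mathrm{H}(q) = -\mathbb{E}_q[\log q(z)]$, so the prior term folds into the joint $p(x, z) = p(x \mid z)\, p(z)$.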
I removed it because it wasn't actually being used (as far as I can see): the `GaussianInferenceNetwork` calls `diagonal_gaussian_kl()` directly. I've been implementing inference networks with different distributions with Wilker, and those require different ways of calculating the KL divergence. It seemed to me that the KL divergence is conceptually tied more to the inference network than to the VAE as a whole, which is why I opted to remove the parameter rather than change `GaussianInferenceNetwork` to stop hardcoding the function.
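A hedged sketch of that design, with the KL as a method on the inference network (`diagonal_gaussian_kl` and `GaussianInferenceNetwork` are from the PR; the `kl` method name and the PyTorch details are hypothetical):

```python
import torch

def diagonal_gaussian_kl(mean: torch.Tensor, log_var: torch.Tensor) -> torch.Tensor:
    # KL( N(mean, diag(exp(log_var))) || N(0, I) ), summed over latent dims.
    return 0.5 * torch.sum(log_var.exp() + mean ** 2 - 1.0 - log_var, dim=-1)

class GaussianInferenceNetwork:
    # ... parameters that produce mean and log_var from x ...

    def kl(self, mean: torch.Tensor, log_var: torch.Tensor) -> torch.Tensor:
        # Each inference network knows its own KL; a posterior from another
        # family would override this method with its own formula, so the
        # VAE never needs a kl_divergence argument.
        return diagonal_gaussian_kl(mean, log_var)
```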
Some bits I changed while trying to figure things out and implement a PyTorch version. They mostly consist of a flexible `plot_images` function that lets you display an arbitrary number of images.
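The function itself isn't quoted in this thread, so this is only a sketch of what such a `plot_images` helper might look like (the signature and grid layout are assumptions):

```python
import math
import matplotlib.pyplot as plt
import numpy as np

def plot_images(images: np.ndarray, n_cols: int = 4) -> None:
    """Plot any number of (H, W) grayscale images on a grid with n_cols columns."""
    n_rows = math.ceil(len(images) / n_cols)
    fig, axes = plt.subplots(n_rows, n_cols, figsize=(2 * n_cols, 2 * n_rows))
    axes = np.atleast_1d(axes).ravel()
    for ax, image in zip(axes, images):
        ax.imshow(image, cmap="gray")
    for ax in axes:
        ax.axis("off")  # hide ticks, including on unused grid cells
    plt.show()

# e.g. plot_images(samples[:10], n_cols=5)
```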