Backpropagation through the Void: Optimizing control variates for black-box gradient estimation

by Will Grathwohl, Dami Choi, Yuhuai Wu, Geoffrey Roeder, David Duvenaud

We introduce a general framework for learning low-variance, unbiased gradient estimators for black-box functions of random variables, based on gradients of a learned function. These estimators can be jointly trained with model parameters or policies, and are applicable in both discrete and continuous settings. We give unbiased, adaptive analogs of state-of-the-art reinforcement learning methods such as advantage actor-critic. We also demonstrate this framework for training discrete latent-variable models.

Code for VAE Experiments lives here. The Discrete RL experiments can be found at: https://github.com/wgrathwohl/BackpropThroughTheVoidRL.

A simplified, pure-python implementation is in /relax-autograd/relax.py

If you have any questions about the code or paper please contact Will Grathwohl (wgrathwohl@cs.toronto.edu). The code is in "research-state" at the moment and I will be updating it periodically. If you have questions feel free to email me and I will do my best to respond. -Will

Name		Name	Last commit message	Last commit date
Latest commit History 363 Commits
paper		paper
rebar_baseline		rebar_baseline
relax-autograd		relax-autograd
.gitignore		.gitignore
10k_mnist_vae_grad_samples.pkl		10k_mnist_vae_grad_samples.pkl
README.md		README.md
binary_vae_multilayer_per_layer.py		binary_vae_multilayer_per_layer.py
datasets.py		datasets.py
display_grads.py		display_grads.py
mnist_vae.py		mnist_vae.py
pytorch_test.py		pytorch_test.py
pytorch_toy.py		pytorch_toy.py
rebar_tf.py		rebar_tf.py
rebar_toy.py		rebar_toy.py
toy.py		toy.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

paper

paper

rebar_baseline

rebar_baseline

relax-autograd

relax-autograd

.gitignore

.gitignore

10k_mnist_vae_grad_samples.pkl

10k_mnist_vae_grad_samples.pkl

README.md

README.md

binary_vae_multilayer_per_layer.py

binary_vae_multilayer_per_layer.py

datasets.py

datasets.py

display_grads.py

display_grads.py

mnist_vae.py

mnist_vae.py

pytorch_test.py

pytorch_test.py

pytorch_toy.py

pytorch_toy.py

rebar_tf.py

rebar_tf.py

rebar_toy.py

rebar_toy.py

toy.py

toy.py

Repository files navigation

Backpropagation through the Void: Optimizing control variates for black-box gradient estimation

About

Releases

Packages

Contributors 6

Languages

duvenaud/relax

Folders and files

Latest commit

History

Repository files navigation

Backpropagation through the Void: Optimizing control variates for black-box gradient estimation

About

Resources

Stars

Watchers

Forks

Languages