vectorize calls to distribution log densities #47

dustinvtran · 2016-03-12T00:37:18Z

Consider a B x d array of zs, where a row corresponds to one sample of a d-dimensional latent variable, and we have a mini-batch of size B.

Univariate Distributions
For mean-field methods, we'd like to do something like call bernoulli.logpmf(zs[:, i], p), where p is a scalar in [0,1]. This returns a B-dimensional vector,

[ log Bernoulli(zs[1, i] | p), ..., log Bernoulli(zs[B, i] | p) ]^T

For a univariate distribution, it takes a B-dimensional input and returns a B-dimensional output.

Multivariate Distributions
Consider a d-dimensional multivariate Gaussian. We call multivariate_normal.logpdf(zs.transpose(), mu, Sigma), where mu is d-dimensional, Sigma is d x d, and it returns a B-dimensional vector

[ log Normal(zs[1, :] | mu, Sigma), ..., log Normal(zs[B, :] | mu, Sigma) ]^T

For a d-dimensional distribution, it takes a B x d matrix of inputs and returns a B-dimensional output.

SciPy does this too!

from scipy import stats

#4-d vector input, univariate normal
stats.norm.logpdf([0.0, 1.0, 1.0, 2.0], loc=0, scale=1)
## array([-0.91893853, -1.41893853, -1.41893853, -2.91893853])

#4 x 2 matrix input, 2-d normal
stats.multivariate_normal.logpdf(
    np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 2.0], [3.0, 3.0]]), 
    mean=np.zeros(2), cov=np.diag(np.ones(2)))
## array([ -1.83787707,  -2.83787707,  -5.83787707, -10.83787707])

Higher-dimensional arguments
We can also consider something like bernoulli.logpmf(zs[:, i], ps), where not only is zs[:, i] a M-dimensional vector but ps is also a M-dimensional vector (in [0,1]^d). I propose not doing this. This is bound to lead to bugs. Any time this comes up, I propose we do individual calls, bernoulli.logpmf(zs[1, i], ps[i]) and so on.

(I don't know a situation where this comes up enough that vectorizing this computation is crucial. If we notice this we can make the change. I don't think SciPy allows this either.)

The text was updated successfully, but these errors were encountered:

dustinvtran added Code cleanup new function labels Mar 12, 2016

dustinvtran assigned ido Mar 12, 2016

dustinvtran mentioned this issue Mar 12, 2016

match outputs of distribution methods to SciPy convention #46

Closed

dustinvtran unassigned ido Apr 4, 2016

dustinvtran mentioned this issue May 14, 2016

Feature/issue 47 vectorize densities #81

Merged

dustinvtran closed this as completed May 14, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vectorize calls to distribution log densities #47

vectorize calls to distribution log densities #47

dustinvtran commented Mar 12, 2016 •

edited

Loading

vectorize calls to distribution log densities #47

vectorize calls to distribution log densities #47

Comments

dustinvtran commented Mar 12, 2016 • edited Loading

dustinvtran commented Mar 12, 2016 •

edited

Loading