NamedTuple-valued random variables #289
Red-Portal started this conversation in Ideas
Hi, I think it might be interesting to consider whether we ever want to support `NamedTuple`-valued random variables. Currently, most of the inference algorithms expect vector-valued random variables. This is probably fine for MCMC algorithms running on the CPU. But for more advanced inference algorithms that require some introspection into the model, and for GPU execution, I think this interface is becoming a liability.
Variational Inference
For example, to support subsampling in variational inference, we need to deal with models that have local variables. In this case, we need to operate on only a subset of the random variables at each iteration. That is, for a model with a global variable $x$ and local variables $y_1, \ldots, y_n$, say

$$x \sim p(x), \qquad y_i \sim p(y_i \mid x), \quad i = 1, \ldots, n,$$

we wish to subsample over $i = 1, \ldots, n$. Then, for each minibatch $j$, we need to select the subvector $\left[x,\, y_j\right]$. This is not very nice to do if we flatten everything. For instance, given a flat sample $\theta$, one way to do this would be to unflatten it into the structured representation $\left[\, x, \, y_1, \, \ldots, \, y_n \right]$, select the variables $\left[\, x, \, y_j \right]$, and then flatten back.
If the random variables $\theta$ were already structured rather than flattened, we could skip the redundant flatten/unflatten round trip entirely.
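As a minimal sketch of what the structured path could look like (all variable names here are hypothetical):

```julia
# Structured parameters: global x, locals y_i for i = 1..n.
n   = 10
θ   = (x = randn(2), y = [randn(3) for _ in 1:n])
idx = [2, 5, 7]                        # indices subsampled at this step

# With NamedTuple-valued RVs, selecting a minibatch is plain indexing:
θ_batch = (x = θ.x, y = θ.y[idx])

# With a flat representation, the same selection needs a round trip:
θ_flat = vcat(θ.x, reduce(vcat, θ.y))  # flatten
# ... unflatten θ_flat, select [x, y_j], and flatten back.
```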
Subsampling MCMC
The story is similar for implementing subsampling-based MCMC algorithms.
RJMCMC
This also concerns certain MCMC algorithms that need to inspect the structure of the parameters, such as RJMCMC. For example, if we know the name of the model-indicator random variables, the algorithm can simply deduce the model order from the length of these variables and do business as usual. However, if the random variables are vector-valued, we need additional machinery to do this. There is also the problem of flattening and unflattening parameters of varying dimensionality.
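As a rough sketch (all names hypothetical) of reading the model order directly off a structured parameter:

```julia
# A variable-order model whose coefficient block grows and shrinks
# with the model order (names hypothetical).
θ = (coeffs = randn(3), σ = 1.0)

# With structured RVs, the order is just the length of the named block:
model_order = length(θ.coeffs)   # == 3

# With flat vectors, the sampler needs external bookkeeping to know
# which entries belong to `coeffs`, and that bookkeeping changes every
# time a trans-dimensional move changes the order.
```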
GPU Support
There is also a concern for supporting GPUs. As we all know, Bayesian models often involve lots of scalar-valued hyperparameters. Since evaluating the likelihood involves flattening and unflattening, we face lots of scalar indexing, which doesn't play nice with Julia's GPU ecosystem. With `NamedTuple`-valued RVs, however, we can probably mix the storage types of the different random variables and leave some things on the CPU when appropriate.
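As a sketch of the mixed-storage idea (assuming CUDA.jl; all names hypothetical):

```julia
using CUDA

# Scalar hyperparameters stay on the CPU; only the large latent block
# lives on the GPU, so evaluation never scalar-indexes a CuArray.
θ = (
    σ = 1.0f0,                          # scalar hyperparameter, CPU
    z = CUDA.randn(Float32, 10_000),    # large latent vector, GPU
)

# Cheap scalar work happens on the CPU; bulk reductions stay on the GPU.
logp = -sum(abs2, θ.z) / (2 * θ.σ^2) - θ.σ^2 / 2
```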
What Needs to Change?
- `LogDensityProblems`: probably doesn't need to change, except for `with_gradient` and `with_hessian`. We could also make `LogDensityProblems.dimension` return a `NamedTuple` of dimensions (see the sketch after this list).
- `Bijectors`: new interfaces for `Stacked` and `TransformedDistribution` will have to be added.
- `DynamicPPL`: probably an additional specialization for `LogDensityProblems`?
- `StatsBase`: maybe `rand` and `logpdf`?
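To make the first item concrete, here is a very rough sketch of the kind of method additions this could imply; the `DemoProblem` type and its log density are made up for illustration, and only the `LogDensityProblems` function names are real:

```julia
using LogDensityProblems

# Toy problem: a global x (length 2) and n locals y_i (length 3 each).
struct DemoProblem
    n::Int
end

# Hypothetical extension: evaluate at a NamedTuple instead of a flat vector.
function LogDensityProblems.logdensity(p::DemoProblem, θ::NamedTuple)
    return -sum(abs2, θ.x) / 2 - sum(y -> sum(abs2, y), θ.y) / 2
end

# Proposed change: `dimension` returning a NamedTuple of dimensions.
LogDensityProblems.dimension(p::DemoProblem) = (x = 2, y = fill(3, p.n))
```

I would love to hear your thoughts on this!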