RFC: how to handle implicit quantities associated with coordinates #94

tpapp · 2022-09-05T12:19:23Z

Motivation

Suppose that for a set of parameters $x$, the equation $F(x, y) = 0$ defines $y(x)$ implicitly. Eg $x$ could be parameters to a problem that we approximate numerically, and $y$ the parameters of an approximation we obtain numerically (rootfinding etc). Given data $d$, the likelihood is defined as $\ell(d \mid x, y)$.

Theoretically, one could of course solve for the $y$ that belongs to each $x$. But this may be expensive and brittle, and if

$$ x_2 = x_1 + \Delta $$

then

$$ \hat{y}_2 = y_2 + \frac{\partial y}{\partial x} \Delta $$

would be a good initial guess for $y_2 = y(x_2)$.

Ideally, "users" like Turing.jl and DynamicHMC.jl should be able to ignore the details of these things and just carry on doing HMC/NUTS/etc with minimal changes.

Proposal: allow coordinates to be opaque

I propose an addition to the API composed of 3 functions, with the fallbacks

lift(ℓ, x::AbstractVector) = x
unlift(ℓ, x::AbstractVector) = x
translate(ℓ, x::AbstractVector, Δ::AbstractVector) = x .+ Δ

Specifically,

"users" would call lift when generating random points for starting MCs, and in similar situations. Otherwise they would use translate,
similarly, unlift would be called when coordinates are needed (eg turn statistics),
leapfrog and RWMH steps would use translate.
otherwise the result of lift and the x arguments of logdensity, logdensity_and_gradient, translate, unlift are allowed to be opaque objects, not an ::AbstractVector of real numbers. Nevertheless, logdensity_and_gradient should provide a valid gradient of x -> logdensity(ℓ, lift(ℓ, x)), but how that is done is up to the implementation of ℓ.

Bikeshedding names is appreciated 😉, also alternative API suggestions.

How this meshes with AD

This is a bit tricky and I don't yet have a good API in mind. Related work is in

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: how to handle implicit quantities associated with coordinates #94

RFC: how to handle implicit quantities associated with coordinates #94

tpapp commented Sep 5, 2022

RFC: how to handle implicit quantities associated with coordinates #94

RFC: how to handle implicit quantities associated with coordinates #94

Comments

tpapp commented Sep 5, 2022

Motivation

Proposal: allow coordinates to be opaque

How this meshes with AD