
Add Stable distribution with numerically integrated log-probability calculation (StableWithLogProb). #3369

Merged — 24 commits, May 28, 2024

Conversation

@BenZickel (Contributor) commented May 20, 2024

This fixes #3280 by adding pyro.distributions.StableWithLogProb, which is based on pyro.distributions.Stable with an additional log_prob method (I opted not to modify the pyro.distributions.Stable distribution at this stage).

The code combines #3280 (comment) by @mawright with the existing Pyro Stable distribution code base, with the following modifications:

  • Make the calculation numerically stable at and near an alpha value of one, and at values at and near zero.
  • Eliminate dependency on the torchquad package.
  • Cache integration points (not sure whether torchquad does this, but overall speed is 25% faster than the reference implementation based on torchquad).
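The cached-integration-points idea can be sketched in isolation (a minimal stdlib illustration with hypothetical names, not the actual Pyro implementation): a memoized function returns a fixed quadrature grid, so repeated log-probability evaluations reuse the same nodes and weights instead of rebuilding them.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def integration_points(num_points):
    # Midpoint-rule nodes and weights on [0, 1]; lru_cache means repeated
    # calls with the same grid size reuse the same points instead of
    # recomputing them. (Hypothetical helper, not the actual Pyro code.)
    h = 1.0 / num_points
    nodes = tuple((i + 0.5) * h for i in range(num_points))
    weights = tuple(h for _ in range(num_points))
    return nodes, weights

def integrate(f, num_points=1000):
    # Approximate \int_0^1 f(u) du on the cached grid.
    nodes, weights = integration_points(num_points)
    return sum(w * f(u) for u, w in zip(nodes, weights))
```

After the first call, every further `integrate` with the same `num_points` is a cache hit on the grid, which is where the reported speedup over rebuilding the grid each iteration would come from.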

Per-iteration duration is about 5 times longer than with reparameterization, but overall convergence is much faster and covers cases that do not converge with reparameterization (such as estimation of the skew parameter beta).

The log-probability calculation is based on integration over a uniformly distributed random variable $u$ such that $P(x) = \int du\, P(x|u) P(u)$. The integral can be converted to a reparameterization in which we first sample $u$ with probability density $P(u)$, or $g(u)$ when approximating the posterior distribution by a guide, and then sample or observe $x$ from the distribution $P(x|u)$. Initial tests indicate this reparameterization works but is still slower than estimating the log-probability by integration.
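The marginalization above can be sketched with a toy conditional (a Gaussian stand-in for $P(x|u)$ rather than the actual stable conditional density; all names here are made up for illustration): with $u \sim \mathrm{Uniform}(0, 1)$, the integral $P(x) = \int_0^1 P(x|u)\,du$ is approximated on a fixed grid, combining terms with log-sum-exp so the result stays finite even when every $P(x|u)$ underflows.

```python
import math

def log_normal_pdf(x, mean, sd):
    # Log density of Normal(mean, sd) at x; stands in for log P(x|u).
    return -0.5 * math.log(2.0 * math.pi * sd * sd) - (x - mean) ** 2 / (2.0 * sd * sd)

def log_prob_marginal(x, num_points=200):
    # log P(x) = log \int_0^1 P(x|u) du  (u ~ Uniform(0, 1), so P(u) = 1),
    # approximated with a midpoint rule and combined via log-sum-exp so the
    # log-probability is computed without ever exponentiating tiny terms alone.
    h = 1.0 / num_points
    logs = [log_normal_pdf(x, (i + 0.5) * h, 1.0) + math.log(h)
            for i in range(num_points)]
    m = max(logs)
    return m + math.log(sum(math.exp(v - m) for v in logs))
```

For this toy conditional the exact marginal is $\Phi(x) - \Phi(x-1)$, which makes a handy sanity check; the real stable conditional density is more involved, but the quadrature-plus-log-sum-exp structure is the same.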

A usage example with real-life data has been added to the last section of the Stable distribution tutorial.

@fritzo (Member) left a comment


It's great to see this implemented!

I don't trust my review of the math, but I would trust some sort of density test. One option is to use goftests.density_goodness_of_fit, something like:

import goftests
import pytest
from pyro.distributions import StableWithLogProb

@pytest.mark.parametrize(...)
def test_density(stability, skew, loc, scale):
    d = StableWithLogProb(stability, skew, loc, scale)
    samples = d.sample((1000,))  # sample_shape must be a tuple
    probs = d.log_prob(samples).exp()
    gof = goftests.density_goodness_of_fit(samples, probs)
    assert gof > 1e-2  # goodness-of-fit p-value should not be tiny

Another option is to check against a reference implementation, say something in scipy. WDYT?
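A stdlib-only sketch of the reference-check idea (function names here are made up; a real test could instead compare against scipy.stats.levy_stable.logpdf): for $\alpha=1$, $\beta=0$ the stable distribution is the standard Cauchy, so a density obtained by numerically inverting the characteristic function can be compared with the closed form.

```python
import math

def cauchy_pdf_via_cf(x, t_max=40.0, num_points=20000):
    # Stable(alpha=1, beta=0) is the standard Cauchy, with characteristic
    # function exp(-|t|). Invert it:
    #   f(x) = (1/pi) * \int_0^inf exp(-t) * cos(t * x) dt
    # using a midpoint rule on a truncated interval [0, t_max].
    h = t_max / num_points
    total = 0.0
    for i in range(num_points):
        t = (i + 0.5) * h
        total += math.exp(-t) * math.cos(t * x) * h
    return total / math.pi

def cauchy_pdf_exact(x):
    # Closed-form standard Cauchy density, the reference to check against.
    return 1.0 / (math.pi * (1.0 + x * x))
```

The same comparison against an independent reference (scipy, or a closed form where one exists) is what gives confidence that the integrated log-probability is correct, independent of the sampling-based goodness-of-fit test.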

(btw thanks for your patience!)

beta = self.skew.double()
value = value.double()

return _stable_log_prob(alpha, beta, value, self.coords) - self.scale.log()
A Member left a comment

I think we'll want to convert the result of _stable_log_prob() back to value.dtype, right? Something like:

logp = _stable_log_prob(alpha, beta, value, self.coords)
return logp.to(dtype=value.dtype) - self.scale.log()


@fritzo (Member) left a comment

Looks great!

I think it would actually be cleaner to implement Stable.log_prob() rather than a separate StableWithLogProb class (but thank you for drafting a non-invasive solution!). Do you see any blockers to simply merging Stable <-> StableWithLogProb in this PR? I think the only change needed will be to update your tutorial's summary:

 ## Summary
-- [Stable.log_prob()](http://docs.pyro.ai/en/stable/distributions.html#stable) is undefined.
+- [Stable.log_prob()](http://docs.pyro.ai/en/stable/distributions.html#stable) is very expensive.

and simply omit the reparam stuff from your new section. The single Stable solution is nice in that users will at least be able to use default SVI and HMC, and the older StableReparam machinery can become an approximate cost-saving tool.

EDIT: I guess we'd need to revise some pytest.raises checks in the tests, which might be easiest by adding an internal distribution pyro.distributions.testing.fakes.StableWithoutLogProb.

BTW I am a big fan of the Lévy stable distribution, and am delighted to see Pyro improving its support for heavy-tailed inference.

@fritzo (Member) left a comment

Happy to merge as-is, but I think this .log_prob() deserves to be in the main Stable distribution. LMK if you want me to merge now, or if you want to combine Stable <-> StableWithLogProb.

@BenZickel (Contributor, Author) commented

Thanks @fritzo for the review!

I also think heavy-tailed inference is much needed and I really appreciate all the work done on this so far.

It might be better to combine StableWithLogProb and Stable, but I'd do it in a separate pull request (if at all). The advantage of keeping them separate is that users are made explicitly aware of both the high cost of the log-probability calculation and the possibility of reducing that cost, at the expense of accuracy, by reparameterization. If we do combine the two, we also need to figure out whether the behavior of MinimalReparam needs to be modified when handling the Stable distribution.

@BenZickel (Contributor, Author) commented

One more option that comes to mind is to keep both Stable and StableWithLogProb and add the .log_prob method to Stable as well. This way a user can enforce no reparameterization by using StableWithLogProb instead of Stable.

@fritzo fritzo merged commit 0678b35 into pyro-ppl:dev May 28, 2024
9 checks passed
Development

Successfully merging this pull request may close these issues.

Trouble with basic parameter estimation with the Stable distribution