Introducing pyro.infer.predictive.WeighedPredictive which reports weights along with predicted samples #3345

BenZickel · 2024-03-22T14:42:01Z

The Problem

When sampling from the posterior predictive distribution we are often using a guide as an approximation for the posterior. As mentioned in #3340 it is often desirable to correct for the non-uniform per sample gap between the model log-probability and the guide log-probability. This gap is essentially the weight that should be assigned to each sample.

The current implementation of pyro.infer.predictive.Predictive does not support calculation of these weights.

The Proposed Solution

Add pyro.infer.predictive.WeighedPredictive which supports calculation of per sample weights.

The implementation relies on three objects:

Model which samples from priors and observations (same as in instantiation of pyro.infer.predictive.Predictive).
Guide which approximates the posterior given observations (same as in instantiation of pyro.infer.predictive.Predictive).
Model with observations constrained to be the actual observations. This model was used in creating the guide and is provided to instances of pyro.infer.predictive.WeighedPredictive when called as the keyword argument model_guide (as in the model that was used when creating the guide).

The model_guide is what enables calculation of the weights. If not provided we use the model provided at instantiation of pyro.infer.predictive.WeighedPredictive as the model_guide (in this case the model provided at instantiation is usually already with observations constrained to be the actual observations).

Design Considerations

Maintain backwards compatibility of pyro.infer.predictive.Predictive.
Reuse as much as possible from pyro.infer.predictive.Predictive when implementing pyro.infer.predictive.WeighedPredictive.

… and trace.

…ighedPredictive.

fritzo · 2024-03-25T16:31:19Z

Hi @BenZickel I think it's a great idea to add a Predictive interface that can incorporate both a set of samples and some sort importance weights. And I appreciate your efforts to share interface and code with Predictive (which reduces our maintenance efforts!). Here are some general questions & comments before I do a thorough code review:

I think it would also be good to add more mathematical details in the docstring of this class, to make it clear exactly what it's doing.
Do I understand correctly that WeightedPredictive samples from the model p by drawing proposals from the guide q, then weighting each sample z by p(z)/q(z), returning a weighted set of samples?
If so, it might be nice to add some sort of utility importance_resample: WeightedSamples -> Samples to convert from the weighted representation back to an unweighted representation (as done in SMCFilter).
How does your WeightedPredictive relate to pyro.infer.Importance? Could they share any machinery? Should we link their docstrings?
How does your WeightedPredictive relate to pyro.infer.ReweightedWakeSleep? IIUC, WeightedPredictive + importance_resample is like ReweightedWakeSleep, but where the former optimizes ELBO(guide,model) and the latter directly optimizes the model posterior density p(z|x)?

cc @martinjankowiak who may have a better understanding of the relationships between these inference algorithms.

….infer.Importance.

…tive.

BenZickel · 2024-03-25T22:14:11Z

Thank you for your comments @fritzo. See below my feedback:

I've added more mathematical details to the docstring.
Yes, WeighedPredictive returns the exact same samples returned by Predictive, namely p(x|z)q(z), but accompanies each sample with its weight p(Xobs,z)/q(z) where p(Xobs,z)=p(Xobs|z)p(z).
I agree that we need to have some way to convert weighed samples to unweighed samples, but I believe this should be added in another pull request as there are several considerations related to multi-dimensional event samples and interpolation methods (SMCFilter does not do interpolation). For now, we can calculate quantiles of weighed samples using weighed_quantile introduced in Add function for calculating quantiles of weighed samples. #3340.
I've created as much shared machinery between WeighedPredictive and pyro.infer.Importance as I could in 026cae2. They do indeed share concepts and code.
As I see it pyro.infer.ReweightedWakeSleep is a strategy to create a guide from your model and observations. As the created guide is usually not perfectly proportional to the model you would want to use WeighedPredictive in order to obtain the true distribution quantiles.

fritzo

The code looks great. Thanks for adding tests!

fritzo · 2024-03-26T11:14:29Z

pyro/infer/predictive.py

@@ -31,53 +34,58 @@ def _guess_max_plate_nesting(model, args, kwargs):
    return max_plate_nesting


+class _predictiveResults(NamedTuple):


Thanks for the cleanup! This will also help us with #2550

Ben Zickel added 7 commits March 21, 2024 21:43

Make pyro.infer.predictive._predictive always return both the samples…

2c00882

… and trace.

Added pyro.infer.predictive.WeighedPredictive and some of its tests.

c6f9532

Make model_guide in call to WeighedPredictive optional.

89955e0

Add test for WeighedPredictive with plate and event shape.

133bd74

Linting and formatting updates associated with the introduction of We…

a989e87

…ighedPredictive.

Fix naming from probability to log-probability.

7426641

Fix backwards compatbility.

a4c4d85

fritzo added the enhancement label Mar 25, 2024

Ben Zickel added 2 commits March 25, 2024 20:49

Create shared machinery between pyro.infer.WeighedPredictive and pyro…

026cae2

….infer.Importance.

Elaborate methematical details of pyro.infer.predictive.WeighedPredic…

260e528

…tive.

fritzo added the awaiting review label Mar 25, 2024

Update and fix docs.

5030d41

fritzo approved these changes Mar 26, 2024

View reviewed changes

fritzo merged commit 0dc635f into pyro-ppl:dev Mar 26, 2024
9 checks passed

BenZickel mentioned this pull request Apr 5, 2024

Resampler for weighed samples #3352

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introducing pyro.infer.predictive.WeighedPredictive which reports weights along with predicted samples #3345

Introducing pyro.infer.predictive.WeighedPredictive which reports weights along with predicted samples #3345

BenZickel commented Mar 22, 2024

fritzo commented Mar 25, 2024 •

edited

Loading

BenZickel commented Mar 25, 2024

fritzo left a comment

fritzo Mar 26, 2024

		@@ -31,53 +34,58 @@ def _guess_max_plate_nesting(model, args, kwargs):
		return max_plate_nesting


		class _predictiveResults(NamedTuple):

Introducing pyro.infer.predictive.WeighedPredictive which reports weights along with predicted samples #3345

Introducing pyro.infer.predictive.WeighedPredictive which reports weights along with predicted samples #3345

Conversation

BenZickel commented Mar 22, 2024

The Problem

The Proposed Solution

Design Considerations

fritzo commented Mar 25, 2024 • edited Loading

BenZickel commented Mar 25, 2024

fritzo left a comment

Choose a reason for hiding this comment

fritzo Mar 26, 2024

Choose a reason for hiding this comment

fritzo commented Mar 25, 2024 •

edited

Loading