Resampler for weighed samples #3352

BenZickel · 2024-04-05T15:46:28Z

Background

This pull request introduces a resampler for weighed samples that creates equally weighed samples from the distribution specified by the generator of the weighed samples (see item 3 here).

Implementation

Based on the Metropolis-Hastings algorithm.
The guide is used as a proposal distribution which is independent of the current sample.
Samples are generated and returned via the WeighedPredictiveResults class.
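The three points above can be sketched in plain Python. This is only an illustration of an independence Metropolis-Hastings resampler, not the code in this pull request: the Gaussian model and guide, and the helper names `draw_weighed` and `mh_step`, are assumptions made for the example. Each slot in the sample set holds a current sample and its log weight (log model density minus log guide density); a fresh proposal from the guide replaces it with probability min(1, w_new / w_old).

```python
import math
import random

# Illustrative target (model) and proposal (guide) densities; both are assumptions.
MODEL_MEAN, MODEL_STD = 1.0, 1.0
GUIDE_MEAN, GUIDE_STD = 0.0, 1.5


def log_normal_pdf(x, mean, std):
    """Log density of a univariate normal distribution."""
    return -0.5 * ((x - mean) / std) ** 2 - math.log(std) - 0.5 * math.log(2.0 * math.pi)


def draw_weighed(n, rng):
    """Hypothetical weighed sampler: draw from the guide and attach log weights."""
    samples = [rng.gauss(GUIDE_MEAN, GUIDE_STD) for _ in range(n)]
    log_weights = [
        log_normal_pdf(x, MODEL_MEAN, MODEL_STD) - log_normal_pdf(x, GUIDE_MEAN, GUIDE_STD)
        for x in samples
    ]
    return samples, log_weights


def mh_step(current, current_log_w, proposed, proposed_log_w, rng):
    """One independence Metropolis-Hastings step, applied slot by slot in place."""
    accepted = 0
    for i in range(len(current)):
        log_ratio = proposed_log_w[i] - current_log_w[i]
        # Accept with probability min(1, exp(log_ratio)).
        if log_ratio >= 0.0 or rng.random() < math.exp(log_ratio):
            current[i], current_log_w[i] = proposed[i], proposed_log_w[i]
            accepted += 1
    return accepted


rng = random.Random(0)
num_samples, num_steps = 2000, 20

# Start from one batch of guide samples, then keep proposing fresh batches.
state, state_log_w = draw_weighed(num_samples, rng)
for _ in range(num_steps):
    proposed, proposed_log_w = draw_weighed(num_samples, rng)
    mh_step(state, state_log_w, proposed, proposed_log_w, rng)

# The stored samples are now treated as equally weighed draws from the model.
resampled_mean = sum(state) / num_samples
print(f"mean of resampled set: {resampled_mean:.3f} (model mean is {MODEL_MEAN})")
```

Because every proposal is drawn fresh from the guide, each slot evolves independently of the others, which is the property the sketch is meant to highlight.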

Notes

The implementation guarantees that:
- Samples are independent.
- Samples will not be repeated (up to the probability that the random number generators generate identical samples) unlike sampling with replacement from a fixed set of weighed samples.
If the guide perfectly tracks the model, this sampler has no corrective effect, since the Metropolis-Hastings algorithm then has an acceptance probability of one for every new sample.
This resampler can be an efficient and fast way to correct "relatively small" errors in the guide that would otherwise take many optimization iterations to converge or not converge at all (this is demonstrated in the tests of this module).
Calculating quantiles of weighed samples by resampling will usually have less variance than direct calculation using weighed quantiles (see notes in the documentation).
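The remark about an acceptance probability of one follows from the standard acceptance rule for a proposal that does not depend on the current sample. The notation here is an assumption for illustration: p is the (possibly unnormalized) model density, q the guide density, x the current sample and x' the proposed one.

```latex
\alpha(x \to x')
  = \min\!\left(1, \frac{p(x')\,q(x)}{p(x)\,q(x')}\right)
  = \min\!\left(1, \frac{w(x')}{w(x)}\right),
\qquad w(x) = \frac{p(x)}{q(x)}
```

When q is proportional to p, the weight w is constant, so the ratio is one and every proposal is accepted. The larger the spread of the weights, the more often a high-weight sample is retained across steps.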

fritzo · 2024-04-08T22:11:46Z

Hi @BenZickel, thanks for your patience. I'm not sure I understand how MHResampler is intended to be used. Do you intend this new resampler to be used as a component in a larger training algorithm, such that the whole algorithm will be akin to Reweighted Wake Sleep? Or would the resampler be mainly used for prediction? Gosh in either case it might help to have example usage in the docstring of MHResampler.

Also, do you understand the relationship between your MHResampler and the importance_resample utility we discussed in your PR from a couple weeks ago?

@fritzo: ...it might be nice to add some sort of utility importance_resample: WeightedSamples -> Samples to convert from the weighted representation back to an unweighted representation...
@BenZickel: I agree that we need to have some way to convert weighed samples to unweighed samples, but I believe this should be added in another pull request as there are several considerations related to multi-dimensional event samples and interpolation methods...

My thinking then was that it would be nice to bridge the two worlds: weighted versus uniform samples. I figured a simple way to convert weighted -> unweighted samples would be to add a method WeightedPredictiveResult.resample() that just called _systematic_resample under the hood, and converted types. I'm not sure this .resample() method has anything to do with your current PR other than the word 'resample' 😄

BenZickel · 2024-04-09T10:46:15Z

Thx @fritzo for the review! The intended use of MHResampler is mainly for prediction and I've added an example that reflects that (the example is basically copied from the combined test for MHResampler and WeighedPredictive). Although MHResampler is mainly used for prediction, it actually creates posterior predictive samples that are independent of the guide, and therefore could produce accurate results with fewer SVI iterations and reduced overall running time (this is actually tested in this test configuration where SVI iteration count is reduced from 5000 to 1000).

Regarding your second point, MHResampler is a way to do importance_resample, but not as a method .resample of WeightedPredictiveResult. The reason is that resampling from a fixed set of samples (by a method .resample for example) creates correlation and high variance of computed quantities, whereas MHResampler continuously creates new samples and selects which current samples will be replaced by the new samples. Because of this, MHResampler needs access to the callable that creates a new WeightedPredictiveResult when called (usually an instance of WeighedPredictive, but other callables will work as well).

Lastly, you mentioned _systematic_resample which does resampling from a fixed set of samples (which is not what we want as explained in the previous paragraph). Resampling from a fixed set of samples is a necessity in Sequential Monte Carlo methods as the samples are time (sequence) dependent and therefore new samples for the current time cannot be generated without starting from time zero.
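For contrast, resampling from a fixed set can be sketched as below. This is a textbook systematic resampler written for illustration, not Pyro's `_systematic_resample`; the function name and the example weights are assumptions. It shows why a fixed set leads to repeated samples: a heavy-weight sample is copied several times while light ones are dropped.

```python
import random
from itertools import accumulate


def systematic_resample(weights, rng):
    """Return indices into a fixed sample set, chosen by systematic resampling."""
    n = len(weights)
    total = sum(weights)
    cumulative = list(accumulate(w / total for w in weights))
    cumulative[-1] = 1.0  # guard against floating point round-off
    offset = rng.random() / n  # a single random offset shared by all positions
    indices = []
    j = 0
    for i in range(n):
        position = offset + i / n  # evenly spaced positions in [0, 1)
        while cumulative[j] < position:
            j += 1
        indices.append(j)
    return indices


weights = [0.7, 0.1, 0.1, 0.05, 0.05]
indices = systematic_resample(weights, random.Random(0))
print(indices)  # index 0 appears three or four times
```

The output always has as many entries as the input, but only indices of samples already in the set can appear, so no new sample values are created.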

fritzo

Thanks for adding the example! I have just a couple more clarifying questions, whose answers I think could make MHResampler easier for users to understand.

pyro/infer/predictive.py

Ben Zickel added 5 commits April 5, 2024 00:51

Added Metropolis-Hastings algorithm based resampler (pyro.infer.predictive.MHResampler) that converts weighed samples into equally weighed samples.

0963976

Add tests for the pyro.infer.predictive.MHResampler weighed samples resampler.

09e6aab

Add documentation for the pyro.infer.predictive.MHResampler weighed samples resampler.

eae323b

Merge branch 'dev' into add_mh_resampler

6c28fbf

Add notes on pyro.infer.predictive.MHSampler behavior to the documentation.

9b83ace

fritzo added awaiting review enhancement labels Apr 8, 2024

Add example to docstring of pyro.infer.predictive.MHResampler.

98d6692

fritzo requested changes Apr 12, 2024

Elaborated and fixed documentation of pyro.infer.predictive.MHResampler.

e52b512

BenZickel requested a review from fritzo April 12, 2024 20:52

fritzo approved these changes Apr 16, 2024

fritzo merged commit 91bc2b3 into pyro-ppl:dev Apr 16, 2024
9 checks passed
