PBA [Waeber2013]

This is an implementation of the probabilistic bisection algorithm (PBA) described by Rolf Waeber in his 2013 PhD thesis. PBA is a 1D stochastic root-finding algorithm. This means that it is meant to find the point where a noisy function (i.e. a function that may return different values each time its evaluated at the same point) crosses the x-axis. More precisely, given g(x) = f(x) + ε(x), the goal is to find x such that E[g(x)] = 0, where f(x) is the function we are interested in, ε is a normally distributed noise function with median 0, g(x) is the only way we can observe f(x), and E[g(x)] is the expected value of g(x).

This algorithm works by repeatedly evaluating the noisy function at a single x until the probability that we know the true sign of f(x) exceeds a specified threshold. This information is used to build a Bayesian posterior distribution describing the location of the root. The next point to evaluate is then chosen from this distribution. There are two features that make this algorithm unique:

It provides a true confidence interval in addition to the root itself.
It places very little restraints on the form of the noise function.

However, there are some caveats to be aware of:

In my experience, the algorithm rarely converges. This is due to the fact that the confidence intervals only narrow as new x-coordinates are sampled, but the closer the algorithm gets to the root, the more time it spends sampling each x-coordinate. That said, the algorithm seems to find the root very accurately even when it doesn't converge.
The algorithm isn't very numerically stable. The posterior distribution is represented using two arrays: one for the bin edges and one for the bin heights. As the algorithm progresses, the bins near the root get very thin and very tall, which leads to loss-of-precision and overflow issues. My only advice on how to avoid these problems is to limit the number of iterations. It might also help to increase the p_c parameter.

Note that I myself only have a rudimentary understanding of the math behind this algorithm. This implementation is based on scripts I received from the authors, and I tried to test my code as well as possible to convince myself that it's doing the right thing, but I'm outside my comfort zone here.

Installation

Install from PyPI:

$ pip install pba_waeber2013

Usage

This module provides a single public function:

>>> from pba_waeber2013 import pba_waeber2013
>>> pba_waeber2013(
...         f, a, b, *
...         tol=None,
...         rtol=None,
...         alpha=0.05,
...         p_c=0.65,
...         maxiter=1000,
...         check_bounds=False,
...         slope=None,
...         args=None,
...         kwargs=None,
... )

Below are brief descriptions of the parameters:

f: The stochastic function of interest. This function can take any number of arguments (see args and kwargs), but the first should be the x coordinate to evaluate the function at.
a b: The lower and upper bounds, respectively, on where the root can occur. There must be exactly one root in this interval.
tol: How narrow the confidence interval needs to be in order for the algorithm to be converged. Note that if neither tol nor rtol are specified, the algorithm will just run for the maximum number of iterations.
rtol: Similar to tol, but multiplied by the estimated root.
alpha: A parameter controlling the width of the confidence intervals. Specifically, the confidence level is given by 1 − alpha. In other words, for 95% confidence intervals, set alpha to 0.05.
p_c: Repeatedly evaluate the function at each x coordinate until we can reject the null hypothesis that f(x) = 0 with probability p_c. At that point, the sign of f(x) will be taken as whichever sign we observed most often.
maxiter: The maximum number of function calls to make.
check_bounds: Evaluate the function at the bounds a and b before starting the bisection. This achieves two things. First, it checks that the bounds have different signs. If this is not the case, then there are either 0 or >1 roots in the interval, and so this algorithm is not applicable. Second, it determines the slope of the function. If the bounds are not checked, the slope parameter must be manually specified.
slope: +1 if the function is increasing (i.e. negative before the root and positive after) or -1 if the function is decreasing. A slope must be given unless check_bounds is True (in which case it will be calculated internally).
args: Any extra arguments to pass to the function on each evaluation.
kwargs: Any extra keyword arguments to pass to the function on each evaluation.

The return value is an object with the following attributes:

x: The location of the root.
x_obs: All of the x coordinates where the function was evaluated.
f_obs: The result of evaluating the function at each of the above x coordinates.
x_post: The bin edges of the posterior distribution.
log_p_post: The natural logarithms of the bin heights of the posterior distribution. Logarithms are used to avoid multiplication overflows.
ci: The confidence interval evaluated independently after a sign is determined for each coordinate. Note that these intervals can grow and shrink over time. See [Waeber2013] §3.3 for more information.
ci_seq: The sequential confidence interval. Unlike the ci intervals, these are guaranteed to never expand. For that reason, these are the intervals used to check for convergence.
converged: True if the algorithm terminated because the confidence interval grew narrower than the given tolerance, False if the algorithm terminated because it reached the maximum number of iterations.

References

Waeber R. (2013) "Probabilistic bisection search for stochastic root-finding."
Frazier PI, Henderson SG, Waeber R (2016) "Probabilistic bisection converges almost as quickly as stochastic approximation", arXiv:1612.03964
Robbins H and Siegmund D. (1974) "The expected sample size of some tests of power one", The Annals of Statistics, 2(3), pp. 415–436. doi:10.1214/aos/1176342704.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github/workflows		.github/workflows
tests		tests
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
CHANGELOG.md		CHANGELOG.md
LICENSE.txt		LICENSE.txt
README.rst		README.rst
pba_waeber2013.py		pba_waeber2013.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

tests

tests

.gitignore

.gitignore

.readthedocs.yml

.readthedocs.yml

CHANGELOG.md

CHANGELOG.md

LICENSE.txt

LICENSE.txt

README.rst

README.rst

pba_waeber2013.py

pba_waeber2013.py

pyproject.toml

pyproject.toml

Repository files navigation

PBA [Waeber2013]

Installation

Usage

References

About

Releases 1

Packages

Contributors 2

Languages

License

kalekundert/pba_waeber2013

Folders and files

Latest commit

History

Repository files navigation

PBA [Waeber2013]

Installation

Usage

References

About

Resources

License

Stars

Watchers

Forks

Languages