ENH: Poisson disk sampling for arbitrary bounds #20288

mplough-kobold · 2024-03-19T15:14:20Z

Is your feature request related to a problem? Please describe.

#13918 added Poisson disk sampling of the unit hypercube to scipy.stats.qmc. Poisson disk sampling is widely used in image processing and image generation; see scikit-image/scikit-image#2380 for example use cases. Sampling of the unit hypercube is not sufficient for image processing applications because an image can be any aspect ratio. The sampling algorithm itself must be aware of the bounds from which to draw samples; scaling samples from the unit hypercube using scipy.stats.qmc.scale results in loss of the radius distance property through unequal scaling. In the two dimensional case for example, a circular radius gets squashed into an ellipse as shown below.

Original sampling of unit hypercube

After scaling

Describe the solution you'd like.

It will be best to modify scipy.stats.qmc.PoissonDisk to optionally accept, along with the existing dimension parameter d, a d-dimensional u_bounds parameter similar to scipy.stats.qmc.scale. If u_bounds is left unspecified the Poisson disk sampling will proceed on the unit hypercube. If it is specified the samples will be scaled prior to radius consideration.

Describe alternatives you've considered.

Sample scaling via scipiy.stats.qmc.scale is not possible due to the distortion it introduces.

Subclassing scipy.stats.qmc.PoissonDisk to allow pre-scaling is not easy due to the tight coupling of the radius parameter and the initialization of the cell grid.

Currently it's necessary to either create a parallel PoissonDisk implementation or to not use SciPy for Poisson disk sampling from arbitrary bounds.

It's possible to calculate the image aspect ratio and do rejection sampling on the unit hypercube, rejecting samples that lie outside a "hyperrectangle". For 2D that looks like this:

import numpy as np
from scipy.stats import qmc


class AspectRatioRejection:
    def __init__(self, width: int, height: int):
        """Rejection sampling to get around scipy.stats.qmc.PoissonDisk scaling issues.

        See https://github.com/scipy/scipy/issues/20288.
        """
        if width >= height:
            self.xmax = 1.0
            self.ymax = height / width
        else:
            self.xmax = width / height
            self.ymax = 1.0

    def __call__(self, sample: tuple[float, float]) -> bool:
        """Reject sample if it's outside the aspect ratio."""
        return sample[0] > self.xmax or sample[1] > self.ymax


def fill_space_blue_noise_samples(
    width: int,
    height: int,
    radius_fraction: float = 0.05,
    rng: np.random.Generator | None = None
):
    """Fill image area with blue noise samples.

    Samples are spaced at least `radius_fraction` apart, where radius_fraction is a fraction of the image width.
    """
    aspect_reject = AspectRatioRejection(width, height)

    rng = np.random.default_rng() if rng is None else rng
    engine = qmc.PoissonDisk(d=2, radius=radius_fraction, seed=rng)
    samples = [s * max(width, height) for s in engine.fill_space() if not aspect_reject(s)]

    return samples

However, the performance gets worse and worse as the space's aspect ratio gets larger. For the case of a line, all samples will be almost surely rejected. However, it's very easy (and performant) to generate samples in a lower dimensional space so this performance hit can be avoided entirely through prescaling.

Additional context (e.g. screenshots, GIFs)

No response

The text was updated successfully, but these errors were encountered:

tupui · 2024-03-25T06:02:31Z

Hi @mplough-kobold thank you for the suggestion. Also you detailed report is highly appreciated 🙏

That's a legitimate ask 👍 would you be interested in making a PR? I am happy to help with reviewing it or helping you implement the solution.

mplough-kobold · 2024-04-13T00:36:37Z

@tupui - glad the report was helpful. I'd be interested in making a PR but unfortunately don't have time right now to dig in. So this is definitely up for grabs if someone else wants to pick this up.

tupui · 2024-04-13T12:35:05Z

Thanks for letting us know. I also don't have time at the moment. Feel free to come back when you do have time 😃

mplough-kobold added the enhancement A new feature or improvement label Mar 19, 2024

dschmitz89 added the scipy.stats label Mar 19, 2024

tupui mentioned this issue Apr 18, 2024

META: tracker for improvements in scipy.stats.qmc #14510

Open

31 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: Poisson disk sampling for arbitrary bounds #20288

ENH: Poisson disk sampling for arbitrary bounds #20288

mplough-kobold commented Mar 19, 2024 •

edited

tupui commented Mar 25, 2024 •

edited

mplough-kobold commented Apr 13, 2024

tupui commented Apr 13, 2024

ENH: Poisson disk sampling for arbitrary bounds #20288

ENH: Poisson disk sampling for arbitrary bounds #20288

Comments

mplough-kobold commented Mar 19, 2024 • edited

Is your feature request related to a problem? Please describe.

Describe the solution you'd like.

Describe alternatives you've considered.

Additional context (e.g. screenshots, GIFs)

tupui commented Mar 25, 2024 • edited

mplough-kobold commented Apr 13, 2024

tupui commented Apr 13, 2024

mplough-kobold commented Mar 19, 2024 •

edited

tupui commented Mar 25, 2024 •

edited