Error handling of distributions::Uniform::new #1195

dhardy · 2021-10-19T15:49:35Z

Currently, Uniform::new will panic on invalid parameters:

- low >= high (or > for new_inclusive)
- (float, debug only) non-finite low or high
- (float) non-finite scale = high - low

Since #581 / #770, all other distributions with a fallible constructor return a Result instead of panicking (I think).

For: consistency and utility (error checking in derived types)

Against: this is another significant breaking change, when we hope to be getting close to stable (though now is certainly better than later)

Alternative: try_new, try_new_inclusive constructors - but this is inconsistent and results in far too many methods.

Also related: Rng::gen_range uses this. This should probably still panic on failure (note also Rng::fill vs Rng::try_fill; Rng::gen_bool and Rng::gen_ratio which may panic).
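
A minimal sketch of that split, assuming a fallible constructor plus a convenience wrapper that keeps panicking. `IntRange`, `EmptyRange`, `width` and `range_or_panic` are hypothetical stand-ins for illustration, not rand's API:

```rust
/// Hypothetical stand-in for a uniform integer range (not the rand type).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct IntRange {
    low: i32,
    high: i32,
}

/// Hypothetical error: the requested range contains no values.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct EmptyRange;

impl IntRange {
    /// Fallible constructor for the half-open range [low, high).
    pub fn new(low: i32, high: i32) -> Result<Self, EmptyRange> {
        if low < high {
            Ok(IntRange { low, high })
        } else {
            Err(EmptyRange)
        }
    }

    /// Number of values in the range, widened to avoid overflow.
    pub fn width(&self) -> u64 {
        (self.high as i64 - self.low as i64) as u64
    }
}

/// Convenience wrapper in the spirit of `Rng::gen_range`: it builds on the
/// fallible constructor but panics on invalid input.
pub fn range_or_panic(low: i32, high: i32) -> IntRange {
    IntRange::new(low, high).expect("cannot sample empty range")
}
```

Callers that want to handle bad input use the constructor directly; callers that treat bad input as a bug use the wrapper.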

I suggest this error type:

enum Error {
    /// Low > high, or equal in case of exclusive range
    EmptyRange,
    /// Input or range (high - low) is non-finite. Not relevant to integer types. In release mode only the range is checked.
    NonFinite,
}
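
A rough sketch of how a float constructor might map the three panic conditions onto this enum. The helper `check_bounds` is hypothetical, and unlike the current code it checks the inputs in every build mode, not only in debug builds:

```rust
/// The error type suggested above.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Error {
    /// Low > high, or equal in case of exclusive range
    EmptyRange,
    /// Input or range (high - low) is non-finite.
    NonFinite,
}

/// Hypothetical helper: validates float bounds and returns the scale
/// (high - low) on success. `inclusive` selects [low, high] over [low, high).
pub fn check_bounds(low: f64, high: f64, inclusive: bool) -> Result<f64, Error> {
    // Assumption: inputs are always checked here, also in release builds.
    if !(low.is_finite() && high.is_finite()) {
        return Err(Error::NonFinite);
    }
    let non_empty = if inclusive { low <= high } else { low < high };
    if !non_empty {
        return Err(Error::EmptyRange);
    }
    // Two finite bounds can still overflow to infinity when subtracted.
    let scale = high - low;
    if !scale.is_finite() {
        return Err(Error::NonFinite);
    }
    Ok(scale)
}
```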

Link: #1165


dhardy · 2021-10-21T13:06:53Z

Follow up change: WeightedIndex::new and assign_weights (whatever it gets called) should be updated to account for Uniform::new returning a Result.
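
One way this could look, as a sketch only: the weighted constructor propagates the range error with `?` through a `From` conversion. All names here (`Weighted`, `WeightedError::InvalidRange`, `RangeError`, `uniform_new`) are hypothetical and do not mirror the actual rand types:

```rust
/// Hypothetical error from a fallible uniform-range constructor.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct RangeError;

/// Hypothetical error type for a weighted index.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum WeightedError {
    NoItem,
    InvalidWeight,
    /// The total weight did not form a valid sampling range.
    InvalidRange(RangeError),
}

impl From<RangeError> for WeightedError {
    fn from(e: RangeError) -> Self {
        WeightedError::InvalidRange(e)
    }
}

/// Stand-in for a fallible `Uniform::new`; returns (low, scale).
fn uniform_new(low: f64, high: f64) -> Result<(f64, f64), RangeError> {
    let scale = high - low;
    if low < high && scale.is_finite() {
        Ok((low, scale))
    } else {
        Err(RangeError)
    }
}

pub struct Weighted {
    cumulative: Vec<f64>,
    range: (f64, f64),
}

impl Weighted {
    pub fn new(weights: &[f64]) -> Result<Self, WeightedError> {
        if weights.is_empty() {
            return Err(WeightedError::NoItem);
        }
        let mut total = 0.0;
        let mut cumulative = Vec::with_capacity(weights.len());
        for &w in weights {
            // Rejects negative weights and NaN.
            if !(w >= 0.0) {
                return Err(WeightedError::InvalidWeight);
            }
            total += w;
            cumulative.push(total);
        }
        // The range error is converted by `?` instead of causing a panic.
        let range = uniform_new(0.0, total)?;
        Ok(Weighted { cumulative, range })
    }

    /// Number of weighted items.
    pub fn len(&self) -> usize {
        self.cumulative.len()
    }

    /// Total weight, i.e. the scale of the sampling range.
    pub fn total(&self) -> f64 {
        self.range.1
    }
}
```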

kazcw · 2021-10-24T19:38:01Z

I agree that this should be done now or never, and IMO it's better to have a breaking change now than have this be an inconsistency in the API forever.

If the error type exposes the distinction between EmptyRange and NonFinite, we should assume people will rely on that distinction, and commit to a policy about which error code will be returned when both have occurred. Preferably, this policy should be the same between debug and release builds, without imposing a performance burden that we currently avoid by making finer distinctions in debug_asserts only. I think designing and implementing such a policy would be possible, but not trivial.
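
To illustrate the ambiguity, here is a small sketch with two plausible check orders that disagree on the same input. Both functions are hypothetical and neither is claimed to match the library's actual order of checks:

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Error {
    EmptyRange,
    NonFinite,
}

/// Ordering A: test for emptiness first, then check only the scale.
pub fn check_empty_first(low: f64, high: f64) -> Result<(), Error> {
    if !(low < high) {
        return Err(Error::EmptyRange);
    }
    if !(high - low).is_finite() {
        return Err(Error::NonFinite);
    }
    Ok(())
}

/// Ordering B: test inputs and scale for finiteness first.
pub fn check_finite_first(low: f64, high: f64) -> Result<(), Error> {
    if !(low.is_finite() && high.is_finite() && (high - low).is_finite()) {
        return Err(Error::NonFinite);
    }
    if !(low < high) {
        return Err(Error::EmptyRange);
    }
    Ok(())
}
```

For `low = f64::INFINITY, high = 0.0` both conditions hold, so ordering A reports `EmptyRange` while ordering B reports `NonFinite`. A public enum would need to document which one wins.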

I propose avoiding that hullabaloo by returning an opaque error type. The type's debug output can include details about the exact error encountered, which would be as informative to a debugging developer as the current assert approach; if it's ever decided in the future that users actually do need to distinguish these cases programmatically, an accessor could be added in a backward-compatible manner in the style of io::ErrorKind.
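
A sketch of what such an opaque type could look like. The private `Repr` enum, the `validate` helper, and the `ErrorKind` / `kind()` accessor are hypothetical; the accessor stands for the kind of addition that could be made later without breaking callers:

```rust
use std::fmt;

/// Private detail: never named in the public API.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Repr {
    EmptyRange,
    NonFinite,
}

/// Opaque public error. The Debug output exposes the cause for developers.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Error(Repr);

impl fmt::Display for Error {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.write_str("invalid range for uniform distribution")
    }
}

impl std::error::Error for Error {}

/// Hypothetical validation helper for a half-open float range.
pub fn validate(low: f64, high: f64) -> Result<(), Error> {
    if !(low < high) {
        return Err(Error(Repr::EmptyRange));
    }
    if !(high - low).is_finite() {
        return Err(Error(Repr::NonFinite));
    }
    Ok(())
}

/// Possible later addition, in the style of `io::ErrorKind`.
#[non_exhaustive]
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ErrorKind {
    EmptyRange,
    NonFinite,
}

impl Error {
    /// Adding this accessor later does not break existing users.
    pub fn kind(&self) -> ErrorKind {
        match self.0 {
            Repr::EmptyRange => ErrorKind::EmptyRange,
            Repr::NonFinite => ErrorKind::NonFinite,
        }
    }
}
```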

SWW13 · 2022-04-21T11:36:41Z

When using WeightedIndex, "Uniform::new: range overflow" is a rather surprising panic with no direct mention in the documentation. The panic is painful to debug compared to a Result, so there should be a panic-free way to use all random distributions.

vks · 2022-04-22T19:54:29Z

@kazcw Unfortunately, we have to make the error type public if we want to allow other crates to add support for their types to Uniform by implementing UniformSampler.

I'm not sure how often that is used in practice, but removing this option would be a regression.


kazcw · 2022-04-22T21:37:00Z

The solution does not need to preclude other crates from implementing UniformSampler.

The plan is to make UniformSampler::new fallible, right? Presumably something like this:

pub trait UniformSampler {
    // The possible errors vary for different types, and we cannot anticipate the classes of errors
    // for all implementors of the trait, so this is an associated type.
    type Error: Display;
    fn new<B1, B2>(low: B1, high: B2) -> Result<Self, Self::Error>;
    // ...
}

impl<X: SampleUniform> Uniform<X> {
    fn new<B1, B2>(low: B1, high: B2) -> Result<Self, X::Error> { ... }
    // ...
}

pub struct InvalidFloatRange(FloatRangeError);
enum FloatRangeError {
    Empty,
    Infinite,
}
impl Display for InvalidFloatRange {
    // ...
}
// In the macro defining floating point samplers:
impl UniformSampler for UniformFloat<$Ty> {
    // The public wrapper is exposed; the FloatRangeError enum stays private.
    type Error = InvalidFloatRange;
    // ...
}

pub struct EmptyRange;
// In the macro defining integer samplers:
impl UniformSampler for UniformInt<$Ty> {
    type Error = EmptyRange;
    // ...
}

So, the question is whether, in a case like InvalidFloatRange where more than one reason for failure is possible, we:

1. Hide the specific reasons for the provided implementations' errors (as above), and only use that information to produce messages for debugging purposes.
2. Programmatically expose the reason for failure.

My point earlier was that, if we go with (2), we should have a consistent and well-documented precedence of errors when multiple failures are possible; but I can't picture much need for programmatic inspection of the error here, and (1) would be easier.
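
For reference, a simplified, compilable sketch of option (1). It assumes a cut-down trait (a plain `X` type instead of the `B1`/`B2` borrow parameters) and concrete `f64` / `i32` samplers instead of macro-generated ones; `describe` is a hypothetical helper showing that generic code only sees `Display`:

```rust
use std::fmt::{self, Display};

/// Simplified stand-in for the trait; the real one is more general.
pub trait UniformSampler: Sized {
    type X;
    type Error: Display;
    fn new(low: Self::X, high: Self::X) -> Result<Self, Self::Error>;
}

/// Public but opaque: the reason is kept in a private enum.
pub struct InvalidFloatRange(FloatRangeError);
enum FloatRangeError {
    Empty,
    Infinite,
}

impl Display for InvalidFloatRange {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self.0 {
            FloatRangeError::Empty => f.write_str("empty float range"),
            FloatRangeError::Infinite => f.write_str("non-finite float range"),
        }
    }
}

pub struct UniformFloat {
    pub low: f64,
    pub scale: f64,
}

impl UniformSampler for UniformFloat {
    type X = f64;
    type Error = InvalidFloatRange;
    fn new(low: f64, high: f64) -> Result<Self, Self::Error> {
        if !(low < high) {
            return Err(InvalidFloatRange(FloatRangeError::Empty));
        }
        let scale = high - low;
        if !scale.is_finite() {
            return Err(InvalidFloatRange(FloatRangeError::Infinite));
        }
        Ok(UniformFloat { low, scale })
    }
}

pub struct EmptyRange;

impl Display for EmptyRange {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.write_str("empty integer range")
    }
}

pub struct UniformInt {
    pub low: i32,
    pub range: u32,
}

impl UniformSampler for UniformInt {
    type X = i32;
    type Error = EmptyRange;
    fn new(low: i32, high: i32) -> Result<Self, Self::Error> {
        if low < high {
            Ok(UniformInt { low, range: high.wrapping_sub(low) as u32 })
        } else {
            Err(EmptyRange)
        }
    }
}

/// Generic code can format the error but cannot inspect its cause.
pub fn describe<S: UniformSampler>(low: S::X, high: S::X) -> String {
    match S::new(low, high) {
        Ok(_) => "ok".to_string(),
        Err(e) => format!("error: {}", e),
    }
}
```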

dhardy · 2022-04-25T07:20:41Z

I agree with @kazcw that we likely don't need to programmatically expose error details.

But, in this case, why have an associated Error type?

We could do the following, or store a Box<dyn Error> cause. (About the only thing these cfgs achieve is letting the cause strings/types be optimised out of release binaries.)

pub struct Error {
    #[cfg(debug_assertions)]
    cause: &'static str,
}
impl Error {
    pub fn new(cause: &'static str) -> Self {
        #[cfg(not(debug_assertions))] {
            let _ = cause;
        }
        Error {
            #[cfg(debug_assertions)]
            cause,
        }
    }
}
impl std::fmt::Display for Error {
    fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
        #[cfg(debug_assertions)] {
            write!(f, "invalid range: {}", self.cause)
        }
        #[cfg(not(debug_assertions))] {
            write!(f, "invalid range")
        }
    }
}
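
For comparison, a sketch of the `Box<dyn Error>` alternative mentioned above. This is one possible shape, not a settled design; the generic `new` signature and the `source()` implementation are assumptions:

```rust
use std::error::Error as StdError;
use std::fmt;

/// Single concrete error type carrying an arbitrary boxed cause.
#[derive(Debug)]
pub struct Error {
    cause: Box<dyn StdError>,
}

impl Error {
    /// Accepts anything convertible into a boxed error, including `&str`.
    pub fn new<E>(cause: E) -> Self
    where
        E: Into<Box<dyn StdError>>,
    {
        Error { cause: cause.into() }
    }
}

impl fmt::Display for Error {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "invalid range: {}", self.cause)
    }
}

impl StdError for Error {
    /// Exposes the cause through the standard error chain.
    fn source(&self) -> Option<&(dyn StdError + 'static)> {
        let cause: &(dyn StdError + 'static) = &*self.cause;
        Some(cause)
    }
}
```

Compared with the `&'static str` version, this allocates on the error path and always keeps the cause in release builds, but it lets implementors of `UniformSampler` attach their own error values.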

* Forbid unsafe code in crates without unsafe code
  This helps tools like `cargo geiger`.
* Make `Uniform` constructors return a result
  - This is a breaking change.
  - The new error type had to be made public, otherwise `Uniform` could not be extended for user-defined types by implementing `UniformSampler`.
  - `rand_distr` was updated accordingly.
  - Also forbid unsafe code for crates where none is used.
  Fixes #1195, #1211.
* Address review feedback
* Make `sample_single` return a `Result`
* Fix benchmarks
* Small fixes
* Update src/distributions/uniform.rs

dhardy added the B-API Breakage: API label Oct 19, 2021

dhardy added this to the 0.9 release milestone Oct 19, 2021

dhardy mentioned this issue Oct 19, 2021

Full update of weighted index by assigning weights #1194

Closed

dhardy mentioned this issue Oct 25, 2021

Outline path towards 1.0.0 #693

Open

vks mentioned this issue Jan 18, 2022

Tracker: rand 0.9 #1165

Open

24 tasks

vks mentioned this issue Apr 22, 2022

Make Uniform constructors return a result #1229

Merged

dhardy closed this as completed in #1229 Feb 6, 2023
