Parsing of long floating point numbers causes panic #1421

MatejKastak · 2021-10-05T11:51:56Z

Hello, first of all thank you, nom is a great tool!

I have tried to migrate to a new nom version (7.0.0) and encountered some unexpected behavior.

parsing of floating point numbers (float, double) panics for long inputs
sadly I found no way to handle this error, program is aborted
are there any tips to avoid such panics?
recent 6.* versions does not panic
migrating to the most recent version is not critical to me, but I wanted to ask/note this behavior for others

After a little digging, I think this is caused by switching to minimal-lexical.

the lexical-core crate used for float parsing has now been replaced with minimal-lexical: the new crate is faster to compile, faster to parse, and has no dependencies

Somewhat related to #1384

Prerequisites

Rust version : rustc 1.55.0 (c8dfcfe04 2021-09-06) also tried it with nightly
nom version : v7.0.0
nom compilation features used: default

Test case

mod tests {
    use nom::number::complete::float;
    use nom::IResult;

    fn parser(s: &str) -> IResult<&str, f32> {
        float(s)
    }

    #[test]
    fn float_panic() {
        // 7.0.0 -> panic
        // 6.2.1 -> pass
        assert!(std::matches!(parser("0.00000000000000000087"), Ok(_)));
        assert!(std::matches!(
            parser("0.00000000000000000000000000000000000000000000000000000000000000087"),
            Ok(_)
        ));

    }
}

nom v7.0.0

---- tests::float_panic stdout ----
thread 'tests::float_panic' panicked at 'attempt to shift left with overflow', /home/anon/.cargo/registry/src/github.com-1ecc6299db9ec823/minimal-lexical-0.1.4/src/lemire.rs:155:5

nom v6.2.1

running 1 test
test tests::float_panic ... ok

The text was updated successfully, but these errors were encountered:

Geal · 2021-10-07T08:48:28Z

thanks for the report. Could you test with nom directly from master?

MatejKastak · 2021-10-07T21:23:04Z

Sure.

~/dev/sandbox/nom-float-panic-new master* 21s
λ cargo test
    Updating crates.io index
   Compiling version_check v0.9.3
   Compiling memchr v2.3.4
   Compiling minimal-lexical v0.2.1
   Compiling nom v7.0.0 (/home/anon/dev/sandbox/nom)
   Compiling nom-float-panic-new v0.1.0 (/home/anon/dev/sandbox/nom-float-panic-new)
    Finished test [unoptimized + debuginfo] target(s) in 9.44s
     Running unittests (target/debug/deps/nom_float_panic_new-b07783cdd65cd87f)

running 1 test
test tests::float_panic ... FAILED

failures:

---- tests::float_panic stdout ----
thread 'tests::float_panic' panicked at 'attempt to shift left with overflow', /home/anon/.cargo/registry/src/github.com-1ecc6299db9ec823/minimal-lexical-0.2.1/src/lemire.rs:155:5
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace

I did a git clone, current master HEAD is at dfa5591, and then specified nom dependency as path

[dependencies]
# nom = "6"
nom = { path = "../nom" }

nbigaouette · 2021-10-12T21:54:32Z

I got the same panic using b"0.00000000000000000018" as input.

mullr · 2021-10-22T19:46:54Z

Here's the workaround I'm using for now. It's a bit of a cudgel, but afaict it gets the old behavior back.

/// This is derived from nom::number::complete::float, which is unfortunately broken in nom 7.0.0.
/// - https://github.com/Geal/nom/issues/1421
/// - https://github.com/Geal/nom/issues/1384
pub fn float<T, E: ParseError<T>>(input: T) -> IResult<T, f32, E>
where
    T: nom::Slice<std::ops::RangeFrom<usize>>
        + nom::Slice<std::ops::RangeTo<usize>>
        + nom::Slice<std::ops::Range<usize>>,
    T: Clone + nom::Offset,
    T: nom::InputIter + nom::InputLength + nom::InputTake,
    <T as nom::InputIter>::Item: nom::AsChar + Copy,
    <T as nom::InputIter>::IterElem: Clone,
    T: nom::InputTakeAtPosition,
    <T as nom::InputTakeAtPosition>::Item: nom::AsChar,
    T: nom::AsBytes,
    T: for<'a> nom::Compare<&'a [u8]>,
    T: nom::ParseTo<f32>,
    T: nom::InputTake + nom::Compare<&'static str>,
{
    let (i, s) = recognize_float(input)?;
    match s.parse_to() {
        Some(f) => (Ok((i, f))),
        None => Err(nom::Err::Error(E::from_error_kind(
            i,
            nom::error::ErrorKind::Float,
        ))),
    }
}

pub fn recognize_float<T, E: ParseError<T>>(input: T) -> IResult<T, T, E>
where
    T: nom::Slice<std::ops::RangeFrom<usize>> + nom::Slice<std::ops::RangeTo<usize>>,
    T: Clone + nom::Offset,
    T: nom::InputIter,
    <T as nom::InputIter>::Item: nom::AsChar,
    T: nom::InputTakeAtPosition,
    <T as nom::InputTakeAtPosition>::Item: nom::AsChar,
    T: nom::InputTake + nom::Compare<&'static str>,
{
    alt((
        nom::number::complete::recognize_float,
        nom::combinator::recognize(nom::bytes::complete::tag("NaN")),
    ))(input)
}

parasyte · 2021-11-06T17:30:54Z

This should have been closed by #1455

Geal · 2021-11-06T17:39:41Z

Yes. Leaving it open for now, I'd like to get a proper fix in minimal-lexical, because we take a perf hit with that workaround

Might as well avoid the dep until rust-bakery/nom#1421 is fixed which is blocked on Alexhuszagh/minimal-lexical#7

Geal mentioned this issue Oct 10, 2021

[BUG] Alexhuszagh/minimal-lexical#7

Open

mullr mentioned this issue Oct 22, 2021

Updating to Nom 7 has broken double("NaN") #1384

Closed

Geal mentioned this issue Nov 2, 2021

introduce a workround for float parsing #1455

Merged

Geal added this to the 8.0 milestone Mar 14, 2022

epage added a commit to winnow-rs/winnow that referenced this issue Dec 27, 2022

chore: Remove dead minimal_lexical code

0470ef7

Might as well avoid the dep until rust-bakery/nom#1421 is fixed which is blocked on Alexhuszagh/minimal-lexical#7

epage added a commit to winnow-rs/winnow that referenced this issue Dec 27, 2022

chore: Remove dead minimal_lexical code

1b25592

Might as well avoid the dep until rust-bakery/nom#1421 is fixed which is blocked on Alexhuszagh/minimal-lexical#7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parsing of long floating point numbers causes panic #1421

Parsing of long floating point numbers causes panic #1421

MatejKastak commented Oct 5, 2021

Geal commented Oct 7, 2021

MatejKastak commented Oct 7, 2021

nbigaouette commented Oct 12, 2021

mullr commented Oct 22, 2021

parasyte commented Nov 6, 2021

Geal commented Nov 6, 2021

Parsing of long floating point numbers causes panic #1421

Parsing of long floating point numbers causes panic #1421

Comments

MatejKastak commented Oct 5, 2021

Prerequisites

Test case

Geal commented Oct 7, 2021

MatejKastak commented Oct 7, 2021

nbigaouette commented Oct 12, 2021

mullr commented Oct 22, 2021

parasyte commented Nov 6, 2021

Geal commented Nov 6, 2021