Parsing subnormal float with too many sig figs returns zero, even if it is larger than smallest subnormal #83

n8xm · 2021-07-18T05:17:01Z

When I use Base.parse, I get different results than when I use Parsers.parse:

julia> parse(Float64,"9.3494547075363499E-311")
9.3494547075363e-311

julia> parse(Float64,"9.349454707536349E-311")
9.3494547075363e-311

julia> using Parsers

julia> Parsers.parse(Float64,"9.3494547075363499E-311")
0.0

julia> Parsers.parse(Float64,"9.349454707536349E-311")
9.3494547075363e-311

I believe what is happening is that:

We are dealing with subnormal floats
If the subnormal float has "too many" sig figs, Parsers.parse returns 0.0

It is my understanding that subnormal floats sacrifice precision in order to allow representations that are "close" to zero. However, because 9.3494547075363499E-311 is larger than the smallest subnormal float, the behavior of Base.parse is what I would expect. I would only expect a parser to return 0.0 if the number being parsed is smaller than the smallest subnormal float.
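To make the ranges concrete, here is a small Python sketch (Python is used only because it is already part of the comparison in this issue; the variable names are illustrative). It assumes Python 3.9+ for math.ulp and checks that the value from the examples above lies between the smallest subnormal and the smallest normal double:

```python
import math
import sys

# Smallest positive normal double, about 2.2250738585072014e-308.
smallest_normal = sys.float_info.min
# Smallest positive subnormal double (2**-1074); math.ulp needs Python 3.9+.
smallest_subnormal = math.ulp(0.0)

x = float("9.3494547075363499E-311")

# x is subnormal: strictly between the smallest subnormal and the smallest normal.
is_subnormal = smallest_subnormal < x < smallest_normal

# In the subnormal range the gap between neighbouring doubles is fixed at 2**-1074,
# so relative precision shrinks as values approach zero.
spacing = math.ulp(x)

print(is_subnormal, smallest_subnormal, spacing)
```

With a spacing of about 4.9e-324, a value near 9.3e-311 still carries roughly 13 significant decimal digits, so the format itself does not force it to round to 0.0.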

By the way, Python's built-in parser behaves like Base.parse and not Parsers.parse. Using Python 3.9.6:

>>> float("9.3494547075363499E-311")
9.3494547075363e-311
>>> float("9.349454707536349E-311")
9.3494547075363e-311

Similarly, C's atof behaves like Base.parse and not Parsers.parse:

printf("%e\n", atof("9.3494547075363499E-311"));
printf("%e\n", atof("9.349454707536349E-311"));

The above C source produces the output:

9.349455e-311
9.349455e-311

Is there a good reason why Parsers.parse behaves differently from Base.parse in this case?

quinnj · 2021-10-14T05:03:00Z

Sorry for the slow fix here; a PR is up that should make subnormal parsing more robust: #92

Fixes #83. Previously, we eagerly bailed for small enough exponents when parsing. Instead, we can take a slightly slower path by falling back on BigInt/BigFloat for however small the exponent is to do the correct scaling.
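
The idea of falling back on exact big-number arithmetic can be sketched as follows. This is not the Parsers.jl code from #92; it is a minimal Python illustration in which `scale_exact` is a hypothetical helper and `fractions.Fraction` stands in for BigInt/BigFloat. It assumes the significant digits have already been read into an integer and the decimal exponent adjusted to match:

```python
from fractions import Fraction


def scale_exact(digits: int, exp10: int) -> float:
    """Return the double nearest to digits * 10**exp10 (hypothetical helper)."""
    if exp10 >= 0:
        exact = Fraction(digits * 10**exp10)
    else:
        exact = Fraction(digits, 10 ** (-exp10))
    # The value is held exactly as a rational; rounding happens only once,
    # in this final conversion to a double.
    return float(exact)


# "9.3494547075363499E-311" is 93494547075363499 * 10**-327.
value = scale_exact(93494547075363499, -327)
print(value)
print(value == float("9.3494547075363499E-311"))
```

Because the scaling is exact, a result of 0.0 should only appear when the input really is closer to zero than to the smallest subnormal, which matches the behavior expected in the original report.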

n8xm mentioned this issue Jul 18, 2021

Float with many decimal places unexpectedly read as 0.0 JuliaData/CSV.jl#855

Closed

Felix-Gauthier mentioned this issue Sep 1, 2021

crash when parsing subnormal floats with compiled package #87

Closed

quinnj mentioned this issue Oct 14, 2021

Increase accuracy of parsing subnormal floats #92

Merged

quinnj closed this as completed in #92 Oct 14, 2021
