behavior in case of overflow/underflow differs from std::from_chars specification? #120

ppalka-rh · 2022-01-17T20:03:12Z

When the parsed value is outside the representable range, such as on input "1e-10000" and "1e+10000", it seems fast_float::from_chars sets the 'value' output parameter to 0 and infinity respectively and returns std::errc{}.

But the specification for std::from_chars (http://eel.is/c++draft/charconv.from.chars#1) says:

If the parsed value is not in the range representable by the type of value, value is unmodified and the member ec of the return value is equal to errc::result_out_of_range.

Is this deviation from the C++ standard intended?

When integrating fast_float into libstdc++, we adjusted this behavior with the following patch: https://gcc.gnu.org/git/?p=gcc.git;a=commitdiff;h=40b0d4472a2591cf27f3a81aa3fba57dc4532648
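For reference, the quoted wording can be checked with a small probe. This is only a sketch: `probe` and `probe_result` are hypothetical names made up for this comment, and the result depends on the standard library following the wording above (floating-point `std::from_chars` also has to be available in the toolchain).

```cpp
#include <charconv>
#include <string_view>
#include <system_error>

// Hypothetical helper type: the error code plus whatever is left in the
// output variable after the call.
struct probe_result {
    std::errc ec;
    double value;
};

// Parse `text` into a double that starts out holding `sentinel`, so the
// caller can tell whether from_chars wrote to the output parameter.
inline probe_result probe(std::string_view text, double sentinel) {
    double value = sentinel;
    auto [ptr, ec] = std::from_chars(text.data(), text.data() + text.size(), value);
    (void)ptr;  // the end pointer is not needed for this check
    return {ec, value};
}
```

Under the quoted specification, `probe("1e+10000", -1.0)` should report `std::errc::result_out_of_range` and leave the sentinel `-1.0` in place, whereas the behavior described at the top of this issue would give `std::errc{}` and infinity.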

lemire · 2022-01-17T20:35:03Z

The from_chars supports various number types, including integer types. For integer types, there is a finite range.
My view is that the range of values for binary64 and binary32 numbers is from -infinity to infinity, so all real numbers are in range. Not all numbers can be represented exactly, but that has nothing to do with the range, because the conversion from text to binary is lossy: we always seek to assign the best match.

I covered my stance with respect to this issue in the following PR:
#113
Given a decimal representation, you always seek the best match. My view is that 1e99999 is indeed representable with a binary64 number: it is infinity. I am not exactly sure on what account you would consider that 1e-10000 is not in range. What range is that?

jwakely · 2022-01-17T21:05:43Z

I think an LWG issue is warranted, since we have two reasonable but conflicting interpretations of the standard.

alugowski · 2023-03-06T22:06:14Z

> Given a decimal representation, you always seek the best match. My view is that 1e99999 is indeed representable with a binary64 number: it is infinity.

Agreed.

However, this assumes that the user only cares about the best representation of the input in a double (like the JavaScript or Python examples in the link).

A more interesting value is 1e40. It does not fit in a binary32, but does in a binary64. A user may wish to use 32-bit code if the values fit and 64-bit code otherwise. A parser that returns infinity on overflow makes such approaches much more difficult to write. The opposite is easier. Same argument for 64-bit overflow as some platforms offer 80-bit floats and of course variable-precision libraries exist.
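A minimal sketch of that "32-bit if it fits, 64-bit otherwise" approach, assuming a parser that reports `errc::result_out_of_range` the way the `std::from_chars` wording describes. `parse_narrowest` and `narrow_or_wide` are hypothetical names, and "fits" here means range only, not precision.

```cpp
#include <charconv>
#include <optional>
#include <string_view>
#include <system_error>
#include <variant>

// Hypothetical result type: the value in the narrowest type whose range holds it.
using narrow_or_wide = std::variant<float, double>;

inline std::optional<narrow_or_wide> parse_narrowest(std::string_view text) {
    const char* first = text.data();
    const char* last = first + text.size();

    // Try binary32 first.
    float f = 0.0f;
    auto rf = std::from_chars(first, last, f);
    if (rf.ec == std::errc{}) return narrow_or_wide{f};
    // Anything other than a range error means the text is not a number.
    if (rf.ec != std::errc::result_out_of_range) return std::nullopt;

    // Range error for float: retry as binary64.
    double d = 0.0;
    auto rd = std::from_chars(first, last, d);
    if (rd.ec == std::errc{}) return narrow_or_wide{d};
    return std::nullopt;  // out of range even for double
}
```

With a parser that returns infinity and `std::errc{}` on overflow, the first branch would always succeed, and the caller would need an extra `std::isinf` check plus a way to tell a literal "inf" from an overflowed "1e40".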

I'd also argue that a C++ library should match the behavior of the C++ standard library, not Python, JavaScript, or Swift. AFAIK all floating-point parsers in the C and C++ standard libraries have a way to distinguish overflow.
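As one example of what I mean, `strtod` returns `HUGE_VAL` on overflow and also sets `errno` to `ERANGE`. A rough sketch of reading both signals (the `parse_with_strtod` wrapper and `strtod_outcome` struct are names I made up for illustration):

```cpp
#include <cerrno>
#include <cstdlib>

// Hypothetical wrapper type: the value strtod returned plus whether it
// flagged a range error through errno.
struct strtod_outcome {
    double value;
    bool range_error;
};

inline strtod_outcome parse_with_strtod(const char* text) {
    errno = 0;  // strtod only sets errno on error, so clear it first
    double value = std::strtod(text, nullptr);
    return {value, errno == ERANGE};
}
```

For "1e+10000" this gives a huge value (infinity on IEEE 754 platforms) together with `range_error == true`. Whether underflow sets `ERANGE` is implementation-defined in C, so I would not rely on it for "1e-10000".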

Heck, you can easily do both. The error code is separate from the return value, so just return infinity (or -infinity, or whatever) but still set result.ec = errc::result_out_of_range. Users that care can distinguish overflow, users that don't can just treat errc::result_out_of_range as success.
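To make the "do both" idea concrete, here is a sketch of the caller side. Everything in it is hypothetical: `parsed_double` stands in for whatever a parser hands back under this contract, and `accept_strict` / `accept_saturating` are just two policies a user might pick.

```cpp
#include <limits>
#include <optional>
#include <system_error>

// Hypothetical shape of a parse result under the proposed contract:
// the best-match value is always filled in, and ec flags over/underflow.
struct parsed_double {
    double value;
    std::errc ec;
};

// What the proposed contract would report for an input like "1e+10000".
inline parsed_double overflow_example() {
    return {std::numeric_limits<double>::infinity(), std::errc::result_out_of_range};
}

// Policy for users who care: any error, including out-of-range, is a failure.
inline std::optional<double> accept_strict(parsed_double r) {
    if (r.ec != std::errc{}) return std::nullopt;
    return r.value;
}

// Policy for users who don't: out-of-range counts as success and the
// best-match value (0 or +/-infinity) is used.
inline std::optional<double> accept_saturating(parsed_double r) {
    if (r.ec == std::errc{} || r.ec == std::errc::result_out_of_range) return r.value;
    return std::nullopt;
}
```

Neither policy needs to re-parse the input or inspect the value itself, which is the point of keeping the error code and the value separate.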

lemire · 2023-03-07T03:08:11Z

@alugowski That's reasonable. Pull request invited.

I don't consider the current behaviour incorrect but if someone wants to contribute the code for the functionality that @alugowski described, then we will merge it.

alugowski · 2023-03-29T05:45:52Z

> @alugowski That's reasonable. Pull request invited.
>
> I don't consider the current behaviour incorrect but if someone wants to contribute the code for the functionality that @alugowski described, then we will merge it.

PR submitted with the behavior as I described above: #189

The main difference from @pppalka 's patch is that the check is one line lower to still return the best match value.

jwakely · 2023-03-29T10:30:37Z

> I think an LWG issue is warranted, since we have two reasonable but conflicting interpretations of the standard.

Actually I think I'll add a comment to https://cplusplus.github.io/LWG/issue3081

lemire · 2023-03-30T22:35:55Z

I will close this issue given that I just merged #189

lemire changed the title ~~behavior in case of overflow/underflow differs from std::from_chars specification~~ behavior in case of overflow/underflow differs from std::from_chars specification? Jan 17, 2022

alugowski mentioned this issue Mar 29, 2023

Set errc::result_out_of_range on over/underflow #189

Merged

lemire closed this as completed Mar 30, 2023

lemire mentioned this issue Aug 22, 2024

result modified when result_out_of_range #261

Closed
