Infinity handling differs to python native json #80

telegraphic · 2013-04-19T05:58:49Z

It looks like ujson handles infinity different to the native json encoder:

import ujson, json, numpy

a = np.array([1,2,3,4,numpy.inf])
b = json.dumps({"test" : a.tolist()})
# outputs '{"test": [1,2,3,4,5, Infinity]}'
b= ujson.dumps({"test" : a.tolist()})
# raises OverflowError: Invalid Inf value when encoding double

Similarly, an Overflow error is raised when NaN is encountered. It seems the relevant lines are 499-508 in ultrajsonenc.c:

    if (value == HUGE_VAL || value == -HUGE_VAL)
    {
        SetError (obj, enc, "Invalid Inf value when encoding double");
        return FALSE;
    }
    if (! (value == value)) 
    {
        SetError (obj, enc, "Invalid Nan value when encoding double");
        return FALSE;
    }

In the interests of keeping ujson as a drop-in replacement for json, may I suggest this is changed so that it converts the NaN / Infinity to strings (like json) and doesn't raise an error? The decoder would likely need to be changed too...

Cheers
Dan

jskorpan · 2013-04-19T06:18:01Z

The specification on json.org is very clear about numeric types only being numbers or exponents.
//JT

From: Danny Price [mailto:notifications@github.com]
Sent: den 19 april 2013 07:59
To: esnme/ultrajson
Subject: [ultrajson] Infinity handling differs to python native json (#80)

It looks like ujson handles infinity different to the native json encoder:

import ujson, json, numpy

a = np.array([1,2,3,4,numpy.inf])

b = json.dumps({"test" : a.tolist()})

outputs '{"test": [1,2,3,4,5, Infinity]}'

b= ujson.dumps({"test" : a.tolist()})

raises OverflowError: Invalid Inf value when encoding double

Similarly, an Overflow error is raised when NaN is encountered. It seems the relevant lines are 499-508 in ultrajsonenc.chttps://github.com/esnme/ultrajson/blob/master/lib/ultrajsonenc.c#L499:

if (value == HUGE_VAL || value == -HUGE_VAL)

{

    SetError (obj, enc, "Invalid Inf value when encoding double");

    return FALSE;

}

if (! (value == value))

{

    SetError (obj, enc, "Invalid Nan value when encoding double");

    return FALSE;

}

In the interests of keeping ujson as a drop-in replacement for json, may I suggest this is changed so that it converts the NaN / Infinity to strings (like json) and doesn't raise an error? The decoder would likely need to be changed too...

Cheers
Dan

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/80.

telegraphic · 2013-04-19T07:03:38Z

Good point - for anyone else interested there's a discussion on StackOverflow of exactly this. I find it bemusing that JSON cannot fully represent IEE754 floating point numbers!

However, if ujson is intended as a drop-in replacement for python's json, I maintain that handling inf/nans in the exact same way would be preferable to raising an exception. So, may I instead request for an argument allow_nan for loading/dumping:
allow_nan is False (default: True), then it will be a ValueError to serialize out of range float values (nan, inf, -inf) in strict compliance of the JSON specification, instead of using the JavaScript equivalents (NaN, Infinity, -Infinity).

Dan

mthurlin · 2013-04-19T08:29:51Z

I think this should be solved in a more generic way by a allowing a custom encoder/decoder that handles unknown values.

Adding an option for every special case where someone is abusing the JSON-spec is not the way to go IMO.

telegraphic · 2013-04-19T09:47:46Z

I would argue that it's not "abuse" of the JSON spec, but is instead adding extra functionality and compatibility -- already included in simplejson, json, and cjson. Having an identical featureset to the standard json would be fantastic, and I really think handling IEEE floating point is a pretty solid move.

mthurlin · 2013-04-23T07:33:37Z

Well, you are asking a JSON encoder to output invalid JSON...

Also:
http://en.wikipedia.org/wiki/Robustness_principle

telegraphic · 2013-04-23T09:45:18Z

So following the robustness principle, uJson should accept nan and inf on the decode ("code that receives input should accept non-conformant input as long as the meaning is clear"), but shouldn't encode it ("code that sends commands or data to other machines should conform completely to the specifications")?

While this is obviously not what I would prefer, I do see the advantages of following design principles. I'll leave the ball in your court on this one...

jskorpan · 2013-05-17T09:10:24Z

Feel free to contribute a patch towards the robustness principle. Closing this as an issue

hsk81 · 2013-06-09T13:04:38Z

+1 for encoding inf and nan to JavaScript values Infinity and NaN .. it seems to me that the JSON specification is simply insufficient by omitting the full range of IEE754 floating point numbers. Following specs religiously is not the way to go for practical programming.

Joshuaalbert · 2022-08-06T19:59:55Z

Have to agree strongly with @hsk81, and the robustness principle is clear on this too. You SHOULD decode things when the meaning is clear, and you SHOULD NOT encode things against the spec. In programming the meaning of SHOULD and MUST are distinct. (And now I shall use it in practice). You SHOULD NOT follow a rule containing the word SHOULD, when it introduces unnecessary complexity into a fundamental part of many people's code.

bwoodsend · 2022-08-06T20:31:38Z

We do appear to have a bit of a deviation here. ujson writes infinity as Inf rather than Infinity which mismatches JavaScript, Python's json and ujson's own parser so it can't even read back its own JSON!

>>> import math
>>> import ujson
>>> import json

>>> json.dumps([math.nan, math.inf, -math.inf])
'[NaN, Infinity, -Infinity]'
>>> ujson.dumps([math.nan, math.inf, -math.inf])
'[NaN,Inf,-Inf]'

>>> ujson.loads('[NaN,Inf,-Inf]')
ujson.JSONDecodeError: Unexpected character found when decoding 'Infinity'
>>> ujson.loads('[NaN, Infinity, -Infinity]')
[nan, inf, -inf]

And on the topic of JSON's inclusion/exclusion of non finite floats, one of the original JSON authors said that he didn't add it to the spec only because he didn't expect anyone to need it and that if anyone found a good use for them then he would consider his own argument to be moot.

Infinity was being encoded as 'Inf' which, whilst the JSON spec doesn't include any non-finite floats, differs from the conventions in other JSON libraries, JavaScript of using 'Infinity'. It also differs from what `ujson.loads()` expects so that `ujson.loads(ujson.dumps(math.inf))` raises an exception. Closes ultrajson#80.

Infinity was being encoded as 'Inf' which, whilst the JSON spec doesn't include any non-finite floats, differs from the conventions in other JSON libraries, JavaScript of using 'Infinity'. It also differs from what `ujson.loads()` expects so that `ujson.loads(ujson.dumps(math.inf))` raises an exception. Closes #80.

jskorpan closed this as completed May 17, 2013

zack-sampson mentioned this issue Feb 26, 2016

Support Infinity, -Infinity and NaN in read_json pandas-dev/pandas#12213

Closed

dstaley mentioned this issue Oct 26, 2019

Crash with querystring ?wb48617274=803E5708 Kinto/kinto#2312

Closed

bwoodsend reopened this Aug 6, 2022

bwoodsend added the bug Something isn't working label Aug 6, 2022

bwoodsend linked a pull request Aug 6, 2022 that will close this issue

Fix encoding of infinity (#80). #562

Merged

hugovk mentioned this issue Aug 7, 2022

Fix encoding of infinity (#80). #562

Merged

bwoodsend closed this as completed in #562 Aug 8, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Infinity handling differs to python native json #80

Infinity handling differs to python native json #80

telegraphic commented Apr 19, 2013

jskorpan commented Apr 19, 2013

telegraphic commented Apr 19, 2013

mthurlin commented Apr 19, 2013

telegraphic commented Apr 19, 2013

mthurlin commented Apr 23, 2013

telegraphic commented Apr 23, 2013

jskorpan commented May 17, 2013

hsk81 commented Jun 9, 2013

Joshuaalbert commented Aug 6, 2022

bwoodsend commented Aug 6, 2022

Infinity handling differs to python native json #80

Infinity handling differs to python native json #80

Comments

telegraphic commented Apr 19, 2013

jskorpan commented Apr 19, 2013

outputs '{"test": [1,2,3,4,5, Infinity]}'

raises OverflowError: Invalid Inf value when encoding double

telegraphic commented Apr 19, 2013

mthurlin commented Apr 19, 2013

telegraphic commented Apr 19, 2013

mthurlin commented Apr 23, 2013

telegraphic commented Apr 23, 2013

jskorpan commented May 17, 2013

hsk81 commented Jun 9, 2013

Joshuaalbert commented Aug 6, 2022

bwoodsend commented Aug 6, 2022