JSON: improved deserialization into union types #8689

kimburgess · 2020-01-14T15:56:15Z

Modified behaviour when reading into union types to prioritize:

1. native types used by JSON::PullParser
2. casts to numeric types of the same kind in order of precision
3. casts to numeric types of a different kind
4. non-primitive types

Resolves #7333
Supersedes #8675
Extends #8686

RX14 · 2020-01-14T18:00:44Z

src/json/from_json.cr

+      return pull.read_string
+    {% end %}
+    when .int?
+    {% type_order = [Int64, Int32, Int16, Int8, UInt64, UInt32, UInt16, UInt8, Float64, Float32] %}


I'm interested in others' thoughts on ordering Ints and UInts here.

I don't think it matters. Maybe it should be from smaller to bigger. For example, if you have Union(Int8, Int16).from_json("1"), what would you expect? 1 fits in both types.

That's why I'm not sure whether we should do anything about it other than putting floats to the end.

That is, my PR. But whenever I send a PR there are always thoughts, comments, etc. It's so slow to develop in Crystal. Meanwhile in Ruby they push commits without much thought, and it's a success.

I don't think it matters.

JSON::PullParser#read? silently wraps in the case of an overflow, so order is important. Combine that with the alphabetic type order and you end up with Int16 > Int32 > Int64 > Int8, which would likely lead to some interesting behaviour if someone were to parse into a union of indiscriminate types. That was the main motivation for this following your PR.

alias T = Int16 | Int32
a : T = Int16::MAX.to_i + 2     # => 32769
b : T = T.from_json(a.to_json)  # => -32767
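
The -32767 above is what 16-bit two's-complement wrapping of 32769 produces. As a minimal illustration of that arithmetic only, the sketch below uses the standard library's wrapping conversion Int#to_i16! next to the checked Int#to_i16; it is not the parser's actual code path.

```crystal
value = Int16::MAX.to_i + 2 # Int32 arithmetic, so nothing overflows here
wrapped = value.to_i16!     # wrapping conversion: keeps only the low 16 bits

# The checked conversion raises OverflowError instead of wrapping.
checked = begin
  value.to_i16
rescue OverflowError
  nil
end

puts value           # 32769
puts wrapped         # -32767
puts checked.inspect # nil
```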

An initial implementation prioritised smaller types, converted with {{type}}.new and caught Overflow errors along the way, but this seemed like a lot of runtime overhead when the type could be satisfied without it.
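
For readers who want to picture the smallest-type-first alternative described above, here is a rough sketch of the idea. The helpers `try_convert` and `smallest_fit` are hypothetical names invented for illustration and are not taken from the PR; the sketch only assumes that `Int8.new(value)` and friends raise OverflowError when the value does not fit.

```crystal
# Hypothetical helper: attempts a checked conversion to `type`,
# returning nil when the value does not fit.
def try_convert(type : T.class, value : Int64) : T | Nil forall T
  type.new(value)
rescue OverflowError
  nil
end

# Hypothetical helper: tries the narrowest type first and widens on overflow.
# Integers (including 0) are truthy in Crystal, so `||` only falls through on nil.
def smallest_fit(value : Int64)
  try_convert(Int8, value) ||
    try_convert(Int16, value) ||
    try_convert(Int32, value) ||
    value
end

p smallest_fit(1_i64).class             # Int8
p smallest_fit(32_769_i64).class        # Int32
p smallest_fit(5_000_000_000_i64).class # Int64
```

Every failed attempt raises and rescues an exception, which is the runtime overhead mentioned above.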

There is likely a separate discussion needed on whether that silent cast is the correct behaviour for #read?, but in the interests of keeping this focused on Union I've not touched that here.

@bcardiff I'd prefer to always use the largest type in the union, so that if you deserialize Array(Int32 | Int8), you get all Int32s.

Again, this really doesn't matter. If you're doing that, you're doing it wrong.

OK, for Int32 | Int8 it might be more stable to always return Int32 in that case.

I haven't checked how it currently works, but what is your opinion regarding Int32 | Float32? Should it behave as if it were Float32 alone? UInt64::MAX < Float32::MAX, but that is not true for UInt128. So I would prefer to honor Int32 | Float32 depending on whether the input is a float or an int, based on what I said before.

So I would prefer to honor Int32 | Float32 depending if the input is a float or int based on what I said before.

This sounds acceptable, but I'm not sure if the JSON parser should read some context into number literals when there is absolutely no difference between 1.0 and 1 literals in JSON.

I'd probably prefer the largest float data type over any int.

As a user of the std lib, I see a primary argument for reading meaning into that decimal point: it keeps encode/decode behaviour symmetric.
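
A small round-trip sketch of the symmetry argument. It assumes a Crystal version that includes the ordering proposed in this PR, and the alias name `IntOrFloat` is made up for the example. Under that assumption, each value is expected to decode back into the type it was encoded from.

```crystal
require "json"

# Hypothetical alias used only for this illustration.
alias IntOrFloat = Int32 | Float64

[1, 1.0].each do |original|
  json = original.to_json               # "1" for the Int32, "1.0" for the Float64
  decoded = IntOrFloat.from_json(json)  # decoded via the union's deserializer
  puts "#{original.class} -> #{json} -> #{decoded.class}"
end
```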

Looks like we still have some split opinions. In the interests of keeping things moving / forcing a decision, I've amended the order to match what was mentioned to @RX14 above and added specs to demonstrate stability when pulling into a (highly questionable) union of different integer types, as well as symmetry of encode/decode for the Int | Float case.

A re-review would be lovely as people have time.

RX14 reviewed Jan 14, 2020

kimburgess requested a review from RX14 January 15, 2020 00:27

kimburgess mentioned this pull request Jan 17, 2020

JSON::PullParser numeric overflow behaviour #8694

Closed

JSON: improved deserialization into union types

227e371

Modified behaviour when reading into union types to prioritize: 1. native types used by JSON::PullParser 2. casts to numeric types of the same kind in order of precision 3. casts to numeric types of a different kind 4. non-primitive types

RX14 reviewed Jan 25, 2020

RX14 approved these changes Jan 25, 2020

RX14 requested a review from bcardiff January 27, 2020 16:47

bcardiff approved these changes Jan 27, 2020

bcardiff added this to the 0.33.0 milestone Jan 27, 2020

bcardiff added topic:stdlib:serialization kind:bug labels Jan 27, 2020

RX14 merged commit f3d2972 into crystal-lang:master Jan 27, 2020

kimburgess commented Jan 14, 2020

RX14 Jan 14, 2020

kimburgess Jan 15, 2020

asterite Jan 15, 2020

asterite Jan 15, 2020

kimburgess Jan 16, 2020

RX14 Jan 17, 2020

bcardiff Jan 17, 2020

straight-shoota Jan 17, 2020

kimburgess Jan 19, 2020

kimburgess Jan 21, 2020
