
Extended JSON encode/decode protocol #12796

Closed · 30 commits

Conversation

@nstarman (Member) commented Jan 26, 2022

Signed-off-by: Nathaniel Starkman (@nstarman) nstarkman@protonmail.com

Closes #12793

Improvement suggestions appreciated.

I now have the shape of things, but details need filling in. I like the idea of the registration system and the entry points.
The tests need work: they are repetitive and should be shortened using pytest features.
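
The registration idea could be sketched roughly as below. This is a minimal illustration only, not the PR's actual API: the name `register_json_extended` and the linear `isinstance` lookup are assumptions for demonstration.

```python
import json

# Hypothetical sketch of a registration system for an extended JSON
# encoder: encoder functions are registered per type, and the encoder's
# default() looks them up when it meets a non-JSON-native object.
_ENCODERS = {}

def register_json_extended(type_):
    """Register an encoder function for `type_` (illustrative name)."""
    def decorator(func):
        _ENCODERS[type_] = func
        return func
    return decorator

class ExtendedEncoder(json.JSONEncoder):
    def default(self, obj):
        # Linear scan keeps the sketch simple; a real implementation
        # might dispatch on type or use functools.singledispatch.
        for type_, func in _ENCODERS.items():
            if isinstance(obj, type_):
                return func(obj)
        return super().default(obj)

@register_json_extended(complex)
def encode_complex(z):
    # Tag the object with its qualified class name under "!", matching
    # the convention used in the examples below.
    return {"!": "builtins.complex", "value": [z.real, z.imag]}
```

With this in place, `json.dumps(1 + 2j, cls=ExtendedEncoder)` produces a tagged dict rather than raising `TypeError`, while ordinary JSON-native values pass through untouched.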

The helper below shows how the following examples are produced:

import json
import numpy as np
import astropy.coordinates as coord
import astropy.units as u
from astropy.cosmology import units as cu
from astropy.io.misc.json import JSONExtendedEncoder, JSONExtendedDecoder

def show(val):
    print("val:", repr(val), end="\n")
    serialized = json.dumps(val, cls=JSONExtendedEncoder)
    print("dump:", serialized, end="\n")
    with u.add_enabled_units(cu):
        out = json.loads(serialized, cls=JSONExtendedDecoder)
    print("load:", repr(out))
>>> show(np.dtype(float))
val: dtype('float64')
dump: {"!": "numpy.dtype", "value": "float64"}
load: dtype('float64')
>>> show(np.dtype(np.dtype("float", metadata={"a": 1})))
val: dtype('float64')
dump: {"!": "numpy.dtype", "value": "float64", "metadata": {"a": 1}}
load: dtype('float64')
>>> show(np.dtype("10float64", align=True))
val: dtype(('<f8', (10,)))
dump: {"!": "numpy.dtype", "value": "float64", "shape": [10]}
load: dtype(('<f8', (10,)))
>>> show(np.bool_(True))
val: True
dump: {"!": "numpy.bool_", "value": true}
load: True
>>> show(np.float128("10.0000002"))
val: 10.0000002
dump: {"!": "numpy.float128", "value": "10.0000002000"}
load: 10.0000002
>>> show(np.uintc(10))
val: 10
dump: {"!": "numpy.uint32", "value": "10"}
load: 10
>>> show(np.void(b'abcd'))
val: void(b'\x61\x62\x63\x64')
dump: {"!": "numpy.void", "value": {"!": "builtins.bytes", "value": "abcd"}, "dtype": "|V4"}
load: void(b'\x61\x62\x63\x64')
>>> show(np.array((0, 0.60), dtype=np.dtype([("nu1", float), ("nu2", np.float32)]))[()])
val: (0., 0.6)
dump: {"!": "numpy.void", "value": ["0.0", "0.6"], "dtype": {"value": {"nu1": ["float64", 0], "nu2": ["float32", 8]}, "align": false}}
load: (0., 0.6)
>>> show(np.array(np.float64(10)))
val: array(10.)
dump: {"!": "numpy.ndarray", "value": "10.0", "dtype": "float64"}
load: array(10.)
>>> show(np.array([3], dtype=float))
val: array([3.])
dump: {"!": "numpy.ndarray", "value": ["3.0"], "dtype": "float64"}
load: array([3.])
>>> show(np.array([3], dtype=np.float128))
val: array([3.], dtype=float128)
dump: {"!": "numpy.ndarray", "value": ["3.0"], "dtype": "float128"}
load: array([3.], dtype=float128)
>>> show(np.array((0, 0.6), dtype=np.dtype([("nu1", float), ("nu2", np.float32)])))
val: array((0., 0.6), dtype=[('nu1', '<f8'), ('nu2', '<f4')])
dump: {"!": "numpy.ndarray", "value": {"nu1": {"!": "numpy.ndarray", "value": "0.0", "dtype": "float64"}, "nu2": {"!": "numpy.ndarray", "value": "0.6", "dtype": "float32"}}, "dtype": {"value": {"nu1": ["float64", 0], "nu2": ["float32", 8]}, "align": false}}
load: array([(0., 0.6)], dtype=[('nu1', '<f8'), ('nu2', '<f4')])
>>> show(np.array([(0, 0.6), (1, 1.6)], dtype=np.dtype([("nu1", float), ("nu2", np.float32)])))
val: array([(0., 0.6), (1., 1.6)], dtype=[('nu1', '<f8'), ('nu2', '<f4')])
dump: {"!": "numpy.ndarray", "value": {"nu1": {"!": "numpy.ndarray", "value": ["0.0", "1.0"], "dtype": "float64", "flags": {"align": false}}, "nu2": {"!": "numpy.ndarray", "value": ["0.6", "1.6"], "dtype": "float32"}}, "dtype": {"value": {"nu1": ["float64", 0], "nu2": ["float32", 8]}, "align": false}}
load: array([(0., 0.6), (1., 1.6)], dtype=[('nu1', '<f8'), ('nu2', '<f4')])
>>> show(u.Unit("km"))
val: Unit("km")
dump: {"!": "astropy.units.PrefixUnit", "value": "km"}
load: Unit("km")
>>> show(u.Unit("(km, km, (eV^2, eV))"))
val: Unit("(km, km, (eV2, eV))")
dump: {"!": "astropy.units.StructuredUnit", "value": "(km, km, (eV2, eV))"}
load: Unit("(km, km, (eV2, eV))")
>>> show(u.km * u.eV**2)
val: Unit("eV2 km")
dump: {"!": "astropy.units.CompositeUnit", "value": "eV2 km"}
load: Unit("eV2 km")
>>> show(u.Quantity(10))
val: <Quantity 10.>
dump: {"!": "astropy.units.Quantity", "value": 10.0, "unit": "dimensionless"}
load: <Quantity 10.>
>>> show(u.Quantity((0, 0.6), dtype=np.dtype([("nu1", float), ("nu2", np.float32)]), unit=u.Unit("(eV, eV)")))
val: <Quantity (0., 0.6) (eV, eV)>
dump: {"!": "astropy.units.Quantity", "value": {"!": "numpy.void", "value": ["0.0", "0.6"], "dtype": {"value": {"nu1": ["float64", 0], "nu2": ["float32", 8]}, "align": false}}, "unit": "(eV, eV)"}
load: <Quantity (0., 0.6) (eV, eV)>
>>> show(u.Quantity(np.float128(10)))
val: <Quantity 10.>
dump: {"!": "astropy.units.Quantity", "value": {"!": "numpy.float128", "value": "10.0"}, "unit": "dimensionless"}
load: <Quantity 10.>
>>> show(u.Quantity([(0, 0.6)], dtype=np.dtype([("nu1", float), ("nu2", np.float32)]), unit=u.Unit("(eV, eV)")))
val: <Quantity [(0., 0.6)] (eV, eV)>
dump: {"!": "astropy.units.Quantity", "value": {"!": "numpy.ndarray", "value": {"nu1": {"!": "numpy.ndarray", "value": ["0.0"], "dtype": "float64"}, "nu2": {"!": "numpy.ndarray", "value": ["0.6"], "dtype": "float32"}}, "dtype": {"value": {"nu1": ["float64", 0], "nu2": ["float32", 8]}, "align": false}}, "unit": "(eV, eV)"}
load: <Quantity [(0., 0.6)] (eV, eV)>
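
The decode side of the `"!"`-tag convention seen in the dumps above could be sketched like this. Again a hypothetical minimal version, not the PR's `JSONExtendedDecoder` itself: the `_DECODERS` registry and `_hook` are illustrative names.

```python
import json

# Hypothetical decode-side sketch: the "!" key names the qualified
# class, and a registered hook rebuilds the object from the dict.
_DECODERS = {"builtins.complex": lambda d: complex(*d["value"])}

class ExtendedDecoder(json.JSONDecoder):
    def __init__(self, **kwargs):
        # object_hook is called on every decoded JSON object (dict).
        kwargs.setdefault("object_hook", self._hook)
        super().__init__(**kwargs)

    @staticmethod
    def _hook(d):
        func = _DECODERS.get(d.get("!"))
        # Untagged dicts pass through unchanged.
        return func(d) if func is not None else d
```

A dict without a recognized `"!"` tag is returned as-is, so plain JSON documents round-trip unaffected.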

Checklist for package maintainer(s)

This checklist is meant to remind the package maintainer(s) who will review this pull request of some common things to look for. This list is not exhaustive.

  • Do the proposed changes actually accomplish desired goals?
  • Do the proposed changes follow the Astropy coding guidelines?
  • Are tests added/updated as required? If so, do they follow the Astropy testing guidelines?
  • Are docs added/updated as required? If so, do they follow the Astropy documentation guidelines?
  • Is rebase and/or squash necessary? If so, please provide the author with appropriate instructions. Also see "When to rebase and squash commits".
  • Did the CI pass? If no, are the failures related? If you need to run daily and weekly cron jobs as part of the PR, please apply the Extra CI label.
  • Is a change log needed? If yes, did the change log check pass? If no, add the no-changelog-entry-needed label. If this is a manual backport, use the skip-changelog-checks label unless special changelog handling is necessary.
  • Is a milestone set? Milestone must be set but astropy-bot check might be missing; do not let the green checkmark fool you.
  • At the time of adding the milestone, if the milestone set requires a backport to release branch(es), apply the appropriate backport-X.Y.x label(s) before merge.

@nstarman nstarman added this to the v5.1 milestone Jan 26, 2022
@github-actions

👋 Thank you for your draft pull request! Do you know that you can use [ci skip] or [skip ci] in your commit messages to skip running continuous integration tests until you are ready?

@nstarman nstarman force-pushed the utils-json-extended branch 20 times, most recently from ae70171 to 60c06c0 Compare January 31, 2022 17:02
@nstarman nstarman requested a review from mhvk January 31, 2022 17:18
@nstarman nstarman force-pushed the utils-json-extended branch 3 times, most recently from 15e355c to 01499f4 Compare February 10, 2022 16:31
@astropy astropy deleted a comment from pep8speaks Feb 10, 2022
@nstarman
Member Author

nstarman commented Feb 11, 2022

@mhvk, I put a number of examples in the opening comment. I think the structured ndarray is still a little too verbose, but other than that, it's looking pretty good. Do you have any comments / suggestions?
In particular, I'm looking to improve the construction of the test suite.

@pep8speaks

pep8speaks commented Feb 11, 2022

Hello @nstarman 👋! It looks like you've made some changes in your pull request, so I've checked the code again for style.

Line 335:15: E999 SyntaxError: invalid syntax

Line 24:13: E999 SyntaxError: invalid syntax

Comment last updated at 2022-02-16 02:57:43 UTC

Signed-off-by: Nathaniel Starkman (@nstarman) <nstarkman@protonmail.com>
@nstarman nstarman force-pushed the utils-json-extended branch 5 times, most recently from 00184d5 to b8cdc7b Compare February 15, 2022 21:21
@nstarman nstarman force-pushed the utils-json-extended branch 2 times, most recently from 5ff1b72 to 9480c84 Compare February 16, 2022 02:14
@mhvk (Contributor) left a comment

I only had time for a very rough look, but I like that we may actually support JSON fully. Two main comments/questions:

  1. How will this deal with larger arrays? My sense is that the binary blob used for yaml may be the best solution for anything that is not a scalar. I do worry that float128 does not seem to roundtrip exactly. I thought the numpy machinery took care of using the right number of significant digits.
  2. I worry about duplication... Would it be possible to share some of the machinery with the yaml encoders/decoders? That would also have the advantage of making the representation uniform. Similarly, there is asdf which also encodes all the astropy classes (and seems based on a combination of yaml and json).

p.s. Small question: do we care about the names of structured units?

@nstarman
Member Author

nstarman commented Feb 21, 2022

Thanks @mhvk.

  1. How will this deal with larger arrays? My sense is that the binary blob used for yaml may be the best solution for anything that is not a scalar. I do worry that float128 does not seem to roundtrip exactly. I thought the numpy machinery took care of using the right number of significant digits.

The float128 is actually round-tripping exactly; the problem is that the input was constructed poorly. I wrote np.float128(10.0000002), but Python's float inexactness meant 10.0000002 was really 10.000000200000000561. I've updated the example to np.float128("10.0000002"), and the round-trip is correct.
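
The inexactness described above can be demonstrated with the stdlib `decimal` module, which exposes the exact binary value a float literal actually stores:

```python
from decimal import Decimal

# The literal 10.0000002 is stored as the nearest binary double,
# which is close to but not exactly 10.0000002.
stored = Decimal(10.0000002)   # exact value of the Python float
exact = Decimal("10.0000002")  # the intended decimal value

# The two differ, which is why np.float128(10.0000002) inherits the
# double-precision error while np.float128("10.0000002") does not.
assert stored != exact
```

Passing the string to `np.float128` lets numpy parse it at extended precision, so the inexact double is never created.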

A binary blob might work nicely for non-scalars. I can use np.ndarray.dumps() to get a very efficient representation and then pickle.loads() to round-trip. While not as pretty as the current output, I agree it will probably scale better.
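
One way a binary-blob representation could look is sketched below. This is an assumption, not the PR's implementation, and it deliberately uses base64-encoded raw bytes plus dtype and shape rather than the `np.ndarray.dumps()`/`pickle.loads()` route mentioned above, since unpickling data from untrusted JSON would be unsafe.

```python
import base64
import json
import numpy as np

# Hypothetical binary-blob encoding for arrays (simple, non-structured
# dtypes only): raw bytes + dtype string + shape.
def encode_array(arr):
    return {"!": "numpy.ndarray",
            "value": base64.b64encode(arr.tobytes()).decode("ascii"),
            "dtype": arr.dtype.str,
            "shape": list(arr.shape)}

def decode_array(d):
    data = base64.b64decode(d["value"])
    return np.frombuffer(data, dtype=d["dtype"]).reshape(d["shape"])

arr = np.arange(6, dtype="<f8").reshape(2, 3)
roundtripped = decode_array(json.loads(json.dumps(encode_array(arr))))
assert np.array_equal(arr, roundtripped)
```

The blob grows linearly with the array size and avoids the per-element stringification seen in the structured-array dumps above, at the cost of human readability.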

  2. I worry about duplication... Would it be possible to share some of the machinery with the yaml encoders/decoders? That would also have the advantage of making the representation uniform. Similarly, there is asdf which also encodes all the astropy classes (and seems based on a combination of yaml and json).

That would be a very good idea! I'll look into it. I fear the YAML code is pretty tailored to the YAML library. Sharing machinery might require a refactor to the YAML code. But the uniformity is a compelling argument. Especially since YAML is a superset of JSON.

p.s. Small question: do we care about the names of structured units?

Yes! I will correct that. I'll need to test what happens when a structured array with dtype names encounters a structured unit with names...

@nstarman
Member Author

I fear the YAML code is pretty tailored to the YAML library. Sharing machinery might require a refactor to the YAML code. But the uniformity is a compelling argument. Especially since YAML is a superset of JSON.

Yes. The YAML code is not easily compatible with JSON. Getting them to a common code base will require a lot of work.

@nstarman
Member Author

I don't think I have the time to refactor the YAML I/O until it can work with JSON.

@nstarman nstarman closed this Mar 24, 2022
Successfully merging this pull request may close these issues.

On a decoder for JsonCustomEncoder