ENH: json_normalize should work with JSON #61006

jessekv · 2025-02-25T16:09:43Z

Feature Type

Adding new functionality to pandas
Changing existing functionality in pandas
Removing existing functionality in pandas

Problem Description

I wish pd.json_normalize accepted JSON (as str or bytes), and not just dict.

Or, as a joke, there could be a pd.dict_normalize that only accepts JSON ;)

Feature Description

Given a Series with JSON as str or bytes:

>>> df["data"]
0                  {"value":0.0}
1          {"value":0.005787037}
2         {"value":0.0115740741}
3         {"value":0.0173611111}

It should be possible to parse the JSON with pd.json_normalize, e.g.

>>> pd.json_normalize(df["data"])
            value
0        0.000000
1        0.005787
2        0.011574
3        0.017361

Pandas already has good JSON integration, so don't see why it can't be done.

Alternative Solutions

From what I understand, right now it must be first parsed with some other library, e.g. with apply, before using pd.json_normalize.

>>> import json
>>> pd.json_normalize(df["data"].apply(json.loads))
            value
0        0.000000
1        0.005787
2        0.011574
3        0.017361

Additional Context

With better JSON/JSONB support in databases like postgres and sqlite, encountering this sort of data is becoming more common, and the intermediate apply step is a performance and usability issue:

>>> import json
>>> df = pd.read_sql(sql=query, con=conn)
>>> pd.json_normalize(df["data"].apply(json.loads))
            value
0        0.000000
1        0.005787
2        0.011574
3        0.017361

The text was updated successfully, but these errors were encountered:

arthurlw · 2025-02-28T00:43:55Z

take

Abhibhav2003 · 2025-03-01T14:35:17Z

Can you please clarify @vdwees

Is this what you are expecting ?

df = pd.json_normalize([
    '{"value": 0.0}',
    '{"value": 0.005787037}',
    '{"value": 0.0115740741}',
    '{"value": 0.0173611111}'
])

print(df)

Output :

value
0.0
0.005787037
0.0115740741
0.0173611111

mroeschke · 2025-03-05T19:00:08Z

As explained in #61056 (review), I would be -1 on the feature request.

Thanks for the suggestion, but going to close this issue

jessekv added Enhancement Needs Triage labels Feb 25, 2025

github-actions bot assigned arthurlw Feb 28, 2025

arthurlw mentioned this issue Mar 4, 2025

ENH: json_normalize accepts JSON with str and bytes input #61056

Closed

4 tasks

mroeschke closed this as completed Mar 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: json_normalize should work with JSON #61006

ENH: json_normalize should work with JSON #61006

jessekv commented Feb 25, 2025 •

edited

Loading

arthurlw commented Feb 28, 2025

Abhibhav2003 commented Mar 1, 2025 •

edited

Loading

mroeschke commented Mar 5, 2025

ENH: json_normalize should work with JSON #61006

ENH: json_normalize should work with JSON #61006

Comments

jessekv commented Feb 25, 2025 • edited Loading

Feature Type

Problem Description

Feature Description

Alternative Solutions

Additional Context

arthurlw commented Feb 28, 2025

Abhibhav2003 commented Mar 1, 2025 • edited Loading

mroeschke commented Mar 5, 2025

jessekv commented Feb 25, 2025 •

edited

Loading

Abhibhav2003 commented Mar 1, 2025 •

edited

Loading