Cleanup `response.json()` method. by lovelydinosaur · Pull Request #2911 · encode/httpx

lovelydinosaur · 2023-10-30T14:34:20Z

The Response.json() method appears unnecessarily complicated, I think this is a historical appendix that can be surgically tidied up. The stdlib implementation of json.loads already includes the charset detection that we're currently providing with our internal guess_json_utf() function.

Switch test_utils to use a public API for testing JSON utf-8 encodings.
Remove the internal guess_json_utf() function.
Simpliy the Response.json() method.

T-256

Agree with this change.
No more need _guess_content_json_utf at #2851.

T-256 · 2023-10-30T15:51:37Z

#2851 now included this change, can you review that? I think we can merge #2851 instead of this PR.

lovelydinosaur · 2023-10-30T19:06:22Z

can you review that?

👍

whitehatboxer · 2024-06-06T03:03:31Z

Sorry to bother all, but in my situation, changes in response.json() from json.loads(self.text) to json.loads(self.content) have caused UnicodeDecodeError: 'utf-8' codec can't decode byte 0xaa in position 351: invalid start byte.

It cames from that I found some test failed in my new system. I diffed it with my old one, and found there was a difference between httpx from 0.27.0 to 0.22.0. The new system had the new version.

So I had downgraded httpx from 0.27.0 to 0.22.0, and really resloved this bug.

After that, I tried to found the reason. I delved into code, and thought it was guess_json_utf, so I searched this keyword and found this pr.
I tried to add some debug cod in _models.py, it turned out that no deal with guess_json_utf.

    def json(self, **kwargs: typing.Any) -> typing.Any:
        if self.charset_encoding is None and self.content and len(self.content) > 3:
            encoding = guess_json_utf(self.content)
            print("==== encoding was: ", encoding)
            if encoding is not None:
                return jsonlib.loads(self.content.decode(encoding), **kwargs)
        print("==== no encoding")
        return jsonlib.loads(self.text, **kwargs)

Then I guessed that using self.text is necessary in some uncommon situation.

lovelydinosaur · 2024-06-06T09:33:11Z

Then I guessed that using self.text is necessary in some uncommon situation.

The server is likely returning non-compliant JSON.

https://datatracker.ietf.org/doc/html/rfc8259#section-8.1

JSON text exchanged between systems that are not part of a closed
ecosystem MUST be encoded using UTF-8 [RFC3629].

Previous specifications of JSON have not required the use of UTF-8
when transmitting JSON text. However, the vast majority of JSON-
based software implementations have chosen to use the UTF-8 encoding,
to the extent that it is the only encoding that achieves
interoperability.

Implementations MUST NOT add a byte order mark (U+FEFF) to the
beginning of a networked-transmitted JSON text. In the interests of
interoperability, implementations that parse JSON texts MAY ignore
the presence of a byte order mark rather than treating it as an
error.

Take a look at what Content-Type is being returned.

Also save response.content and determine from the prior version of guess_json_utf what encoding is being used there.

If you're able to resolve the issue on the server-side, then do that because it appears to be a server-side issue. Otherwise use json.loads(response.text) or json.loads(response.content.decode(whatever)).

alexmojaki · 2024-12-18T15:56:52Z

-            encoding = guess_json_utf(self.content)
-            if encoding is not None:
-                return jsonlib.loads(self.content.decode(encoding), **kwargs)
-        return jsonlib.loads(self.text, **kwargs)


Just a random thing I noticed looking through the code, not a problem I've seen in the wild: isn't using self.text like this this better than the new version when there's a non-utf charset?

JSON is spec'ed as always UTF encoded.

Co-authored-by: Kar Petrosyan <92274156+karpetrosyan@users.noreply.github.com>

Cleanup response.json() method

2e7596e

lovelydinosaur added the refactor Issues and PRs related to code refactoring label Oct 30, 2023

lovelydinosaur marked this pull request as ready for review October 30, 2023 14:36

T-256 approved these changes Oct 30, 2023

View reviewed changes

lovelydinosaur and others added 2 commits October 30, 2023 19:15

Merge branch 'master' into cleanup-json-decoding

5aa3010

Merge branch 'master' into cleanup-json-decoding

51c579a

karpetrosyan approved these changes Oct 31, 2023

View reviewed changes

lovelydinosaur merged commit 1d73150 into master Oct 31, 2023

lovelydinosaur deleted the cleanup-json-decoding branch October 31, 2023 10:10

alexmojaki reviewed Dec 18, 2024

View reviewed changes

samclearman pushed a commit to titanmsp/httpx that referenced this pull request Apr 26, 2025

Cleanup response.json() method (encode#2911)

4d78185

Co-authored-by: Kar Petrosyan <92274156+karpetrosyan@users.noreply.github.com>

coderabbitai Bot mentioned this pull request Apr 11, 2026

feat: implement external service integration APIs Aureliolo/synthorg#1240

Merged

This was referenced May 3, 2026

feat(scripts): forgejo-mirror-github.py — bulk-mirror GH repos into Forgejo AlverezYari/phillips-homelab#229

Merged

[Feat] 아바타 모드를 위한 FastAPI 코드 수정 CapstoneDgu/NUNCHI-AI#22

Merged

coderabbitai Bot mentioned this pull request May 13, 2026

feat(cli): implement all six OrcaMind CLI commands with full test suite AruneemB/orca#12

Merged

11 tasks

coderabbitai Bot mentioned this pull request May 25, 2026

feat: add bocha web search component langflow-ai/langflow#13322

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Cleanup `response.json()` method.#2911

Cleanup `response.json()` method.#2911
lovelydinosaur merged 3 commits into
masterfrom
cleanup-json-decoding

lovelydinosaur commented Oct 30, 2023

Uh oh!

T-256 left a comment

Uh oh!

T-256 commented Oct 30, 2023

Uh oh!

lovelydinosaur commented Oct 30, 2023

Uh oh!

whitehatboxer commented Jun 6, 2024

Uh oh!

lovelydinosaur commented Jun 6, 2024

Uh oh!

alexmojaki Dec 18, 2024

Uh oh!

lovelydinosaur Dec 19, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

lovelydinosaur commented Oct 30, 2023

Uh oh!

T-256 left a comment

Choose a reason for hiding this comment

Uh oh!

T-256 commented Oct 30, 2023

Uh oh!

lovelydinosaur commented Oct 30, 2023

Uh oh!

whitehatboxer commented Jun 6, 2024

Uh oh!

lovelydinosaur commented Jun 6, 2024

Uh oh!

alexmojaki Dec 18, 2024

Choose a reason for hiding this comment

Uh oh!

lovelydinosaur Dec 19, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants