Add 'readText()' method in JsonParser #15

cowtowncoder · 2012-05-27T04:22:00Z

Current JsonParser.getText() requires reading of the whole JSON String value as String.
While convenient, this may not be optimal when processing large payloads.

As an alternative method, there should be something like:

boolean readText(Writer w);

which would read JSON String value, and pass it using given Writer; but possibly in separate chunks, without aggregating it. This allows caller to do incremental processing and avoid potentially big temporary memory usage.

In addition, for non-blocking parser implementations, this method could do partial decoding, meaning that it would only parse part of textual value; return value indicating whether full contents (true) or partial content (false) was processed.

Review changes

issue #15 - readtext in jsonparser

MikePieperSer · 2021-04-29T06:52:00Z

I need to parse JSON with huge text fields (up to 500MB). Using the readText(Writer) methods still needs a lot of memory, because it reads the whole text field into memory.

Is there any plan to make this more efficient?

From code reading I would assume that giving the writer down to _finishString() could help here. Then the string finisher could use only one (some?) segment by writing it to the writer if it's full and reusing it.

cowtowncoder · 2021-04-29T17:16:16Z

No one is working on this currently as far as I know; I do not have time to work on this now and probably not for a while (unless I'd need it myself for some reason). But anyone who wants to work on it would be more than welcome to do so!

And yes, lazy initial handling (only decoding opening quote) is intended to allow more efficient read+write operation like you suggest. There are multiple backends (byte-based UTF8, character/Reader-based, async) to consider, but implementation could be relatively simple if it just addresses 2 common ones (Reader/byte-based; maybe DataInput one -- async could not be supported anyway I suspect.

Put another way: the reason this one has not been tackled is not necessarily due to inherent complexity of implementing support when API already exists.

cowtowncoder · 2024-06-09T03:41:01Z

Looks like I re-filed this as #1288; could close that but in this case I'll do the opposite, close this, older issue.

LokeshN added a commit to LokeshN/jackson-core that referenced this issue May 15, 2016

issue FasterXML#15 - readtext in jsonparser

b42a4da

LokeshN added a commit to LokeshN/jackson-core that referenced this issue May 16, 2016

issue FasterXML#15 - readtext in jsonparser

a8855fa

Review changes

LokeshN added a commit to LokeshN/jackson-core that referenced this issue May 16, 2016

issue FasterXML#15 - readtext in jsonparser

373588c

Review changes

LokeshN added a commit to LokeshN/jackson-core that referenced this issue May 18, 2016

issue FasterXML#15 - readtext in jsonparser

ef7e8a2

Review changes

LokeshN added a commit to LokeshN/jackson-core that referenced this issue May 18, 2016

issue FasterXML#15 - readtext in jsonparser

ca17d2f

cowtowncoder added a commit that referenced this issue May 18, 2016

Merge pull request #285 from LokeshN/readtext-jsonparser1

7a0991a

issue #15 - readtext in jsonparser

cowtowncoder closed this as completed Jun 9, 2024

cowtowncoder mentioned this issue Jun 9, 2024

Add new method like JsonParser.readText(Writer) (and implementation) for truly non-buffering reads #1288

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add 'readText()' method in JsonParser #15

Add 'readText()' method in JsonParser #15

cowtowncoder commented May 27, 2012

MikePieperSer commented Apr 29, 2021

cowtowncoder commented Apr 29, 2021 •

edited

Loading

cowtowncoder commented Jun 9, 2024

Add 'readText()' method in JsonParser #15

Add 'readText()' method in JsonParser #15

Comments

cowtowncoder commented May 27, 2012

MikePieperSer commented Apr 29, 2021

cowtowncoder commented Apr 29, 2021 • edited Loading

cowtowncoder commented Jun 9, 2024

cowtowncoder commented Apr 29, 2021 •

edited

Loading