
Split a JSON byte stream into JSON objects/arrays. Fixes #2536 #2547

Closed
wants to merge 1 commit into from

Conversation

buchgr
Contributor

@buchgr buchgr commented Jun 7, 2014

Motivation:

See GitHub Issue #2536.

Modifications:

Introduce the class JsonObjectDecoder to split a JSON byte stream
into individual JSON objects/arrays.

Result:

A Netty application can now handle a byte stream in which multiple JSON
documents follow each other, as opposed to only a single JSON document
per request.
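For readers unfamiliar with the approach, the splitting described above can be sketched in plain Java. This is an illustrative sketch only, not the code in this PR; the class and method names (JsonSplitter, split) are made up:

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch -- NOT Netty's JsonObjectDecoder. It shows the core
// idea: track brace/bracket depth (ignoring delimiters that occur inside
// string literals) and cut the stream each time the depth returns to zero.
public class JsonSplitter {
    public static List<String> split(String stream) {
        List<String> docs = new ArrayList<>();
        int depth = 0;
        int start = -1;            // index where the current document began
        boolean inString = false;
        for (int i = 0; i < stream.length(); i++) {
            char c = stream.charAt(i);
            if (inString) {
                if (c == '\\') {
                    i++;           // skip the escaped character
                } else if (c == '"') {
                    inString = false;
                }
                continue;
            }
            if (c == '"') {
                inString = true;
            } else if (c == '{' || c == '[') {
                if (depth++ == 0) {
                    start = i;     // a new top-level document begins here
                }
            } else if (c == '}' || c == ']') {
                if (--depth == 0) {
                    docs.add(stream.substring(start, i + 1));
                }
            }
        }
        return docs;
    }
}
```

The real decoder additionally has to cope with documents that arrive split across several ByteBufs, which this sketch glosses over.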

@trustin @daschl @normanmaurer please review :-)

@buchgr buchgr changed the title Split a JSON byte stream into JSON objects/arrays Split a JSON byte stream into JSON objects/arrays. Fixes #2536 Jun 7, 2014
@ghost

ghost commented Jun 7, 2014

Build result for #2547 at dcdc69fc28ff416717c0742f13a88c399116fcb7: Failure

@ghost

ghost commented Jun 7, 2014

Build result for #2547 at 07ad08da328a825451e8ba3946bc27d3e4e01d83: Success


/**
* Splits a byte stream of JSON objects and arrays into individual objects/arrays and passes them up the
* channel pipeline.
Member

use {@link ChannelPipeline}

@ghost

ghost commented Jun 8, 2014

Build result for #2547 at 99a6c15049b714610622427a11b984e82b32daea: Success

@ghost

ghost commented Jun 8, 2014

Build result for #2547 at 8d2755bea51a05a6ea4a4b24676ff5308587156c: Success

@ghost

ghost commented Jun 8, 2014

Build result for #2547 at 6d4dc4ed89eb988a7eae1d67c718df2d9ca36005: Success

@normanmaurer
Member

@trustin wdyt ?

@daschl
Member

daschl commented Jun 12, 2014

@jakobbuchgraber I'm doing something a little different in my project, maybe you like it:

basically using a processor to find some JSON markers:

    private static class MarkerProcessor implements ByteBufProcessor {

        private int marker = 0;
        private int counter = 0;
        private int depth = 0;
        private byte open = '{';
        private byte close = '}';
        private byte stringMarker = '"';
        private boolean inString = false;

        @Override
        public boolean process(byte value) throws Exception {
            counter++;
            if (value == stringMarker) {
                inString = !inString;
            }
            if (!inString && value == open) {
                depth++;
            }
            if (!inString && value == close) {
                depth--;
                if (depth == 0) {
                    marker = counter;
                }
            }
            return true;
        }

        public int marker() {
            return marker;
        }
    }

I'm doing slightly different stuff but you can use the processor for this very efficiently.
/cc @normanmaurer
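Stripped of Netty's ByteBufProcessor interface so it runs standalone, the marker idea above boils down to the following (MarkerScan is a hypothetical name; like the snippet above, it deliberately ignores \" escapes inside strings):

```java
// Plain-Java adaptation of the MarkerProcessor above: scan a byte array and
// remember the position just past the last complete top-level '{'...'}'
// object. Everything up to marker() can safely be sliced off and decoded.
public class MarkerScan {
    public static int marker(byte[] data) {
        int marker = 0;
        int counter = 0;
        int depth = 0;
        boolean inString = false;
        for (byte value : data) {
            counter++;
            if (value == '"') {
                inString = !inString;  // caveat: does not handle \" escapes
            }
            if (!inString && value == '{') {
                depth++;
            }
            if (!inString && value == '}') {
                if (--depth == 0) {
                    marker = counter;  // a complete object ends here
                }
            }
        }
        return marker;
    }
}
```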

@buchgr
Contributor Author

buchgr commented Jun 12, 2014

@daschl thanks for sharing. I started with a processor too (as I learned from @normanmaurer's FB talk :P ), but I also have to be able to look back one byte to check for escape characters (e.g. "foo \" bar"), and so I kinda felt the loop was nicer for my purposes.

Maybe I should benchmark both versions, but I didn't bother with this kind of micro optimization yet 😄
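The escape handling buchgr describes can also be done with a one-character state flag instead of an explicit look-back. A minimal sketch (objects only for brevity; EscapeAware and count are hypothetical names):

```java
// Minimal sketch of escape-aware scanning: a backslash inside a string marks
// the next character as escaped, so an escaped quote ("foo \" bar") does not
// terminate the string literal. Counts complete top-level objects.
public class EscapeAware {
    public static int count(String s) {
        int depth = 0;
        int docs = 0;
        boolean inString = false;
        boolean escaped = false;
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (inString) {
                if (escaped) {
                    escaped = false;   // this character was escaped; ignore it
                } else if (c == '\\') {
                    escaped = true;    // next character is escaped
                } else if (c == '"') {
                    inString = false;  // unescaped quote ends the string
                }
            } else if (c == '"') {
                inString = true;
            } else if (c == '{') {
                depth++;
            } else if (c == '}' && --depth == 0) {
                docs++;                // one complete top-level object
            }
        }
        return docs;
    }
}
```

The flag also handles `\\"` correctly (an escaped backslash followed by a real closing quote), which a single-byte look-back alone would misread.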

ch.writeInbound(Unpooled.copiedBuffer("blabla 123", CharsetUtil.UTF_8));
assertNull(ch.readInbound());

assertFalse(ch.finish());
Member

You could also check if an exception is raised here.

@trustin
Member

trustin commented Jun 24, 2014

Could you also add a test case where a string contains '{' and other potentially confusing tokens?

@trustin
Member

trustin commented Jun 24, 2014

I also want to see an option that enables a streaming of a JSON array elements, whose size is potentially infinite. For example, when a client sends this:

[
"a",
{ key: "value" },
"c",
...
]

The decoder could generate a frame for each array element (i.e. "a", { key: "value" }, "c", ...).
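The streaming mode requested here amounts to cutting at top-level commas inside the outer array. A sketch under the simplifying assumption that the whole array is already buffered (the real decoder would emit frames incrementally as bytes arrive; names are made up):

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of array-element streaming: inside the outer '[' ... ']' (depth 1),
// a comma delimits elements; nested objects/arrays raise the depth so their
// commas are ignored, and string handling skips delimiters inside literals.
public class ArrayElementSplitter {
    public static List<String> elements(String json) {
        List<String> out = new ArrayList<>();
        int depth = 0;
        int start = -1;
        boolean inString = false;
        boolean escaped = false;
        for (int i = 0; i < json.length(); i++) {
            char c = json.charAt(i);
            if (inString) {
                if (escaped) {
                    escaped = false;
                } else if (c == '\\') {
                    escaped = true;
                } else if (c == '"') {
                    inString = false;
                }
                continue;
            }
            if (c == '"') {
                inString = true;
            } else if (depth == 0) {
                if (c == '[') {            // enter the outer array
                    depth = 1;
                    start = i + 1;
                }
            } else if (depth == 1 && (c == ',' || c == ']')) {
                String elem = json.substring(start, i).trim();
                if (!elem.isEmpty()) {
                    out.add(elem);         // one frame per element
                }
                if (c == ']') {
                    depth = 0;             // outer array closed
                } else {
                    start = i + 1;
                }
            } else if (c == '[' || c == '{') {
                depth++;
            } else if (c == ']' || c == '}') {
                depth--;
            }
        }
        return out;
    }
}
```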

@trustin trustin added this to the 4.1.0.Final milestone Jun 24, 2014
@trustin trustin self-assigned this Jun 24, 2014
@ghost

ghost commented Jun 25, 2014

Build result for #2547 at 8250ec5554dbfbd4acd56c98cd94811fa14d449b: Failure

@buchgr
Contributor Author

buchgr commented Jun 29, 2014

I addressed the comments and I think it's ready for final review/merge.

private final boolean streamArrayElements;

public JsonObjectDecoder() {
this(Integer.MAX_VALUE);
Member

The default maxObjectLength should be a much smaller value? 1048576?

@trustin
Member

trustin commented Jul 2, 2014

Looks pretty good all in all. Left some comments for tiny things. To summarize:

  • Raise a CorruptedFrameException when the received data is not JSON
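The check behind that suggestion can be as simple as inspecting the first non-whitespace character. In the decoder a negative result would raise Netty's CorruptedFrameException; this standalone sketch (hypothetical names) just reports the verdict instead:

```java
// Fail-fast validation sketch: a JSON object/array stream must begin with
// '{' or '[' after optional leading whitespace. In Netty the decoder would
// throw CorruptedFrameException; here the method just returns the verdict.
public class JsonStart {
    public static boolean looksLikeJson(String data) {
        for (int i = 0; i < data.length(); i++) {
            char c = data.charAt(i);
            if (Character.isWhitespace(c)) {
                continue;                  // skip leading whitespace
            }
            return c == '{' || c == '[';   // first significant character decides
        }
        return false;                      // nothing but whitespace so far
    }
}
```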

@buchgr
Contributor Author

buchgr commented Jul 2, 2014

done

@trustin
Member

trustin commented Jul 3, 2014

Nice work. Merged into 4.1 and master. 👍

@trustin trustin closed this Jul 3, 2014
@trustin trustin modified the milestones: 4.1.0.Beta1, 4.1.0.Final Jul 3, 2014
@alex-vas
Contributor

It might be worth adding a comment that this implementation is only compatible with UTF-8 or ASCII encoded streams. It might even be worth adding Charset as a constructor parameter and asserting that it is UTF-8 straight away; this would make the API ready for a possible future implementation of support for other encodings. The same goes for LineBasedFrameDecoder and XmlFrameDecoder - both are missing a comment explaining that they expect the stream in UTF-8.
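The UTF-8 restriction has a concrete reason: in UTF-8, every byte of a multi-byte sequence has its high bit set, so a 0x7B byte is always a literal '{'; in UTF-16 the byte 0x7B can occur inside the encoding of an unrelated character. A small demonstration (hypothetical names; U+7B00 is a CJK ideograph whose UTF-16BE encoding happens to be the bytes 0x7B 0x00):

```java
// Counts 0x7B ('{') bytes the way a byte-oriented scanner would. Safe for
// UTF-8 input, but it produces false positives on UTF-16: the CJK character
// U+7B00 encodes to the bytes 0x7B 0x00 in UTF-16BE, which looks like '{',
// while its UTF-8 encoding (0xE7 0xAC 0x80) contains no ASCII bytes at all.
public class CharsetCheck {
    public static int countOpenBraces(byte[] data) {
        int n = 0;
        for (byte b : data) {
            if (b == '{') {
                n++;
            }
        }
        return n;
    }
}
```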

@normanmaurer
Member

normanmaurer commented Dec 12, 2018 via email

@alex-vas
Contributor

Let me give it a try. :)
