AppendableCharSequence#append(char) uses IOOBE for expansion #4807

Closed
rkapsi opened this issue Feb 1, 2016 · 8 comments
@rkapsi
Member

rkapsi commented Feb 1, 2016

Hello,

I did some profiling in our production environment and noticed that AppendableCharSequence#append(char) catches IndexOutOfBoundsException to expand its internal char[] (in our case called via HTTP header parser, see linked screenshot).

I don't know if it's on purpose (there used to be a pattern around that), but throwing and catching exceptions is in my experience more expensive than something as simple as:

if (pos >= chars.length) {
  expand();
}
chars[pos++] = c;

Additionally, it skews the profiler's output and can distract the developer from real exceptions.
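For illustration, the two expansion strategies might look like this (a minimal, self-contained sketch with a hypothetical CharBuf class, not Netty's actual AppendableCharSequence):

```java
import java.util.Arrays;

// Hypothetical buffer illustrating both expansion strategies.
final class CharBuf {
    private char[] chars = new char[4];
    private int pos;

    // Exception-driven: rely on the JVM's implicit bounds check and
    // expand only when the store actually fails.
    void appendCatching(char c) {
        try {
            chars[pos++] = c;
        } catch (IndexOutOfBoundsException e) {
            pos--; // the index was incremented before the store failed
            chars = Arrays.copyOf(chars, chars.length << 1);
            chars[pos++] = c;
        }
    }

    // Explicit check: one extra comparison per append, no throwing.
    void appendChecked(char c) {
        if (pos == chars.length) {
            chars = Arrays.copyOf(chars, chars.length << 1);
        }
        chars[pos++] = c;
    }

    @Override
    public String toString() {
        return new String(chars, 0, pos);
    }
}
```

The explicit check trades a predictable branch per call for never paying the cost of Throwable#fillInStackTrace().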

Netty 4.1.0-CR2-SNAPSHOT

http://pasteboard.co/1eVXi7nC.png

@Scottmitch
Member

This change was introduced by #4039. Please see the profiling results included in this PR, and also let me know if you are not able to reproduce these results, or if you think the benchmark itself is flawed.

AppendableCharSequence is a hot spot during HTTP profiling, and a few different things were tried (including rewriting portions of HttpObjectDecoder to avoid copying, and also using an AppendableByteSequence), but at the time none provided any substantial gains.

@Scottmitch Scottmitch self-assigned this Feb 2, 2016
@Scottmitch
Member

IIRC I did run this change in isolation to see if it had any impact, but looking back at this PR there were a few other changes introduced too. Let me re-run profiles with this change isolated and see what the impacts are.

@Scottmitch
Member

Now I remember ... AppendableCharSequenceBenchmark does isolate the two different approaches. As requested in #4807 (comment) let me know if you think there is something wrong with this benchmark.

@rkapsi
Member Author

rkapsi commented Feb 3, 2016

Hey Scott,

I'm having trouble getting that benchmark running (I seem to have a newer version of JMH; it doesn't like the absence of a @State annotation, and if I do add one it runs in an infinite loop).

But the first thing that comes to my mind is the potential impact of the stack depth. JMH's call stack is probably shallow. Something along the lines of this (below) may change things:

@Param({ "0", "1", "2", "4", "8", "16" })
private int stackDepth;

@Benchmark
public void appendCatchExceptionAfter() {
    // Recurse on a copy of the @Param value so repeated JMH invocations
    // all run at the configured depth (decrementing the field directly
    // would leave it at 0 after the first invocation).
    appendAtDepth(stackDepth);
}

private void appendAtDepth(int depth) {
    if (depth > 0) {
        appendAtDepth(depth - 1);
        return;
    }

    // ...
}

The IOOBEs are in our particular case most likely caused by Cookie headers. HttpObjectDecoder's initial size for the AppendableCharSequence is 128 and our Cookies are in the realm of 400-500 chars. That yields an almost guaranteed 2 exceptions per (first) request (per connection). That, combined with a rush of traffic and all the objects that need to be allocated in Throwable#fillInStackTrace(), can't be good.
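The arithmetic can be sanity-checked with a tiny helper (a sketch assuming doubling growth and one exception per expansion; ExpansionMath and expansions are hypothetical names, not Netty API):

```java
// Count how many times a buffer that doubles on overflow must expand
// (i.e., how many IOOBEs get thrown) to fit neededChars characters.
final class ExpansionMath {
    static int expansions(int initialSize, int neededChars) {
        int count = 0;
        for (int cap = initialSize; cap < neededChars; cap <<= 1) {
            count++;
        }
        return count;
    }
}
```

Starting from 128, a ~450-char header forces two expansions (128 → 256 → 512); starting from 256, only one.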

@Scottmitch
Member

@rkapsi - You can run the benchmark with Maven:
mvn clean test -DskipTests=false -Dtest=AppendableCharSequenceBenchmark
Let me do some analysis on stack depth. It is also possible that, if you only use the connection for a single request, the exception being thrown will be more costly than the duplicate index checking...

@rkapsi
Member Author

rkapsi commented Feb 5, 2016

@Scottmitch - I've run a few scenarios with that benchmark. Comparing the occasional exception being thrown vs. the constant double index checking, it's very difficult for the latter to make up the difference. I can fabricate scenarios where the two start showing similar op/s, but it feels very contrived. It's also hard to quantify the memory cost of the StackTraceElement objects that need to be created and thrown away.

I think the key takeaway for me is that it'd be great if AppendableCharSequence's initial size were configurable. It's currently hard-coded to 128 (HttpObjectDecoder, line 184). From our use case I can already tell upfront that setting it to 256 would cut the number of exceptions in half, with basically no additional memory cost, as most requests will have one header in the 400-500 character range.

@Scottmitch
Member

it'd be great if AppendableCharSequence's initial size was configurable

@rkapsi +1. Is it fair to treat this issue as a feature request to enhance HttpObjectDecoder? I'll put together a PR.

@rkapsi
Member Author

rkapsi commented Feb 5, 2016

@Scottmitch sounds good.

@Scottmitch Scottmitch modified the milestones: 4.1.0.CR3, 4.0.35.Final Feb 5, 2016
Scottmitch added a commit to Scottmitch/netty that referenced this issue Feb 6, 2016
Motivation:
The initial buffer size used to decode HTTP objects is currently fixed at 128. This may be too small for some use cases and create a high amount of overhead associated with resizing/copying. The user should be able to configure the initial size as they please.

Modifications:
- Make HttpObjectDecoder's AppendableCharSequence initial size configurable

Result:
Users can more finely tune initial buffer size for increased performance or to save memory.
Fixes netty#4807
Scottmitch added a commit that referenced this issue Feb 8, 2016
pulllock pushed a commit to pulllock/netty that referenced this issue Oct 19, 2023