
Stop Allocating Buffers in CopyBytesSocketChannel #49825

Conversation

original-brownbear
Member

Marked as draft because this is more an illustration of the issue than the fix I'd like to see here.
The problem with this fix is that it effectively moves us to a 64k read buffer instead of a 1M read buffer (with the default settings). That may be a price worth paying for the positive memory effects of reading in smaller chunks without allocations on the IO loop, but it's not great.
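For illustration, a minimal sketch of the direction this draft takes (not the actual `CopyBytesSocketChannel` source; the `IO_BUFFER` holder and the exact override are assumptions for the example): cap the socket read at the `ByteBuf`'s writable bytes so the copy can never force the buffer to grow, at the cost of reading at most ~64k per read.

```java
import java.io.IOException;
import java.nio.ByteBuffer;

import io.netty.buffer.ByteBuf;
import io.netty.channel.socket.nio.NioSocketChannel;

/**
 * Sketch only: read no more from the socket than the Netty-provided ByteBuf
 * can hold, so the copy never forces the buffer to expand.
 */
public class CappedCopyBytesSocketChannel extends NioSocketChannel {

    // Hypothetical stand-in for the shared per-thread 1M direct buffer.
    private static final ThreadLocal<ByteBuffer> IO_BUFFER =
        ThreadLocal.withInitial(() -> ByteBuffer.allocateDirect(1024 * 1024));

    @Override
    protected int doReadBytes(ByteBuf byteBuf) throws IOException {
        ByteBuffer ioBuffer = IO_BUFFER.get();
        ioBuffer.clear();
        // Never attempt more than the ByteBuf can take without reallocating.
        ioBuffer.limit(Math.min(ioBuffer.capacity(), byteBuf.writableBytes()));
        int bytesRead = javaChannel().read(ioBuffer);
        if (bytesRead > 0) {
            ioBuffer.flip();
            byteBuf.writeBytes(ioBuffer); // fits by construction, no capacity growth
        }
        return bytesRead;
    }
}
```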

The way things currently work, we read up to 1M from the channel
and then potentially force all of it into the `ByteBuf` passed
by Netty. Since that `ByteBuf` tends to be 64k in size by default,
large reads force the buffer to grow, completely circumventing
the logic of `allocHandle` and causing two problems:

  1. (This one could simply be fixed by setting the number of bytes in `ioBuffer` as the attempted read size if it's larger than the capacity of the `ByteBuf` ... ) This seems like it could break
    `io.netty.channel.RecvByteBufAllocator.Handle#continueReading`,
    since for the fixed-size allocator that method checks
    whether the last read was equal to the attempted read size.
    So if we set 64k because that's what the buffer size is,
    but then write 1M to the buffer, we will stop reading on the IO loop
    even though the channel may still have bytes we could read right away
    (see the paraphrased check after this list).

  2. More importantly though, this can lead to running OOM quite easily
    under IO pressure, because we force the heap buffers passed to the read
    to reallocate. With the current default chunk size of 16M that means potentially allocating
    16M of buffers, without the circuit breaker knowing about it, simply to read ahead from the channel (when reading messages one-by-one would have cost zero additions to the pool).
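For reference, a simplified paraphrase (not verbatim, field names abbreviated) of the check in Netty's default `MaxMessageHandle#continueReading` that item 1 is worried about: the read loop only continues while the last read filled the whole attempted size, so mismatched bookkeeping between the attempted size and what we actually copied can end the loop early.

```java
// Simplified paraphrase of the check in
// io.netty.channel.DefaultMaxMessagesRecvByteBufAllocator.MaxMessageHandle#continueReading.
// If attemptedBytesRead is set to the 64k ByteBuf capacity but the copy actually
// delivered 1M, this check no longer reflects whether the channel has been drained.
static boolean continueReading(boolean autoRead, int attemptedBytesRead, int lastBytesRead,
                               int totalMessages, int maxMessagesPerRead, int totalBytesRead) {
    return autoRead
        && attemptedBytesRead == lastBytesRead // "maybe more data": the last read filled the buffer
        && totalMessages < maxMessagesPerRead
        && totalBytesRead > 0;
}
```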

Relates #49699

The way things currently work, we read up to 1M from the channel
and then potentially force all of it into the `ByteBuf` passed
by Netty. Since that `ByteBuf` tends to be `64k` in size by default,
large reads force the buffer to grow, completely circumventing
the logic of `allocHandle`.

This seems like it could break
`io.netty.channel.RecvByteBufAllocator.Handle#continueReading`,
since for the fixed-size allocator that method checks
whether the last read was equal to the attempted read size.
So if we set `64k` because that's what the buffer size is,
but then write `1M` to the buffer, we will stop reading on the IO loop
even though the channel may still have bytes we could read right away.

More importantly though, this can lead to running OOM quite easily
under IO pressure, because we force the heap buffers passed to the read
to `reallocate`.
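To make the reallocation concrete, a small standalone snippet (illustrative only; sizes hard-coded to the defaults discussed above) showing how a single oversized copy forces a pooled 64k heap buffer to grow:

```java
import java.nio.ByteBuffer;

import io.netty.buffer.ByteBuf;
import io.netty.buffer.PooledByteBufAllocator;

public class BufferGrowthDemo {
    public static void main(String[] args) {
        // The 64k heap buffer the allocator's handle guessed for this read.
        ByteBuf byteBuf = PooledByteBufAllocator.DEFAULT.heapBuffer(64 * 1024);

        // What the shared direct ioBuffer can hold after a large read: up to 1M.
        ByteBuffer ioBuffer = ByteBuffer.allocateDirect(1024 * 1024);
        ioBuffer.put(new byte[1024 * 1024]);
        ioBuffer.flip();

        System.out.println("capacity before copy: " + byteBuf.capacity()); // 65536
        byteBuf.writeBytes(ioBuffer); // ensureWritable() grows the buffer to hold the full 1M
        System.out.println("capacity after copy:  " + byteBuf.capacity()); // >= 1048576

        // The extra capacity is served from the pooled arenas (16M chunks by default)
        // and is not tracked by the circuit breaker.
        byteBuf.release();
    }
}
```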

Closes elastic#49699
@original-brownbear original-brownbear added :Distributed/Network Http and internode communication implementations team-discuss labels Dec 4, 2019
@elasticmachine
Collaborator

Pinging @elastic/es-distributed (:Distributed/Network)

Contributor

@Tim-Brooks Tim-Brooks left a comment


LGTM

@original-brownbear
Member Author

Thanks Tim!

@original-brownbear original-brownbear merged commit 7a363f4 into elastic:master Dec 4, 2019
@original-brownbear original-brownbear deleted the netty-smarter-buffering branch December 4, 2019 14:55
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Dec 4, 2019
original-brownbear added a commit that referenced this pull request Dec 4, 2019
* Stop Allocating Buffers in CopyBytesSocketChannel (#49825)

SivagurunathanV pushed a commit to SivagurunathanV/elasticsearch that referenced this pull request Jan 23, 2020
@original-brownbear original-brownbear restored the netty-smarter-buffering branch August 6, 2020 18:36
Labels
:Distributed/Network Http and internode communication implementations >non-issue v7.6.0 v8.0.0-alpha1