[Netty 5] Default PooledByteBufAllocator configuration #8536

Closed
normanmaurer opened this issue Nov 13, 2018 · 3 comments · Fixed by #12108
Comments

@normanmaurer
Member

By default, each arena allocates 16 MB per chunk, and we create two arenas per core for direct buffers and another two per core for heap buffers. That can add up to a large amount of memory for applications that may not need it. We should consider reducing the default values to shrink the initial memory footprint; applications that need more chunks can still tune the allocator.
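For reference, the defaults can already be overridden per allocator; the commits below reduce them globally. A minimal sketch, assuming Netty 4.1's public PooledByteBufAllocator constructor and metrics API, with illustrative numbers rather than recommended values:

```java
// Illustrative tuning only: shrink the pooled allocator's up-front footprint
// with fewer arenas and smaller chunks. chunkSize = pageSize << maxOrder.
import io.netty.buffer.PooledByteBufAllocator;

public final class SmallFootprintAllocator {
    public static void main(String[] args) {
        PooledByteBufAllocator alloc = new PooledByteBufAllocator(
                true,  // preferDirect
                2,     // nHeapArena   (default is two per core)
                2,     // nDirectArena (default is two per core)
                8192,  // pageSize
                9);    // maxOrder: 8192 << 9 = 4 MiB chunks instead of 16 MiB
        System.out.println("chunk size:    " + alloc.metric().chunkSize() + " bytes");
        System.out.println("direct arenas: " + alloc.metric().numDirectArenas());
    }
}
```

The same knobs are exposed as io.netty.allocator.* system properties (numHeapArenas, numDirectArenas, pageSize, maxOrder), which is what the defaults discussed here feed into.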

normanmaurer added this to To do in Netty 5 via automation Nov 13, 2018
@vkostyukov
Contributor

👍

We override these in Finagle for the very same reason.

normanmaurer added a commit that referenced this issue Apr 1, 2019
Motivation:

We currently use a thread-local cache for all threads, which often surprises users: it may result in a lot of memory usage if they allocate buffers from outside the EventLoop on different threads. We should not do this by default, to keep surprises to a minimum. Users who need the performance and know what they are doing can still change this.

Modifications:

Change io.netty.allocator.useCacheForAllThreads to false by default

Result:

Related to #8536.
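As a hedged aside on the knob this commit flips: threads outside the EventLoop can opt back into per-thread caches via the same system property, set before PooledByteBufAllocator is first loaded. A small sketch, assuming Netty 4.1's defaultUseCacheForAllThreads() accessor:

```java
// Sketch: re-enable per-thread caches for all threads by setting the property
// on the command line before Netty's allocator class is initialized, e.g.
//   java -Dio.netty.allocator.useCacheForAllThreads=true ...
import io.netty.buffer.PooledByteBufAllocator;

public final class CacheOptIn {
    public static void main(String[] args) {
        // Report what this JVM resolved the default to.
        System.out.println("useCacheForAllThreads: "
                + PooledByteBufAllocator.defaultUseCacheForAllThreads());
    }
}
```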
normanmaurer added a commit that referenced this issue Apr 1, 2019
…8991)
@normanmaurer
Member Author

@chrisvest want to have a look?

@chrisvest
Contributor

@normanmaurer Yeah, I'll make a note to look at this soon.

chrisvest added a commit to chrisvest/netty that referenced this issue Feb 25, 2022
… MiB

Motivation:
By default we allocate 2 arenas per core, and each arena that is put to use will allocate a chunk.
If an application doesn't need a lot of memory, certainly not in proportion to the number of cores on the system, this takes up more memory than necessary, since each chunk is 16 MiB.
By reducing the chunk size to 4 MiB, we cut the minimum memory usage considerably in those cases where not much is needed.
The drawback is that we risk allocating more huge buffers, but this is a fair trade-off since Netty's use cases mostly involve very small buffers.

Modification:
Reduce the default max order from 11 to 9.
Also make similar configuration changes in PooledByteBufAllocatorTest, to reduce the memory usage during testing.

Result:
Netty now uses less memory when less memory is needed by the application.
This fixes netty#8536
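For clarity, a sketch of the arithmetic behind the 16 MiB to 4 MiB change, assuming the default 8 KiB page size (chunkSize = pageSize << maxOrder):

```java
// Chunk-size arithmetic: lowering maxOrder from 11 to 9 shrinks each chunk 4x.
public final class ChunkSizeMath {
    public static void main(String[] args) {
        int pageSize = 8192; // Netty's default page size
        System.out.println("maxOrder 11: " + ((pageSize << 11) >> 20) + " MiB"); // 16
        System.out.println("maxOrder  9: " + ((pageSize << 9) >> 20) + " MiB");  // 4
    }
}
```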
Netty 5 automation moved this from To do to Done Feb 25, 2022
chrisvest added a commit that referenced this issue Feb 25, 2022
… MiB (#12108)
chrisvest pushed a commit to chrisvest/netty that referenced this issue Feb 28, 2022
…etty#8991)
normanmaurer added a commit that referenced this issue Mar 3, 2022
…8991) (#12109)
raidyue pushed a commit to raidyue/netty that referenced this issue Jul 8, 2022
… MiB (netty#12108)
raidyue pushed a commit to raidyue/netty that referenced this issue Jul 8, 2022
…etty#8991) (netty#12109)
franz1981 pushed a commit to franz1981/netty that referenced this issue Aug 22, 2022
… MiB (netty#12108)
franz1981 pushed a commit to franz1981/netty that referenced this issue Aug 22, 2022
…etty#8991) (netty#12109)