
High memory footprint per connection #5751

Closed
atollena opened this issue Oct 31, 2022 · 1 comment

Comments

atollena (Collaborator) commented Oct 31, 2022

gRPC-go allocates two buffers per transport: one for reads and one for writes. They are allocated on transport creation in http_util.go. With their respective default sizes of 32KB and 64KB, those buffers can account for a significant part of grpc-go's memory footprint. This is especially noticeable when there is a large number of mostly idle connections.

As an example, with 1000 connections open, those buffers alone consume close to 100MB of memory. We first noticed this problem server-side in our deployments when using the round robin balancer in services that have a lot of clients (in that use case, setting up some form of subsetting would help). Another scenario where this issue is prominent is control planes with streaming, such as xDS management servers.

It is possible to disable those buffers or tune their size by setting ReadBufferSize/WriteBufferSize on the server side and WithReadBufferSize/WithWriteBufferSize on the client side, but there are performance implications. From reading the comments and code, those buffers are meant to batch calls to the transport, thereby reducing the number of system calls: the framing code issues a 9-byte read or write (the HTTP/2 frame header, which carries the frame length) immediately followed by a read or write of the frame payload. This means that in most cases, when the socket becomes readable or there is data to be written, at least two syscalls can be batched.
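For illustration, here is a minimal sketch of those knobs (the 8KB value and the address are arbitrary examples, not recommendations; per the option docs, a value of 0 disables the corresponding buffer so each read/write goes directly to the underlying connection):

```go
package main

import (
	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials/insecure"
)

func main() {
	// Server side: shrink both read and write buffers; the sizes here are
	// purely illustrative.
	srv := grpc.NewServer(
		grpc.ReadBufferSize(8*1024),
		grpc.WriteBufferSize(8*1024),
	)
	defer srv.Stop()

	// Client-side equivalents.
	conn, err := grpc.Dial("localhost:50051",
		grpc.WithTransportCredentials(insecure.NewCredentials()),
		grpc.WithReadBufferSize(8*1024),
		grpc.WithWriteBufferSize(8*1024),
	)
	if err != nil {
		panic(err)
	}
	defer conn.Close()
}
```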

From what I could read, the benchmarks in this repository focus on latency and throughput, and it is not surprising that those buffers improve the results. Note also that the benchmarks override the buffer size to 128KB, both for servers and clients. This is a problem because the benchmark results may not reflect what most users experience with the default settings. The benchmarks also do not account for fixed memory usage per connection.

Things I'm trying to get out of this issue are:

  1. Gains to be had by changing the implementation without significant performance loss. In particular, it is unclear to me why the write buffer does not use bufio.Writer and instead has a custom implementation that allocates a buffer of twice the passed-in size (2*32KB by default). This was introduced in this change: transport: refactor to reduce lock contention and improve performance #1962. (A sketch of the bufio.Writer alternative follows this list.)
  2. Make sure that benchmarks use the default values for buffer sizes -- I'm happy to propose the fix for this.
  3. Opportunities for documenting how to tune those values based on use cases.
  4. Explore whether there is value in adding read and write buffer sizes as a benchmark dimension, as well as measuring the cost of idle connections.
  5. More creative ideas to improve memory usage when there are many mostly idle connections (e.g. dynamically tuning buffer sizes based on activity). We are somewhat limited by the way I/O is done in Go: we can't just allocate memory when a call to read is ready, since socket readiness and read are not separate operations (see net: add mechanism to wait for readability on a TCPConn golang/go#15735).
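To make item 1 concrete, here is a hypothetical sketch (not grpc-go's actual code; the name newWriteSide is made up) of what a write path built on bufio.Writer could look like:

```go
package sketch

import (
	"bufio"
	"net"

	"golang.org/x/net/http2"
)

// newWriteSide wires an http2.Framer to a bufio.Writer, so the 9-byte frame
// header and the frame payload are coalesced into a single syscall when the
// buffer is flushed. Unlike the current custom writer, bufio.NewWriterSize
// allocates exactly `size` bytes rather than twice the configured size.
func newWriteSide(conn net.Conn, size int) (*http2.Framer, *bufio.Writer) {
	bw := bufio.NewWriterSize(conn, size)
	fr := http2.NewFramer(bw, conn)
	// The caller decides when to call bw.Flush(); how eagerly to flush is
	// where the latency/throughput trade-off lives.
	return fr, bw
}
```

Whether this matches the batching behaviour of the current custom writer under contention is exactly the open question in item 1.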
easwars (Contributor) commented Nov 1, 2022

@atollena: We discussed this issue today and decided to create separate issues for the list of things you've mentioned here. That way, we can keep the discussions scoped to the respective issues/PRs.

Thanks for filing this issue.
