Reuse server send buffers #6608

PapaCharlie · 2023-09-06T15:04:32Z

Under significant load, the server concurrently allocates a significant amount of buffers to serialize and compress the responses before sending them over the wire. This causes significant GC pressure and can cause large spikes in memory allocations.

Tests are benchmark results are pending.

Under significant load, the server concurrently allocates a significant amount of buffers to serialize and compress the responses before sending them over the wire. This causes significant GC pressure and can cause large spikes in memory allocations.

linux-foundation-easycla · 2023-09-06T15:04:36Z

❌ The email address for the commit (1e8ca4b) is not linked to the GitHub account, preventing the EasyCLA check. Consult this Help Article and GitHub Help to resolve. (To view the commit's email address, add .patch at the end of this PR page's URL.) For further assistance with EasyCLA, please submit a support request ticket.

bohhyang · 2023-09-06T18:27:55Z

internal/transport/controlbuf.go

@@ -152,6 +152,8 @@ type dataFrame struct {
 	// onEachWrite is called every time
 	// a part of d is written out.
 	onEachWrite func()
+	// onSent is called once all the bt


this comment is incomplete?

bohhyang · 2023-09-06T18:40:30Z

stream.go

 // prepareMsg returns the hdr, payload and data
 // using the compressors passed or using the
 // passed preparedmsg


this comment could be updated

bohhyang · 2023-09-06T18:57:17Z

server.go

-		return err
+
+	var compData []byte
+	if shouldCompress(cp, comp) {


could save the result of shouldCompress and reuse in the following lines

bohhyang · 2023-09-06T18:58:44Z

server.go

 	}
 	hdr, payload := msgHeader(data, compData)
 	// TODO(dfawley): should we be checking len(data) instead?
 	if len(payload) > s.opts.maxSendMessageSize {
+		s.opts.sendBufferPool.Put(dataBuf)


shouldn't this be Put(data) instead? since data is the new buffer after encoding at line 1135.

bohhyang · 2023-09-06T18:59:05Z

server.go

 		return status.Errorf(codes.ResourceExhausted, "grpc: trying to send message larger than max (%d vs. %d)", len(payload), s.opts.maxSendMessageSize)
 	}
+	opts.OnSent = func() {
+		s.opts.sendBufferPool.Put(dataBuf)


same here, shouldn't this be Put(data)?

arvindbr8 · 2023-09-08T16:05:00Z

@PapaCharlie, seems like your github account has an issue with CLA. Please fix using this Help Article as mentioned in the comment above.

arvindbr8 · 2023-09-08T16:08:42Z

@PapaCharlie Also please consider filing an issue with us before sending the PR. Changes like this require more investigation and proof - which needs to happen in an issue.
Please open an issue with us here.
I'm closing this for now and we can reopen this once we have a solid idea on what the issue is and set in stone the fix (if required).

PapaCharlie · 2023-09-11T08:38:10Z

Hey @arvindbr8, I didn't get a chance to link these issues when I opened the PR but here you go:
#2817 and #2816
This is also in response to pretty significant performance testing. I took some CPU and heap profiles for the same server code running on v1.58.0 and my fork under significant load (of the nature where the server is concurrently sending very large responses to many clients). Here are the results:

v1.58.0 CPU profile

v1.58.0 Heap profile (alloc_space)

Fork CPU profile

Fork Heap profile (alloc_space)

If you want me to repro this using simpler code (and/or a benchmark) I can give it a shot, it shouldn't be difficult. I just had these profile results handy. As you can see, the allocation overhead of not reusing the send buffers means the server is spending a lot its time collecting them instead of doing meaningful work. By reusing the send buffers, the performance drastically increases

Reuse server send buffers

1e8ca4b

Under significant load, the server concurrently allocates a significant amount of buffers to serialize and compress the responses before sending them over the wire. This causes significant GC pressure and can cause large spikes in memory allocations.

bohhyang reviewed Sep 7, 2023

View reviewed changes

arvindbr8 added the Status: Requires Reporter Clarification label Sep 8, 2023

arvindbr8 closed this Sep 8, 2023

PapaCharlie mentioned this pull request Sep 11, 2023

grpc: provide a mechanism for encoded message buffer recycling #6613

Closed

github-actions bot locked as resolved and limited conversation to collaborators Mar 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reuse server send buffers #6608

Reuse server send buffers #6608

PapaCharlie commented Sep 6, 2023

linux-foundation-easycla bot commented Sep 6, 2023

bohhyang Sep 6, 2023

bohhyang Sep 6, 2023

bohhyang Sep 6, 2023

bohhyang Sep 6, 2023

bohhyang Sep 6, 2023

arvindbr8 commented Sep 8, 2023

arvindbr8 commented Sep 8, 2023

PapaCharlie commented Sep 11, 2023

Reuse server send buffers #6608

Reuse server send buffers #6608

Conversation

PapaCharlie commented Sep 6, 2023

linux-foundation-easycla bot commented Sep 6, 2023

bohhyang Sep 6, 2023

Choose a reason for hiding this comment

bohhyang Sep 6, 2023

Choose a reason for hiding this comment

bohhyang Sep 6, 2023

Choose a reason for hiding this comment

bohhyang Sep 6, 2023

Choose a reason for hiding this comment

bohhyang Sep 6, 2023

Choose a reason for hiding this comment

arvindbr8 commented Sep 8, 2023

arvindbr8 commented Sep 8, 2023

PapaCharlie commented Sep 11, 2023

v1.58.0 CPU profile

v1.58.0 Heap profile (alloc_space)

Fork CPU profile

Fork Heap profile (alloc_space)