Conversation

Tim-Brooks
Contributor

Currently we read from the network in 64KB blocks. When TLS is not
enabled, these bytes are normally passed all the way up to the
application layer (with some exceptions, such as compression). For the
HTTP layer this means that these bytes can live for the entire
lifecycle of an indexing request.

The problem is that if the reads from the socket are small, a 64KB
buffer can end up holding a read of 1KB or less. If the socket buffer
or TCP buffer sizes are small, this leads to massive memory waste. This
has been identified as a major source of OOMs on coordinating nodes, as
Elasticsearch easily exhausts the heap holding onto these network bytes.
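As a rough illustration of the scale (numbers chosen only to show the
ratio, not measured from a real node): if a 100MB bulk request arrives
in 1KB reads and each read pins a full 64KB buffer until the request
completes, the node retains roughly 64 x 100MB, on the order of 6.4GB
of heap, for 100MB of payload.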

This commit resolves the problem by placing a handler after the TLS
handler that copies these bytes into more appropriately sized buffers
as necessary. It comes after TLS because TLS is a framing layer which
often resolves this problem for us (the 64KB buffer is decoded into
more appropriately sized buffers). However, this extra handler solves
it for the non-TLS pipelines.
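
For illustration only, here is a minimal sketch of what such a copying
handler could look like in a Netty pipeline. The handler name and the
copy-always policy are assumptions for the example, not the actual
implementation in this commit:

```java
import io.netty.buffer.ByteBuf;
import io.netty.channel.ChannelHandlerContext;
import io.netty.channel.ChannelInboundHandlerAdapter;

// Hypothetical handler: copies each inbound read into a buffer sized to
// the bytes actually received, so downstream handlers never retain the
// original (potentially 64KB) receive buffer for a 1KB read.
public class RightSizeBufferHandler extends ChannelInboundHandlerAdapter {

    @Override
    public void channelRead(ChannelHandlerContext ctx, Object msg) {
        if (msg instanceof ByteBuf) {
            ByteBuf received = (ByteBuf) msg;
            try {
                // Allocate exactly readableBytes() and copy the data across,
                // then release the oversized receive buffer immediately.
                ByteBuf copy = ctx.alloc().heapBuffer(received.readableBytes());
                copy.writeBytes(received);
                ctx.fireChannelRead(copy);
            } finally {
                received.release();
            }
        } else {
            ctx.fireChannelRead(msg);
        }
    }
}
```

In this sketch the handler would sit directly after the SslHandler on
TLS channels and in the equivalent position on plaintext channels,
where the copy matters most. A real implementation might copy only when
the receive buffer is mostly empty, to avoid paying for a copy on full
64KB reads.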

@Tim-Brooks Tim-Brooks added the >non-issue, :Distributed Coordination/Network, backport, and v7.9.3 labels Oct 5, 2020
@elasticmachine
Collaborator

Pinging @elastic/es-distributed (:Distributed/Network)

@elasticmachine elasticmachine added the Team:Distributed (Obsolete) label Oct 5, 2020
@Tim-Brooks
Contributor Author

@elasticmachine run elasticsearch-ci/packaging-sample-windows

@Tim-Brooks Tim-Brooks merged commit 4391438 into elastic:7.9 Oct 5, 2020