runtime: make the scavenger more prompt #16930
Comments
ping @aclements
Not a simple change, I'm afraid. Moving to 1.9.
Closely related: #14045 (comment)
An interesting report of this problem in the field: https://medium.com/samsara-engineering/running-go-on-low-memory-devices-536e1ca2fe8f
Change https://golang.org/cl/141937 mentions this issue:
Change https://golang.org/cl/142959 mentions this issue:
Change https://golang.org/cl/142960 mentions this issue:
Change https://golang.org/cl/143157 mentions this issue:
@mknyszek, should this issue be closed now?
Yes, I think so.
Currently, there's a lag of at least five minutes (and potentially much longer) between the Go heap size shrinking and the process's RSS shrinking. This is done to amortize the cost of releasing and re-acquiring system memory, but it has several downsides.
I believe we should make the scavenger release memory promptly and aggressively.
I believe we can do this without significant overhead by improving the design of the scavenger. Currently the scavenger is careful to recycle spans that have been unused for more than five minutes. This is largely wasted effort because virtual memory is fungible: there's a slight TLB locality boost to retaining very recently used memory (on the order of a few megabytes), but beyond this it doesn't matter what unused pages we return to the OS. The inflexibility of the scavenger has several downsides. Primarily, it requires many system calls to release sparse unused regions of memory, and these system calls have a very high per-call cost because the OS needs to do remote TLB invalidation. This cost also grows with the number of CPUs. It can also needlessly delay freeing memory because coalescing two neighboring free spans takes the most recent "used" time of the two.
I propose separating the concerns of how many pages to release from which pages to release.
Which pages to release should be based on minimizing the number of `sysUnused` calls (`madvise` on Linux). Roughly, releasing n pages should attempt to release an n-page span, and if no spans of n pages or larger are on the free list, it should release as few smaller spans as possible. There are many algorithms that satisfy this outline, and I suspect the details don't matter. It would be reasonable to adopt the exact algorithm used by tcmalloc, since that has been field-tested. We should preferentially release spans from the end of the free lists, since those have been least recently used and have the least preference for being reused.

How many pages to release is a harder question, but by separating these concerns we have the flexibility to choose a good answer. For example, the current policy can be expressed as retaining `max(HeapInUse over the past 5 minutes)`. The "right" answer depends on the costs of releasing and re-acquiring memory, the future size of the heap, and the relative CPU versus memory costs we're willing to incur. Because of the "sawtooth" behavior of GC, we do know something about the future size of the heap, at least: the heap will grow to the heap goal in the near future, and often that heap goal is in a rough steady state. Hence, as a simple heuristic, I propose we use the heap goal times a "steady state variance" factor, with some hysteresis to retain cost amortization in case garbage collections are happening rapidly.

/cc @matloob @RLH