feat(podtemplate): calculate GOMEMLIMIT at 90% of memory limit to reduce OOM #1418

shlomitubul · 2025-05-20T09:32:42Z

What does this PR do?

chart current pod template injects
GOMEMLIMIT = pod memory limits, it leaves no headroom for Go GC to run before the process hits the cgroup ceiling, leading to OOM kills. In my tests, I show great improvement (in terms of oom events and steady memory usage) with lower values, like 90%.

Motivation

Multiple OOMs that were gone once we changed GOMEMLIMIT value

More

Yes, I updated the tests accordingly
Yes, I updated the schema accordingly
Yes, I ran make test and all the tests passed

…ce OOM Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

traefik/templates/_helpers.tpl

traefik/Chart.yaml

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

traefik/templates/_helpers.tpl

traefik/tests/pod-config_test.yaml

fix(podtemplate-test): fix unit & tests (+comment) Co-authored-by: Rémi BUISSON <remi-buisson@orange.fr>

traefik/templates/_helpers.tpl

jnoordsij

Hey there!

Thanks a lot for this contribution. I'm personally a great fan of the performance and stability improvements that setting GOMEMLIMIT introduces, which is one of the reasons I've submitted various PRs elsewhere to add this functionality to Helm charts.

As such, I think it's great to allow more flexibility on this from the chart side. Although it does come with some maintenance burden and slightly complicated helpers, I still consider this beneficial to improve a lot of usecases.

However, I do wonder a bit what would be the most suitable 'default' here to pick. The current implementation is of course the simplest, but as you've clearly seen in your use-case, it may not be the best choice for everyone. But I think that using 80% will effectively be the same: it will be suitable to some use-cases, but might also impact some use-cases which are currently doing fine in a negative way. Given that decreasing the limit may result in more gc runs, which may result in more CPU consumption and thus eventually potential slower response times, I think it is essential that we pick a number that is both generic enough, yet still customisable for people aiming to tweak their own setup.

For the choice of default value, in cert-manager/cert-manager#6977 (comment) I was provided with some great feedback on it. Based on those references, with the (indirectly referenced) Go documentation itself being the most relevant in my opinion, I would suggest that probably a value between 90 and 95 would be more accurate as 'default'.

I think a default deviating from the recommended 90-95% or the simple current 100% is something we should only do if we actually have some very clear benchmarks for a representative use-case that suggests another value is more suitable.

So in that regard I would suggest:

allowing tuning as @darkweaver87 suggested is essential to introduce this change;
using a default value of 90% to 95% is more suitable based on current knowledge.

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

shlomitubul · 2025-06-07T14:34:35Z

Hey @jnoordsij, I appreciate your comment and review. I applied @darkweaver87 and your suggestion

traefik/values.yaml

traefik/templates/_helpers.tpl

traefik/tests/pod-config_test.yaml

Co-authored-by: Jesper Noordsij <45041769+jnoordsij@users.noreply.github.com>

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

darkweaver87

LGTM

mloiseleur · 2025-06-16T06:30:26Z

@shlomitubul You need to update the schema. See make schema

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

shlomitubul · 2025-06-16T18:13:06Z

@shlomitubul You need to update the schema. See make schema

@mloiseleur done

mloiseleur · 2025-06-17T06:58:01Z

@shlomitubul And also "make docs", to update VALUES.md

jnoordsij

LGTM (after running make docs)

shlomitubul added 2 commits May 20, 2025 11:29

fix(podtemplate): calculate GOMEMLIMIT at 80% of memory limit to redu…

ad507e5

…ce OOM Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

fix(podtemplate-test): fix tests

da6e017

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

shlomitubul commented May 20, 2025

View reviewed changes

traefik/templates/_helpers.tpl Show resolved Hide resolved

mloiseleur reviewed May 20, 2025

View reviewed changes

traefik/Chart.yaml Outdated Show resolved Hide resolved

fix(podtemplate-test): revert version bump

b520e47

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

shlomitubul requested a review from mloiseleur May 20, 2025 12:36

darkweaver87 reviewed May 22, 2025

View reviewed changes

Apply suggestions from code review

7865492

fix(podtemplate-test): fix unit & tests (+comment) Co-authored-by: Rémi BUISSON <remi-buisson@orange.fr>

shlomitubul requested a review from darkweaver87 May 25, 2025 14:12

Merge branch 'master' into master

d1f4308

darkweaver87 reviewed May 27, 2025

View reviewed changes

traefik/templates/_helpers.tpl Outdated Show resolved Hide resolved

jnoordsij requested changes May 28, 2025

View reviewed changes

mloiseleur added the kind/enhancement New feature or request label Jun 6, 2025

mloiseleur changed the title ~~fix(podtemplate): calculate GOMEMLIMIT at 80% of memory limit to reduce OOM~~ feat(podtemplate): calculate GOMEMLIMIT at 80% of memory limit to reduce OOM Jun 6, 2025

fix(helpers) - add goMemLimitPercentage to make it tunable

6653376

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

shlomitubul requested review from jnoordsij and darkweaver87 June 7, 2025 14:34

shlomitubul changed the title ~~feat(podtemplate): calculate GOMEMLIMIT at 80% of memory limit to reduce OOM~~ feat(podtemplate): calculate GOMEMLIMIT at 90% of memory limit to reduce OOM Jun 7, 2025

Merge branch 'master' into master

b0ffc84

mloiseleur reviewed Jun 10, 2025

View reviewed changes

traefik/values.yaml Outdated Show resolved Hide resolved

jnoordsij requested changes Jun 10, 2025

View reviewed changes

traefik/templates/_helpers.tpl Outdated Show resolved Hide resolved

traefik/tests/pod-config_test.yaml Outdated Show resolved Hide resolved

shlomitubul and others added 3 commits June 12, 2025 11:06

Update traefik/tests/pod-config_test.yaml

fffd447

Co-authored-by: Jesper Noordsij <45041769+jnoordsij@users.noreply.github.com>

Update traefik/templates/_helpers.tpl

63d371b

Co-authored-by: Jesper Noordsij <45041769+jnoordsij@users.noreply.github.com>

fix(helpers) - move goMemLimitPercentage under deployment

cc78750

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

shlomitubul requested review from jnoordsij and mloiseleur June 12, 2025 08:17

darkweaver87 approved these changes Jun 16, 2025

View reviewed changes

Merge branch 'master' into master

ace50a6

shlomitubul added 2 commits June 16, 2025 18:59

fix(values-schema) - update schema

26dcc9b

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

fix(values-schema) - update schema

d76bf92

Signed-off-by: ShlomiTubul <shlomi.tubul@placer.ai>

jnoordsij approved these changes Jun 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(podtemplate): calculate GOMEMLIMIT at 90% of memory limit to reduce OOM #1418

feat(podtemplate): calculate GOMEMLIMIT at 90% of memory limit to reduce OOM #1418

Uh oh!

shlomitubul commented May 20, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jnoordsij left a comment

Uh oh!

shlomitubul commented Jun 7, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

darkweaver87 left a comment

Uh oh!

mloiseleur commented Jun 16, 2025

Uh oh!

shlomitubul commented Jun 16, 2025

Uh oh!

mloiseleur commented Jun 17, 2025

Uh oh!

jnoordsij left a comment

Uh oh!

Uh oh!

feat(podtemplate): calculate GOMEMLIMIT at 90% of memory limit to reduce OOM #1418

Are you sure you want to change the base?

feat(podtemplate): calculate GOMEMLIMIT at 90% of memory limit to reduce OOM #1418

Uh oh!

Conversation

shlomitubul commented May 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Motivation

More

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jnoordsij left a comment

Choose a reason for hiding this comment

Uh oh!

shlomitubul commented Jun 7, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

darkweaver87 left a comment

Choose a reason for hiding this comment

Uh oh!

mloiseleur commented Jun 16, 2025

Uh oh!

shlomitubul commented Jun 16, 2025

Uh oh!

mloiseleur commented Jun 17, 2025

Uh oh!

jnoordsij left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

shlomitubul commented May 20, 2025 •

edited

Loading