
pod: set both the resource CPU request and limit #112

Merged · jlebon merged 1 commit into coreos:main on Jul 29, 2022

Conversation

@jlebon (Member) commented Jul 27, 2022

Right now, we're not setting any CPU limit in the pods we schedule. This means our workloads run without constraints and can hog the hosts they're running on if there are no namespace-wide default resource limits.

Unlike memory requests/limits, CPU limits are enforced at the CPU scheduling level, so IIUC it's not possible to be evicted due to excessive CPU usage.

So let's start defaulting to a reasonable, not-too-large value and ensure we set both the CPU request and limit. This also allows cgroups-aware applications to derive the correct number of "equivalent CPUs" to use for multi-threaded/multi-processing steps.
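For illustration, the effect of this change on a scheduled pod's spec looks roughly like the stanza below. This is a minimal sketch, not the library's actual template: the container name and image are hypothetical, but the `resources` fields are standard Kubernetes API, and the value of 2 matches the default mentioned in the follow-up commits further down.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: ci-worker                # hypothetical name
spec:
  containers:
  - name: worker                 # hypothetical name
    image: registry.example.com/ci-worker:latest   # hypothetical image
    resources:
      requests:
        cpu: "2"    # what the scheduler reserves; becomes cpu.shares/cpu.weight
      limits:
        cpu: "2"    # hard cap, enforced by CFS throttling rather than eviction
```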

@ravanelli (Member) left a comment

lgtm

@dustymabe (Member) commented

hmm. I kind of liked the fact that if there were spare resources available we could "go faster". I guess there's no difference between "use resources if they are available" and "prevent other people from using resources we are currently using".

i.e. it would be nice if CPUs are idling for us to use them, but if other pods got scheduled on the node to be able to yield that extra CPU back to them.

@bgilbert (Contributor) left a comment

LGTM

@jlebon (Member, Author) commented Jul 28, 2022

> hmm. I kind of liked the fact that if there were spare resources available we could "go faster". I guess there's no difference between "use resources if they are available" and "prevent other people from using resources we are currently using".
>
> i.e. it would be nice if CPUs are idling for us to use them, but if other pods got scheduled on the node to be able to yield that extra CPU back to them.

Yeah, this is an intricate topic, and I'm definitely not an SME. But the base problem as I understand it is that there's no reliable way to know how much CPU we should use when there's only a soft limit (e.g. Kubernetes' request, which gets translated into cpu.shares). It feels wrong to have e.g. a compressor think it can use 64 threads. And in theory, being able to use idle CPU sounds nice, but in practice it makes us noisy neighbours, which can affect performance (esp. latency) in colocated workloads.

Being able to declare upfront how much CPU we need also means we'll be nicer to others and to ourselves (since we schedule multiple pods across multiple namespaces and jobs), and it could actually improve CI and pipeline reliability. It might also reveal that we're not reserving enough CPU, in which case we should feel free to request more as necessary, e.g. in the pipeline.
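To make the cgroups angle concrete: once a hard limit is set, a cgroups-aware application can derive its equivalent CPU count from the cgroup v2 cpu.max interface. The Python sketch below illustrates the mechanism (it assumes the unified cgroup v2 hierarchy is visible at /sys/fs/cgroup, as inside a container with a cgroup namespace). Note that with only a request and no limit, cpu.max reads "max" and there's no reliable answer, which is exactly the base problem described above.

```python
import os

def equivalent_cpus(cgroup_root="/sys/fs/cgroup"):
    """Derive "equivalent CPUs" from the cgroup v2 CPU limit.

    cpu.max contains "<quota> <period>" in microseconds, or "max <period>"
    when no hard limit is set; quota/period is the fraction of CPU time
    the cgroup may use, e.g. "200000 100000" means 2 equivalent CPUs.
    """
    try:
        with open(os.path.join(cgroup_root, "cpu.max")) as f:
            quota, period = f.read().split()
    except FileNotFoundError:
        return os.cpu_count()  # not on cgroup v2; fall back to the host count
    if quota == "max":
        # Only the soft request (cpu.weight/cpu.shares) applies: there is
        # no reliable per-pod answer -- the situation this PR avoids.
        return os.cpu_count()
    return max(1, int(quota) // int(period))

print(equivalent_cpus())  # with request == limit == 2, prints 2
```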

@dustymabe (Member) left a comment

LGTM

jlebon pushed a commit to jlebon/coreos-assembler that referenced this pull request Jul 29, 2022
We weren't specifying a number.  Previously that applied no limit at
all, but with coreos/coreos-ci-lib#112, the
default will now be 2.
@jlebon merged commit 7b552e1 into coreos:main on Jul 29, 2022 and deleted the pr/cpu-limit branch on July 29, 2022 at 14:52.
jlebon pushed a commit to coreos/coreos-assembler that referenced this pull request Jul 29, 2022 (same commit message as above).
dustymabe added a commit to dustymabe/fedora-coreos-pipeline that referenced this pull request Aug 1, 2022
With the recent changes in coreos/coreos-assembler#2975 we now need to set a CPU request/limit. cosaPod() does this for us now that coreos/coreos-ci-lib#112 is merged. Let's convert our jobs to use cosaPod().
dustymabe pushed a commit to coreos/fedora-coreos-config that referenced this pull request Aug 2, 2022
coreos-boot-edit.service wasn't sequenced Before anything, so under heavy
I/O contention it could race with services being terminated prior to
switching to the real root.  Before=initrd.target should fix this, so we
specify it, but it isn't enough for the same unknown reason as in
8b80486.  Also specify Before=initrd-parse-etc.service to avoid that
problem.

Fixes occasional multipath.day1 flakes in
coreos/fedora-coreos-tracker#1105, which became
more frequent after coreos/coreos-ci-lib#112
landed.

Fixes coreos/fedora-coreos-tracker#1105.
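For reference, the ordering fix described in this commit message boils down to a unit-file stanza along these lines (a sketch reconstructed from the message above; the rest of the unit is omitted):

```ini
# coreos-boot-edit.service -- ordering excerpt only
[Unit]
# Run before the initrd finishes, so heavy I/O contention can't race us
# against services being terminated prior to switch-root.
Before=initrd.target
# Not sufficient on its own (same unknown reason as in 8b80486), so also
# order before initrd-parse-etc.service.
Before=initrd-parse-etc.service
```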
mike-nguyen pushed a commit to mike-nguyen/fedora-coreos-config that referenced this pull request Aug 3, 2022 (same commit message as above; cherry picked from commit 651846e).
dustymabe pushed a commit to coreos/fedora-coreos-config that referenced this pull request Aug 3, 2022 (same commit message as above; cherry picked from commit 651846e).
dustymabe pushed two commits to dustymabe/coreos-assembler that referenced this pull request Nov 3, 2022 (same commit message as the coreos-assembler commit above; cherry picked from commit f634965).
HuijingHei pushed two commits to HuijingHei/fedora-coreos-config that referenced this pull request Oct 10, 2023 (same commit message as the fedora-coreos-config commits above).