
kube-controller-manager v1.26 high cpu usage #118706

Closed · zigmund opened this issue Jun 16, 2023 · 25 comments
Assignees: sxllwx, soltysh
Labels: kind/bug, kind/regression, priority/important-soon, sig/apps, triage/accepted

Comments

zigmund commented Jun 16, 2023

What happened?

Upgraded the cluster from v1.24.6 to v1.26.4 (via v1.25.9), and after that kube-controller-manager started consuming all available CPU:
[screenshot: kube-controller-manager CPU usage]

I also see a massive increase in the rate of workqueue_retries_total{name="cronjob"} - from 2-3 per second to 20-30k:
[screenshot: workqueue_retries_total rate]

I dumped a pprof profile from kube-controller-manager and also see massive cronjob-related load:
[screenshot: pprof profile]

What did you expect to happen?

The same kube-controller-manager CPU usage as before the upgrade.

How can we reproduce it (as minimally and precisely as possible)?

I don't know. We have 7 similar clusters on the same version - the issue is present in only one of them.

Anything else we need to know?

~170 cronjobs
~300 jobs

Deleted some cronjobs and old failed jobs - didn't see any effect.

Kubernetes version

$ kubectl version
Client Version: version.Info{Major:"1", Minor:"25", GitVersion:"v1.25.4", GitCommit:"872a965c6c6526caa949f0c6ac028ef7aff3fb78", GitTreeState:"clean", BuildDate:"2022-11-09T13:36:36Z", GoVersion:"go1.19.3", Compiler:"gc", Platform:"darwin/arm64"}
Kustomize Version: v4.5.7
Server Version: version.Info{Major:"1", Minor:"26", GitVersion:"v1.26.4", GitCommit:"f89670c3aa4059d6999cb42e23ccb4f0b9a03979", GitTreeState:"clean", BuildDate:"2023-04-12T12:05:35Z", GoVersion:"go1.19.8", Compiler:"gc", Platform:"linux/amd64"}

Cloud provider

none, self-hosted baremetal

OS version

# On Linux:
$ cat /etc/os-release
NAME="Ubuntu"
VERSION="20.04.4 LTS (Focal Fossa)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu 20.04.4 LTS"
VERSION_ID="20.04"
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
VERSION_CODENAME=focal
UBUNTU_CODENAME=focal
$ uname -a
Linux kube-[REDACTED] 5.15.0-53-generic #59~20.04.1-Ubuntu SMP Thu Oct 20 15:10:22 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux

Install tools

ansible

Container runtime (CRI) and version (if applicable)

containerd 1.6.8-1

Related plugins (CNI, CSI, ...) and versions (if applicable)

flannel v0.14.0
zigmund added the kind/bug label Jun 16, 2023
k8s-ci-robot added the needs-sig and needs-triage labels Jun 16, 2023
zigmund (Author) commented Jun 16, 2023

/sig api-machinery

k8s-ci-robot added the sig/api-machinery label and removed the needs-sig label Jun 16, 2023
aojea (Member) commented Jun 16, 2023

/sig scheduling

@alculquicondor I think sig-scheduling is handling the cronjob and job controllers, right?

k8s-ci-robot added the sig/scheduling label Jun 16, 2023
alculquicondor (Member) commented:

No :) I personally look at Job, though

/remove-sig scheduling
/sig apps
cc @soltysh

k8s-ci-robot added the sig/apps label and removed the sig/scheduling label Jun 16, 2023
alculquicondor (Member) commented:

CronJob controller v2 went stable in 1.22, so if anything, the regression is something more recent.

alculquicondor (Member) commented Jun 16, 2023

My wild guess would be that somehow kcm is reconciling all CronJobs even though they are not due for schedule.

@zigmund any chance you can look at the logs and see whether a particular cronjob with a low frequency is being reconciled continuously?

zigmund (Author) commented Jun 17, 2023

@alculquicondor
I checked a few monthly cronjobs and don't see such a problem.

But I do see repeated log entries for the same jobs:

I0617 20:28:00.368553      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:00.369321      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:00.409013      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:01.056718      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:03.946792      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:16.982832      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:17.997649      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:20.009703      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:21.034160      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:21.055513      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868
I0617 20:28:21.066377      12 job_controller.go:514] enqueueing job market-production/api-scheduler-28116868

Also there are many error logs:

E0617 20:37:18.442053      12 cronjob_controllerv2.go:166] error syncing CronJobController market-production/api-scheduler, requeuing: Operation cannot be fulfilled on cronjobs.batch "api-scheduler": the object has been modified; please apply your changes to the latest version and try again

But I see the same log entries in the other, healthy clusters too.

zigmund (Author) commented Jun 19, 2023

@alculquicondor

I started kcm locally to debug, and this is what I found so far (a minimal sketch of the resulting busy loop follows the two examples below):
Two cronjobs are constantly re-added to the workqueue with a negative *requeueAfter: https://github.com/kubernetes/kubernetes/blob/v1.26.4/pkg/controller/cronjob/cronjob_controllerv2.go#L168-L171
The duration is calculated wrongly in getMostRecentScheduleTime: https://github.com/kubernetes/kubernetes/blob/v1.26.4/pkg/controller/cronjob/utils.go#L121-L143

1. Cronjob parts-panel-cronjob-products-data:
Schedule:                      0 16 * * *
Concurrency Policy:            Forbid
Last Schedule Time:  Fri, 16 Jun 2023 16:00:00 +0600
Active Jobs:         parts-panel-cronjob-products-data-28115160

Now is 2023-06-19 13:11:04
There is 1 active long-running job, started 3 days ago; the cronjob has already missed some schedules. getMostRecentScheduleTime should return 2023-06-19 16:00:00 (in the future), but returns 2023-06-18 16:00:00 (in the past).

2. Cronjob parts-panel-cronjob-abcp-products-data:
Schedule:                      0 12 * * 3
Last Schedule Time:  Wed, 31 May 2023 12:00:00 +0600
Starting Deadline Seconds:     30s
Active Jobs:         <none>

Now is 2023-06-19 13:11:04
Schedules at 2023-06-17 12:00:00 and 2023-06-14 12:00:00 were missed for some reason (maybe because of Starting Deadline Seconds; it doesn't matter). getMostRecentScheduleTime should return 2023-06-21 12:00:00 (in the future), but returns 2023-06-14 12:00:00.
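To illustrate why a negative requeueAfter burns CPU, here is a minimal sketch (not the controller's actual code) using client-go's delaying workqueue: AddAfter with a non-positive delay falls through to an immediate Add, so the worker gets the same key back with no pause. The key name is borrowed from the logs above purely for illustration.

package main

import (
	"fmt"
	"time"

	"k8s.io/client-go/util/workqueue"
)

func main() {
	q := workqueue.NewDelayingQueue()
	defer q.ShutDown()

	// A negative delay like the one computed for the two cronjobs above.
	requeueAfter := -24 * time.Hour

	for i := 0; i < 3; i++ {
		// A delay <= 0 means "add immediately", so the key is ready again
		// the moment the worker asks for it - a tight resync loop.
		q.AddAfter("market-production/api-scheduler", requeueAfter)

		key, _ := q.Get()
		fmt.Println("resynced immediately:", key)
		q.Done(key)
	}
}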

zigmund (Author) commented Jun 19, 2023

Workaround: recreated these cronjobs and now kcm works fine.
I looked at the newer code - the issue may already be fixed in v1.27, but that needs checking.

alculquicondor (Member) commented:

@zigmund thanks for the detailed debugging.

Just for clarification: job_controller.go is about Job, not CronJob. Jobs do need to be synced somewhat continuously while they are running.

Feel free to submit a PR if you find the fix.

/triage accepted

k8s-ci-robot added the triage/accepted label and removed the needs-triage label Jun 19, 2023
zigmund (Author) commented Aug 3, 2023

It happened again after another controller-manager restart.

alculquicondor (Member) commented:

Did you have a chance to test in 1.27?

zigmund (Author) commented Aug 3, 2023

@alculquicondor, unfortunately it's not possible to upgrade the cluster in the near future.

alculquicondor (Member) commented:

I see. In any case @soltysh probably has more context about recent changes in CronJob that could solve the issue and can potentially be cherry-picked.

sxllwx (Member) commented Sep 26, 2023

/assign

sxllwx (Member) commented Sep 26, 2023

Commit af1c9e4 seems to have fixed this problem.

I'm looking into how to fix it here and will share some conclusions later.

sxllwx (Member) commented Oct 20, 2023

I have confirmed that #110838 (commit be44d67) corrects the error here.

We need to pay attention to how earliestTime is calculated in the function mostRecentScheduleTime. Previously, only Status#LastScheduleTime and ObjectMeta#CreationTimestamp were involved in the comparison; now Spec#StartingDeadlineSeconds and now() are also involved in the calculation, so the requeue time we calculate is no longer negative.
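A rough sketch of that comparison, with illustrative names rather than the exact upstream code: clamping earliestTime forward by spec.startingDeadlineSeconds keeps the schedule walk close enough to now() that the computed requeue delay can no longer go negative.

package cronjobsketch

import "time"

// earliestScheduleTime illustrates the comparison described above: take the
// later of the creation time and the last schedule time, then clamp it
// forward by the starting-deadline window relative to now.
func earliestScheduleTime(creation, lastSchedule time.Time, startingDeadlineSeconds *int64, now time.Time) time.Time {
	earliest := creation
	if !lastSchedule.IsZero() {
		earliest = lastSchedule // Status#LastScheduleTime wins over ObjectMeta#CreationTimestamp
	}
	if startingDeadlineSeconds != nil {
		deadlineStart := now.Add(-time.Duration(*startingDeadlineSeconds) * time.Second)
		if deadlineStart.After(earliest) {
			earliest = deadlineStart // Spec#StartingDeadlineSeconds and now() enter the comparison
		}
	}
	return earliest
}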

This is the unit test case I use for testing. You can use it for verification if you want.

sxllwx@82d0a0a

sxllwx added a commit to sxllwx/kubernetes that referenced this issue Oct 20, 2023: bug-case reproduced
sxllwx added a commit to sxllwx/kubernetes that referenced this issue Oct 20, 2023
sxllwx (Member) commented Oct 20, 2023

There is another interesting thing here:

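// Tail of getMostRecentScheduleTime (pkg/controller/cronjob/utils.go, v1.26); note the trailing .UTC() conversion on t.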
numberOfMissedSchedules := (timeElapsed / timeBetweenTwoSchedules) + 1
t := time.Unix(t1.Unix()+((numberOfMissedSchedules-1)*timeBetweenTwoSchedules), 0).UTC()
return &t, numberOfMissedSchedules, nil

A t.UTC() conversion is performed on t, which affects how sched calculates the next round's scheduling time (I noticed that our spec does not set a time zone; I recommend setting it to "TZ=UTC 0 12 * * 3" so it is consistent with the UTC hardcoded in the code).

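// Requeue-delay computation in the cronjob controller (v1.26) that consumes getMostRecentScheduleTime; whenever sched.Next(*mostRecentTime) lands in the past, the final Sub(now) goes negative.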
mostRecentTime, _, err := getMostRecentScheduleTime(earliestTime, now, sched)
if err != nil {
// we still have to requeue at some point, so aim for the next scheduling slot from now
mostRecentTime = &now
} else if mostRecentTime == nil {
// no missed schedules since earliestTime
mostRecentTime = &earliestTime
}
t := sched.Next(*mostRecentTime).Add(nextScheduleDelta).Sub(now)
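A small sketch of the time-zone interaction described above, assuming github.com/robfig/cron/v3 (the parser the cronjob controller uses): when the spec carries no TZ, Next evaluates the schedule in whatever Location the input time has, so converting the reference time to UTC can shift the computed next run. The +06:00 zone below just mirrors the timestamps earlier in this issue.

package main

import (
	"fmt"
	"time"

	"github.com/robfig/cron/v3"
)

func main() {
	// No TZ prefix, like the "0 12 * * 3" cronjob above.
	sched, err := cron.ParseStandard("0 12 * * 3")
	if err != nil {
		panic(err)
	}

	// A +06:00 wall clock, mirroring the "Last Schedule Time ... +0600" output above.
	loc := time.FixedZone("UTC+6", 6*60*60)
	last := time.Date(2023, 6, 14, 12, 0, 0, 0, loc)

	// Same instant, two different Locations, two different "next" runs:
	fmt.Println(sched.Next(last))       // next Wednesday noon in +06:00, i.e. 2023-06-21 12:00 +0600
	fmt.Println(sched.Next(last.UTC())) // 2023-06-14 12:00 UTC, six hours after the input instant
}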

liggitt (Member) commented Oct 20, 2023

> I noticed that our spec does not set the time zone. I recommend setting it to "TZ=UTC 0 12 * * 3"

Don't set TZ in the schedule string (that will be forbidden in future releases); set it in the cronjob spec instead.
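For reference, a hedged sketch using the batch/v1 Go types (spec.timeZone is the field being referred to; the required jobTemplate and other fields are omitted, so this is illustrative rather than an applyable object):

package main

import (
	"fmt"

	batchv1 "k8s.io/api/batch/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

func main() {
	tz := "Asia/Almaty" // illustrative zone matching the +06:00 timestamps above
	cj := batchv1.CronJob{
		ObjectMeta: metav1.ObjectMeta{Name: "parts-panel-cronjob-abcp-products-data"},
		Spec: batchv1.CronJobSpec{
			Schedule: "0 12 * * 3", // no "TZ=" prefix in the schedule string
			TimeZone: &tz,          // the zone lives in spec.timeZone instead
		},
	}
	fmt.Printf("schedule=%q timeZone=%q\n", cj.Spec.Schedule, *cj.Spec.TimeZone)
}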

liggitt (Member) commented Oct 20, 2023

@sxllwx thanks for working on this

@soltysh, now that we have a crisp unit test reproducer, can we prioritize review and backport of a fix? it looks like this changed in 1.25, which is almost at EOL (I'm not even sure there's another patch release planned).

liggitt added the priority/important-soon and kind/regression labels and removed the sig/api-machinery label Oct 20, 2023
soltysh (Contributor) commented Oct 20, 2023

@sxllwx thanks for your effort with the reproducer.

I'll make sure to prioritize this on Monday next week and figure out possible fixes.

soltysh (Contributor) commented Oct 23, 2023

Indeed, #110838 fixed a lot of issues around the schedule-time calculations, especially their performance. Additionally, PRs #118724 and #118940, plus the currently pending #121327 (where I've just added the test case from @sxllwx), improve both accuracy and performance. I'll check which ones we can safely backport to previous releases once the last one merges.

soltysh (Contributor) commented Oct 23, 2023

/assign

zigmund (Author) commented Oct 23, 2023

@soltysh nice, looking forward to the patch releases.

soltysh (Contributor) commented Oct 26, 2023

1.28 pick: #121536 - picks only #121327
1.27 pick: #121537 - picks only #121327
1.26 pick: #121540 - picks #110838 and #121327

zigmund (Author) commented Dec 27, 2023

All related PRs are merged, so it looks like we can close this now.
@soltysh thanks!

zigmund closed this as completed Dec 27, 2023