kubelet: Pods created and rapidly terminated get stuck #98424

rphillips · 2021-01-26T14:38:59Z

What type of PR is this?
/kind bug
/sig node

What this PR does / why we need it:
This PR fixes a race that occurs when a pod is terminated but CRI-O has not yet had the chance to clean up the cgroup. The podKiller now registers every pod it is killing in a map, and cleanupOrphanedPodCgroups bypasses the cgroup cleanup for a pod whose kill is still pending.

In OpenShift, we are seeing an issue where a pod is deleted from the API and its cgroup is torn down before the pod is completely killed. This happens because the Kubelet sees the terminated pod from the API, and the housekeeping routine (which runs every 2 seconds) starts cleaning up the cgroup (cleanupOrphanedPodCgroups) if cgroupPerQOS is enabled.

The fix skips the cgroup cleanup while the pod is being killed by podKiller. Once the pod exits the podKiller, pcm.Destroy is called within cleanupOrphanedPodCgroups.
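The skip described above can be sketched in isolation. This is a minimal illustration, not the kubelet code: `pendingKills`, `markPending`, `markTerminated`, `isPending`, and `cleanupOrphanedCgroups` are hypothetical names, and the real cleanupOrphanedPodCgroups calls pcm.Destroy where this sketch only collects UIDs.

```go
package main

import (
	"fmt"
	"sync"
)

// pendingKills is a hypothetical stand-in for the map the podKiller keeps
// of pods whose kill has been requested but has not finished yet.
type pendingKills struct {
	mu   sync.RWMutex
	uids map[string]struct{}
}

func newPendingKills() *pendingKills {
	return &pendingKills{uids: make(map[string]struct{})}
}

// markPending records that a kill is in flight for the pod.
func (p *pendingKills) markPending(uid string) {
	p.mu.Lock()
	defer p.mu.Unlock()
	p.uids[uid] = struct{}{}
}

// markTerminated forgets the pod once its kill has completed.
func (p *pendingKills) markTerminated(uid string) {
	p.mu.Lock()
	defer p.mu.Unlock()
	delete(p.uids, uid)
}

// isPending reports whether a kill is still in flight for the pod.
func (p *pendingKills) isPending(uid string) bool {
	p.mu.RLock()
	defer p.mu.RUnlock()
	_, ok := p.uids[uid]
	return ok
}

// cleanupOrphanedCgroups mimics one housekeeping pass. It returns the UIDs
// whose cgroups would be destroyed: pods that are no longer active and are
// not waiting on a pending kill.
func cleanupOrphanedCgroups(cgroupUIDs []string, active map[string]bool, pending *pendingKills) []string {
	var destroyed []string
	for _, uid := range cgroupUIDs {
		if active[uid] {
			continue // pod is still wanted; keep its cgroup
		}
		if pending.isPending(uid) {
			continue // kill still in flight; retry on a later pass
		}
		destroyed = append(destroyed, uid)
	}
	return destroyed
}

func main() {
	pending := newPendingKills()
	pending.markPending("pod-b")

	cgroups := []string{"pod-a", "pod-b"}
	active := map[string]bool{"pod-a": true}

	// First pass: pod-b is deleted from the API but is still being killed.
	fmt.Println(cleanupOrphanedCgroups(cgroups, active, pending)) // prints []

	// The kill finishes, so the next pass may remove the cgroup.
	pending.markTerminated("pod-b")
	fmt.Println(cleanupOrphanedCgroups(cgroups, active, pending)) // prints [pod-b]
}
```

Because housekeeping runs repeatedly, skipping a pod on one pass only delays its cgroup removal until a pass after the kill completes.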

Adds mock for FakeContainerManager
Adds mock for PodContainerManager
Unit test

Which issue(s) this PR fixes:
Fixes #98142

Special notes for your reviewer:

Does this PR introduce a user-facing change?:

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

k8s-ci-robot · 2021-01-26T14:39:07Z

@rphillips: This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

rphillips · 2021-01-26T15:33:53Z

/retest

pkg/kubelet/kubelet_pods.go

ehashman · 2021-01-28T00:36:30Z

/cc @SergeyKanzhelev

ehashman

This generally looks like an improvement to me (yay thread safety!!)

I'd love to see what this looks like against some of the e2es that are frequently failing. @rphillips I think some of them are OpenShift-only, do you want me to try porting them back to k8s? Maybe I can run them against this patch in another DNM PR?

pkg/kubelet/kubelet_pods.go

rphillips · 2021-02-04T15:16:43Z

/test pull-kubernetes-e2e-gce-ubuntu-containerd

do not delete the cgroup from a pod when it is being killed

rphillips · 2021-02-04T17:59:06Z

/retest

rphillips · 2021-02-04T19:12:05Z

Added a lock for the final validation step.
https://github.com/kubernetes/kubernetes/pull/98424/files#diff-ab8f7cf0865da30dc5a3c29c539591dff7fd7d8d9202df58206b7d70b37fae2aR459
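The comment does not spell out what the validation step checks, so the following is only a generic sketch of why a check followed by an update usually needs to happen under one lock. It assumes a hypothetical `killRegistry` type with a `checkAndMark` method; neither name comes from the PR.

```go
package main

import (
	"fmt"
	"sync"
)

// killRegistry is a hypothetical registry of pods that are being killed.
type killRegistry struct {
	mu      sync.Mutex
	pending map[string]bool
}

func newKillRegistry() *killRegistry {
	return &killRegistry{pending: make(map[string]bool)}
}

// checkAndMark reports whether the caller is the first to register uid.
// The lookup and the write share one critical section, so two goroutines
// cannot both observe "not pending" and both proceed.
func (r *killRegistry) checkAndMark(uid string) bool {
	r.mu.Lock()
	defer r.mu.Unlock()
	if r.pending[uid] {
		return false
	}
	r.pending[uid] = true
	return true
}

// countWinners races the given number of goroutines on the same uid and
// returns how many of them were told they registered it first.
func countWinners(r *killRegistry, uid string, callers int) int {
	var wg sync.WaitGroup
	var mu sync.Mutex
	winners := 0
	for i := 0; i < callers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			if r.checkAndMark(uid) {
				mu.Lock()
				winners++
				mu.Unlock()
			}
		}()
	}
	wg.Wait()
	return winners
}

func main() {
	fmt.Println(countWinners(newKillRegistry(), "pod-a", 50)) // prints 1
}
```

If the lookup and the write took the lock separately, more than one caller could pass the check before any of them recorded the pod.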

sjenning · 2021-02-04T21:19:33Z

/lgtm

ehashman

/lgtm

rphillips · 2021-02-04T23:36:23Z

/test pull-kubernetes-e2e-kind

k8s-ci-robot · 2021-02-04T23:54:52Z

@rphillips: The following test failed, say /retest to rerun all failed tests:

Test name	Commit	Details	Rerun command
pull-kubernetes-bazel-test	`f918e11`	link	`/test pull-kubernetes-bazel-test`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.


jingxu97 · 2021-02-18T02:14:45Z

Could this PR be considered for a cherry-pick?

jingxu97 · 2021-02-18T02:19:08Z

@rphillips Just want to confirm: without this fix, is the issue that a pod sometimes gets stuck in the terminating phase because of the cgroup cleanup? Thanks!

ehashman · 2021-02-18T21:27:41Z

@jingxu97 it's a pretty big change, I'm not sure if we want to cherry-pick this. There are some other improvements like #98933 going in that would be good to land together with it.

k8s-ci-robot added the needs-priority label (indicates a PR lacks a `priority/foo` label and requires one) Jan 26, 2021

k8s-ci-robot requested review from odinuge and SergeyKanzhelev January 26, 2021 14:39

k8s-ci-robot added the area/kubelet label Jan 26, 2021

rphillips force-pushed the fixes/98142 branch from 9690243 to c3cc0d4 Compare January 26, 2021 15:58

k8s-ci-robot added the size/S label (a PR that changes 10-29 lines, ignoring generated files) and removed the size/XS label (a PR that changes 0-9 lines, ignoring generated files) Jan 26, 2021

rphillips changed the title ~~kubelet: only filter out terminated pods if the resources have been reclaimed~~ WIP: kubelet: only filter out terminated pods if the resources have been reclaimed Jan 26, 2021

k8s-ci-robot added the do-not-merge/work-in-progress label (indicates that a PR should not merge because it is a work in progress) Jan 26, 2021

ehashman added this to Waiting on Author in SIG Node PR Triage Jan 26, 2021

rphillips force-pushed the fixes/98142 branch from c3cc0d4 to f35bb71 Compare January 26, 2021 21:03

k8s-ci-robot added the size/M label (a PR that changes 30-99 lines, ignoring generated files) and removed the size/S label (a PR that changes 10-29 lines, ignoring generated files) Jan 26, 2021

rphillips force-pushed the fixes/98142 branch from f35bb71 to 78775d5 Compare January 26, 2021 22:22

rphillips changed the title ~~WIP: kubelet: only filter out terminated pods if the resources have been reclaimed~~ WIP: kubelet: add pods to termination map Jan 26, 2021

rphillips force-pushed the fixes/98142 branch from 78775d5 to a94d9ba Compare January 26, 2021 23:29

haircommander reviewed Jan 27, 2021

pkg/kubelet/kubelet_pods.go (outdated)

haircommander mentioned this pull request Jan 27, 2021

REQUEST: New membership for haircommander kubernetes/org#2463

Closed

6 tasks

ehashman reviewed Jan 28, 2021

pkg/kubelet/kubelet_pods.go (outdated)

rphillips force-pushed the fixes/98142 branch 2 times, most recently from 3215c07 to a87e5b7 Compare January 28, 2021 01:18

rphillips mentioned this pull request Feb 4, 2021

Bug 1915085: UPSTREAM: 98424: register all pending pod deletions and check for kill openshift/kubernetes#551

Merged

rphillips force-pushed the fixes/98142 branch from 9679fbd to 3cdd573 Compare February 4, 2021 17:28

k8s-ci-robot removed the lgtm label ("Looks good to me", indicates that a PR is ready to be merged) Feb 4, 2021

register all pending pod deletions and check for kill

f918e11

do not delete the cgroup from a pod when it is being killed

rphillips force-pushed the fixes/98142 branch from 3cdd573 to f918e11 Compare February 4, 2021 17:46

k8s-ci-robot added the lgtm label ("Looks good to me", indicates that a PR is ready to be merged) Feb 4, 2021

ehashman reviewed Feb 4, 2021


k8s-ci-robot merged commit c14465c into kubernetes:master Feb 4, 2021

SIG Node PR Triage automation moved this from Needs Approver to Done Feb 4, 2021

k8s-ci-robot added this to the v1.21 milestone Feb 4, 2021

This was referenced Feb 9, 2021

apiserver: remove terminates kube-apiserver gracefully test openshift/origin#25720

Closed

kubelet_test: fixes race in TestSyncPodsDeletesWhenSourcesAreReadyPerQOS #98938

Merged

ehashman mentioned this pull request Feb 17, 2021

Pods stuck on terminating #51835

Closed

rphillips mentioned this pull request Apr 27, 2021

kubelet: do not cleanup volumes if pod is being killed #101524

Merged

This was referenced May 24, 2021

Rebase 1.20.7 openshift/kubernetes#765

Closed

Rebase 1.20.7 openshift/kubernetes#772

Closed

openshift-ci-robot mentioned this pull request Jun 1, 2021

Bug 1958371: UPSTREAM: 98424: register all pending pod deletions and check for kill openshift/kubernetes#779

Merged

ehashman mentioned this pull request Jun 24, 2021

CronJob produces seemingly random warnings for sandbox creation failures #95916

Closed

openshift-ci-robot mentioned this pull request Jul 8, 2021

[WIP] [release-4.6] Rebase onto v1.19.12 openshift/kubernetes#850

Closed

openshift-ci-robot mentioned this pull request Sep 7, 2021

Bug 2003027: Rebase 1.20.10 openshift/kubernetes#935

Merged

openshift-ci-robot mentioned this pull request Sep 16, 2021

[release-4.6] Bug 2008266: Rebase 1.19.14 openshift/kubernetes#962

Merged
