kubelet: don't reject pods without adding them to the pod manager #37661
Conversation
```go
existingPods := kl.podManager.GetPods()
// Always add the pod to the pod manager. Kubelet relies on the pod
// manager as the source of truth for the desired state. If a pod does
// not esist in the pod manager, it means that it has been deleted in
```
s/esist/exist
fixed
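For illustration, here is a minimal, self-contained sketch of the ordering this PR establishes. The real kubelet types are far richer; names like `podManager`, `canAdmitPod`, and `rejectPod` mirror kubelet conventions but are simplified stand-ins here.

```go
// A toy sketch of the fix: add the pod to the pod manager *before*
// the admission decision, so a rejected pod's status update is never
// silently dropped. Types are simplified, not the real kubelet source.
package main

import "fmt"

type Pod struct {
	UID  string
	Name string
}

// podManager is a stand-in for the kubelet's pod manager, the source
// of truth for the desired state of pods on the node.
type podManager struct {
	pods map[string]*Pod
}

func (pm *podManager) AddPod(p *Pod) { pm.pods[p.UID] = p }

func (pm *podManager) GetPods() []*Pod {
	out := make([]*Pod, 0, len(pm.pods))
	for _, p := range pm.pods {
		out = append(out, p)
	}
	return out
}

// canAdmitPod is a placeholder admission check; the real kubelet
// evaluates resource fit, node conditions, and more.
func canAdmitPod(existing []*Pod, p *Pod) (bool, string) {
	if len(existing) >= 2 {
		return false, "OutOfCapacity"
	}
	return true, ""
}

// rejectPod stands in for the kubelet recording a Failed status; that
// status update only reaches the apiserver if the pod is in the pod
// manager.
func rejectPod(p *Pod, reason string) {
	fmt.Printf("rejected %s: %s\n", p.Name, reason)
}

func handlePodAdditions(pm *podManager, pods []*Pod) {
	for _, pod := range pods {
		existingPods := pm.GetPods()
		// Always add the pod to the pod manager first (the fix).
		pm.AddPod(pod)
		if ok, reason := canAdmitPod(existingPods, pod); !ok {
			rejectPod(pod, reason)
			continue
		}
		fmt.Printf("admitted %s\n", pod.Name)
	}
}

func main() {
	pm := &podManager{pods: map[string]*Pod{}}
	handlePodAdditions(pm, []*Pod{{"1", "a"}, {"2", "b"}, {"3", "c"}})
}
```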
LGTM. Thanks for the fix. And sorry for not catching this when reviewing that PR. :p
Force-pushed from 79b97ce to 66690bd (Compare)
Jenkins GCI GKE smoke e2e failed for commit 66690bd0df8a941fe3999da512c7e60d384cc8a9. Full PR test history. The magic incantation to run this job again is `@k8s-bot gci gke e2e test this`.
@sjenning - I think this may also fix the issue we saw today where pods that were stuck terminating, even when all their containers had terminated, remained stuck after a kubelet restart. It seems those pods were never getting their status updated, which would happen if they had been filtered out of the pod manager. Need to verify this more, though.
@k8s-bot gci gke e2e test this
Should this get a cherry-pick label @dchen1107, @Random-Liu?
Yes. Just added.
FWIW, I reproduced this issue in my own cluster using a somewhat extreme test case (>1000 pods assigned to one node), and verified the fix. However, it's good to keep in mind that the kubelet's apiserver client has limited QPS, and if the replication controller keeps creating new pods assigned to the node, the kubelet's status update throughput would never be able to catch up.
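As a hedged illustration of that QPS limit, the sketch below configures a client-go `rest.Config` with explicit QPS and Burst. `QPS` and `Burst` are real client-go fields, but the values here are arbitrary examples, not kubelet defaults.

```go
// Sketch: a rate-limited apiserver client via client-go. Every status
// PATCH/PUT counts against this budget; if pods are created faster
// than roughly QPS per second, status updates queue up and never
// catch up, which is the scenario described above.
package main

import (
	"fmt"

	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func newRateLimitedClient() (*kubernetes.Clientset, error) {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		return nil, err
	}
	cfg.QPS = 5    // sustained requests per second (example value)
	cfg.Burst = 10 // short-term burst allowance (example value)
	return kubernetes.NewForConfig(cfg)
}

func main() {
	if _, err := newRateLimitedClient(); err != nil {
		fmt.Println("not running in a cluster:", err)
	}
}
```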
Force-pushed from 66690bd to bddcdf7 (Compare)
@dchen1107 @Random-Liu I added a check to see whether the pod in question has terminated or not. PTAL again, thanks! /cc @dashpole
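A minimal sketch of what such a terminated check typically looks like, assuming the usual definition that a pod is terminated once its phase is Failed or Succeeded; the helper name `podIsTerminated` is illustrative, not necessarily the exact kubelet code.

```go
// Sketch of the terminated-pod check: terminated pods consume no
// resources, so admission can be skipped entirely, per this PR.
package main

import "fmt"

type PodPhase string

const (
	PodPending   PodPhase = "Pending"
	PodRunning   PodPhase = "Running"
	PodSucceeded PodPhase = "Succeeded"
	PodFailed    PodPhase = "Failed"
)

type PodStatus struct{ Phase PodPhase }

type Pod struct {
	Name   string
	Status PodStatus
}

func podIsTerminated(p *Pod) bool {
	return p.Status.Phase == PodFailed || p.Status.Phase == PodSucceeded
}

func main() {
	p := &Pod{Name: "done", Status: PodStatus{Phase: PodSucceeded}}
	if podIsTerminated(p) {
		// Skip the admission process completely for terminated pods.
		fmt.Println("skipping admission for", p.Name)
	}
}
```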
Force-pushed from bddcdf7 to a71e6b7 (Compare)
Jenkins GCE etcd3 e2e failed for commit a71e6b780b789149f4fd7de1fe98127fc61bd0e4. Full PR test history. The magic incantation to run this job again is `@k8s-bot gce etcd3 e2e test this`.
@k8s-bot gce etcd3 e2e test this
kubelet relies on the pod manager as a cache of the pods in the apiserver (and other sources). The cache should be kept up-to-date even when rejecting pods. Without this, kubelet may decide at any point to drop the status update (request to the apiserver) for the rejected pod, since it would think the pod no longer exists in the apiserver. Also check whether the pod to be admitted has terminated; if it has, skip the admission process completely.
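To make the failure mode concrete, here is a simplified sketch of why a status update gets dropped, under the assumption stated above that the status path consults the pod manager before syncing to the apiserver; the `statusManager` type and its methods here are illustrative only.

```go
// Sketch: status updates for pods missing from the pod manager are
// dropped, because the kubelet treats absence as "deleted from the
// apiserver". Before this fix, rejected pods were never added to the
// pod manager, so their Failed status was silently discarded.
package main

import "fmt"

type Pod struct{ UID, Name string }

type podManager struct{ pods map[string]*Pod }

func (pm *podManager) GetPodByUID(uid string) (*Pod, bool) {
	p, ok := pm.pods[uid]
	return p, ok
}

type statusManager struct{ pm *podManager }

func (sm *statusManager) syncPodStatus(uid string, status string) {
	// Pod not in the pod manager: assume it was deleted and drop
	// the update rather than sending it to the apiserver.
	if _, ok := sm.pm.GetPodByUID(uid); !ok {
		fmt.Printf("dropping status update for %s: pod not in pod manager\n", uid)
		return
	}
	fmt.Printf("sending status %q for %s to apiserver\n", status, uid)
}

func main() {
	pm := &podManager{pods: map[string]*Pod{"1": {"1", "a"}}}
	sm := &statusManager{pm: pm}
	sm.syncPodStatus("1", "Running") // known pod: update sent
	sm.syncPodStatus("2", "Failed")  // rejected pod pre-fix: dropped
}
```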
Force-pushed from a71e6b7 to 69caf53 (Compare)
LGTM
Automatic merge from submit-queue
…61-upstream-release-1.5 Automated cherry pick of #37661
This should fix #37658