
ReplicationController + crashlooping or invalid docker image = bad times #2529

Closed
lavalamp opened this issue Nov 21, 2014 · 13 comments
Labels: area/usability, priority/important-soon, sig/api-machinery

Comments

@lavalamp (Member)

ReplicationController has no idea when the pods it is making will deterministically fail. This can rapidly fill your cluster with pod objects.

Mitigation: throttle the rate at which replication controllers can make new pods.

Real fix? Make replication controller watch events & stop after N failed tries without any successes?
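
(For illustration only, a minimal sketch of the "stop after N failed tries" idea, assuming a per-controller failure budget fed from pod watch events. The names here are hypothetical and not the actual controller-manager code; a later comment argues for backing off instead of stopping outright.)

package sketch

// failureBudget is a hypothetical per-ReplicationController policy:
// consecutive pod failures observed via watch events consume the budget,
// any success refills it, and pod creation stops once it is exhausted.
type failureBudget struct {
	maxFailures int // N: failed tries tolerated without a success
	failures    int // consecutive failures seen so far
}

// observe is called for each terminal pod event seen on the watch.
func (b *failureBudget) observe(succeeded bool) {
	if succeeded {
		b.failures = 0
		return
	}
	b.failures++
}

// mayCreate reports whether the controller should still replace pods.
func (b *failureBudget) mayCreate() bool {
	return b.failures < b.maxFailures
}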

@bgrant0607 (Member)

See also #941 -- Kubelet has the same problem at the container level.

I'll leave this open to address my comment on #2491 -- gracefully handle pods that don't schedule.

@lavalamp (Member, Author)

To fix your cluster, not that anyone has ever done this:

cluster/kubectl.sh get --output=template --template="{{range .items}}{{.id}}{{\"\n\"}}{{end}}" pods | xargs -l1 cluster/kubectl.sh delete pod
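
(On current API versions the pod name lives at .metadata.name rather than .id; the blunt modern equivalent of the cleanup above is simply kubectl delete pods --all, assuming you really do want every pod in the namespace gone.)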

@goltermann added the priority/important-soon label Nov 26, 2014
@rawlingsj

I've been caught out by this recently too. Having configuration for max number of tries on a replication controller would be great.

@davidopp (Member)

I am confused about this issue. IIUC replication controller only works with pods that specify RestartPolicy = Always. So wouldn't a fix for this (limit # retries, or rate limit retries) need to go into the Kubelet (or whatever is restarting the container -- is it Docker or the Kubelet)? I don't see how replication controller can help. It seems replication controller would only get involved if the pod needs to move off of that machine, but that doesn't seem like the case you're talking about here.

@lavalamp (Member, Author)

@davidopp The problem is in how we report the pod's status; we report it as "failed" when kubelet is actually in the process of restarting it. This causes rep. ctrlr to make more pods, instead of waiting. So the bug here is in the assignment of status. Er, condition. Whatever we're calling it these days.

@bgrant0607 (Member)

ReplicationController shouldn't stop permanently, it should back off. It can be very hard to distinguish temporary from permanent failures, and we don't want users to need to create replication controller controllers. The same applies for Kubelet.

We should find a way to facilitate implementation of more sophisticated policies by users.
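
(A minimal sketch of that back-off behaviour, assuming a capped exponential delay that resets on success. Hypothetical code, not the actual ReplicationManager.)

package sketch

import "time"

// creationBackoff delays pod re-creation with a capped exponential backoff:
// each consecutive failure doubles the wait and a success resets it, so the
// controller never stops permanently but also never floods the cluster.
type creationBackoff struct {
	base, max time.Duration
	current   time.Duration
}

// next returns how long to wait before the next creation attempt.
func (b *creationBackoff) next() time.Duration {
	if b.current == 0 {
		b.current = b.base
	} else {
		b.current *= 2
		if b.current > b.max {
			b.current = b.max
		}
	}
	return b.current
}

// reset is called when a replacement pod actually starts running.
func (b *creationBackoff) reset() { b.current = 0 }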

@lavalamp (Member, Author) commented Jan 7, 2015

To recap.

While kubelet is restarting a pod, apiserver will list its status as "Failed", which will cause replication controller to make additional pods.

Action item here is to make apiserver give a reasonable amount of time (forever?) to kubelet to perform as many restarts as it wants, and during that time the pod should count as Running (if it ever ran) or Pending (if it never successfully started).
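
(A sketch of the phase rule being described, assuming RestartPolicy=Always; illustrative names, not the actual apiserver logic.)

package sketch

// containerView is a simplified view of what kubelet reports per container.
type containerView struct {
	everStarted bool // the container has run at least once
}

// podPhase applies the proposed rule: while kubelet is free to keep
// restarting containers, the pod never shows up as "Failed" -- it is
// "Running" if anything ever ran, and "Pending" if nothing has started yet.
func podPhase(containers []containerView) string {
	for _, c := range containers {
		if c.everStarted {
			return "Running"
		}
	}
	return "Pending"
}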

@lavalamp (Member, Author) commented Jan 7, 2015

(Eventually we want to push pod status generation down into kubelet but I think it's worthwhile making this intermediate fix because this really hoses your cluster when it happens to you.)

@thockin (Member) commented Jan 7, 2015

I don't think it is about a "reasonable amount of time" but about having enough information to make the correct observation.

@dchen1107 (Member)

@lavalamp #2999 was merged a while back; with that change, PodCondition / PodStatus are now determined by both ContainerStatus and RestartPolicy.

@lavalamp (Member, Author) commented Jan 7, 2015

Dawn is right, this bug seems to be fixed and should probably be closed.

@bgrant0607 (Member)

There's still the broader issue of making replication controller do something useful (e.g., raising events, including number of pending pods in status) in this scenario.
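
(Purely a strawman of the kind of status addition being suggested; this field does not exist in the real API.)

package sketch

// hypotheticalRCStatus extends ReplicationController status with a field
// that would let users and tooling see pods the controller wants but has
// not been able to run, instead of the controller silently churning pods.
type hypotheticalRCStatus struct {
	Replicas        int // replicas the controller has created
	PendingReplicas int // desired or created but not yet running
}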

@bgrant0607 added the area/usability and priority/backlog labels and removed the priority/important-soon label Jan 7, 2015
@bgrant0607 added the sig/api-machinery label Feb 5, 2015
@bgrant0607 modified the milestone: v1.0 Feb 6, 2015
@bgrant0607 removed this from the v1.0 milestone Mar 27, 2015
@bgrant0607 added the priority/important-soon label May 8, 2015
@liggitt (Member) commented May 4, 2019

xref #76370
