
ReplicaSetController can miss handling the deletion of a ReplicaSet #69376

Closed

zegl opened this issue Oct 3, 2018 · 29 comments · Fixed by #69739 or #82572
Labels
kind/bug Categorizes issue or PR as related to a bug. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. sig/apps Categorizes an issue or PR as relevant to SIG Apps.

Comments

@zegl
Contributor

zegl commented Oct 3, 2018

Is this a BUG REPORT or FEATURE REQUEST?:
/kind bug

What happened:

@ncdc's description hoisted from #69376 (comment):

I was about to file this. I spent yesterday triaging it and have determined the root cause. The issue is that the ReplicaSetController can "miss" handling the deletion of a ReplicaSet if things happen quickly enough. Here's the flow:

  1. Client creates rs
  2. ReplicaSetController sees new rs, starts working on creating pods
  3. Client deletes rs
  4. ReplicaSetController's rsInformer sees the rs deletion and calls rsc.enqueueReplicaSet, which adds the namespace/name of the rs to the work queue
  5. Client recreates rs with the exact same name as before
  6. ReplicaSetController's rsInformer sees the rs addition and calls rsc.enqueueReplicaSet, which adds the namespace/name of the rs to the work queue (again)
  7. ReplicaSetController's sync handler processes the entry from the queue
  8. Because the rs was recreated with the same name, when syncReplicaSet calls rsLister to get the rs, it's there (it's the 2nd one)

This is a timing issue. The ReplicaSetController doesn't check the rs's UID, and if the order of operations is "just right", the controller's sync handler won't "see" the deletion, so it never calls rsc.expectations.DeleteExpectations(key) to reflect that the rs was deleted.
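To make that concrete, here is a minimal standalone sketch (not code from this issue) showing that the work-queue key carries only namespace and name, so a deleted ReplicaSet and a same-named replacement are indistinguishable to the sync handler. It uses client-go's cache.MetaNamespaceKeyFunc, the same namespace/name keying scheme the controllers build on:

```go
package main

import (
	"fmt"

	appsv1 "k8s.io/api/apps/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/tools/cache"
)

func main() {
	// Two distinct objects: the ReplicaSet that was deleted and its same-named replacement.
	oldRS := &appsv1.ReplicaSet{ObjectMeta: metav1.ObjectMeta{Namespace: "default", Name: "rs", UID: "111"}}
	newRS := &appsv1.ReplicaSet{ObjectMeta: metav1.ObjectMeta{Namespace: "default", Name: "rs", UID: "222"}}

	oldKey, _ := cache.MetaNamespaceKeyFunc(oldRS)
	newKey, _ := cache.MetaNamespaceKeyFunc(newRS)

	// Both keys are "default/rs": the UID is not part of the key, so the sync
	// handler has no way to tell the recreated rs apart from the deleted one.
	fmt.Println(oldKey, newKey, oldKey == newKey)
}
```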

/sig apps

Original description follows:

In #69344 I had to re-run the integration test 4 times to get it to work.

Log outputs:

@k8s-ci-robot k8s-ci-robot added kind/bug Categorizes issue or PR as related to a bug. needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Oct 3, 2018
@neolit123
Member

neolit123 commented Oct 4, 2018

/kind flake
/sig testing

@k8s-ci-robot k8s-ci-robot added kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test. sig/testing Categorizes an issue or PR as relevant to SIG Testing. kind/flake Categorizes issue or PR as related to a flaky test. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Oct 4, 2018
@neolit123
Member

/remove-kind failing-test

@k8s-ci-robot k8s-ci-robot removed the kind/failing-test Categorizes issue or PR as related to a consistently or frequently failing test. label Oct 4, 2018
@ncdc ncdc changed the title from "pull-integration-test is flakey" to "pull-integration-test (run_rs_tests) is flakey" Oct 5, 2018
@ncdc
Member

ncdc commented Oct 5, 2018

I was about to file this. I spent yesterday triaging it and have determined the root cause. The issue is that the ReplicaSetController can "miss" handling the deletion of a ReplicaSet if things happen quickly enough. Here's the flow:

  1. Client creates rs
  2. ReplicaSetController sees new rs, starts working on creating pods
  3. Client deletes rs
  4. ReplicaSetController's rsInformer sees the rs deletion and calls rsc.enqueueReplicaSet, which adds the namespace/name of the rs to the work queue
  5. Client recreates rs with the exact same name as before
  6. ReplicaSetController's rsInformer sees the rs addition and calls rsc.enqueueReplicaSet, which adds the namespace/name of the rs to the work queue (again)
  7. ReplicaSetController's sync handler processes the entry from the queue
  8. Because the rs was recreated with the same name, when syncReplicaSet calls rsLister to get the rs, it's there (it's the 2nd one)

This is a timing issue. The ReplicaSetController doesn't check the rs's UID, and if the order of operations is "just right", the controller's sync handler won't "see" the deletion, so it never calls rsc.expectations.DeleteExpectations(key) to reflect that the rs was deleted.

/sig apps

@k8s-ci-robot k8s-ci-robot added the sig/apps Categorizes an issue or PR as relevant to SIG Apps. label Oct 5, 2018
@ncdc
Member

ncdc commented Oct 5, 2018

For a bit more clarity:

When the rs informer sees a new rs, it calls

```go
AddFunc: rsc.enqueueReplicaSet,
```

When the rs informer sees a deleted rs, it calls

```go
DeleteFunc: rsc.enqueueReplicaSet,
```

The net effect is that an add and a delete of the same ReplicaSet both enqueue the same namespace/name key.

Once a key ($namespace/$name) is in the queue, syncReplicaSet pops it off and then does

```go
rs, err := rsc.rsLister.ReplicaSets(namespace).Get(name)
if errors.IsNotFound(err) {
	glog.V(4).Infof("%v %v has been deleted", rsc.Kind, key)
	rsc.expectations.DeleteExpectations(key)
	return nil
}
```

The problem is that there's no guarantee that the order of operations is always like this:

  1. AddFunc -> enqueueReplicaSet -> pop -> rsLister.Get() -> normal processing
  2. DeleteFunc -> enqueueReplicaSet -> pop -> rsLister.Get() -> not found -> delete expectations
  3. AddFunc -> enqueueReplicaSet -> pop -> rsLister.Get() -> normal processing

When the test fails, this is what happens:

  1. AddFunc -> enqueueReplicaSet -> pop -> rsLister.Get() -> normal processing
  2. DeleteFunc -> enqueueReplicaSet (note the change here - no pop!)
  3. AddFunc -> enqueueReplicaSet -> pop -> rsLister.Get() -> normal processing

I confirmed this by adding some additional print statements to AddFunc and DeleteFunc. The DeleteFunc is always called at the "right" time. But sometimes syncReplicaSet isn't scheduled fast enough, so the new rs is already visible by the time syncReplicaSet is executed.
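The "no pop" case falls out of how the work queue deduplicates keys that have not been processed yet. A minimal standalone sketch of that behaviour, using client-go's workqueue package directly rather than the controller's own queue (the key string here is illustrative):

```go
package main

import (
	"fmt"

	"k8s.io/client-go/util/workqueue"
)

func main() {
	q := workqueue.New()
	key := "default/my-rs" // same namespace/name key for the delete and the re-add

	q.Add(key) // DeleteFunc -> enqueueReplicaSet
	q.Add(key) // AddFunc for the recreated rs -> enqueueReplicaSet

	// Items that have not been picked up yet are deduplicated, so only one
	// sync will run for both events.
	fmt.Println("queue length:", q.Len()) // 1

	item, _ := q.Get()
	q.Done(item)
}
```

Because the delete and the re-add collapse into a single work item, the one sync that does run already finds the recreated rs in the lister.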

@spiffxp
Member

spiffxp commented Oct 9, 2018

/milestone v1.13
FYI @jberkus

Yeah, this is definitely becoming more of a problem; the daily failure rate for pull-integration-test is over 50%.

It shows up on triage, but I can't point to when it started becoming more flaky: https://storage.googleapis.com/k8s-gubernator/triage/index.html?pr=1&job=integration

@k8s-ci-robot k8s-ci-robot added this to the v1.13 milestone Oct 9, 2018
@spiffxp
Member

spiffxp commented Oct 9, 2018

/priority critical-urgent

@k8s-ci-robot k8s-ci-robot added the priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. label Oct 9, 2018
@ncdc
Member

ncdc commented Oct 11, 2018

@kubernetes/sig-apps-bugs do you think the controller should be changed to differentiate between two ReplicaSets with the same name but different UIDs?

@mattfarina
Contributor

@mortent
Member

mortent commented Oct 12, 2018

The flaky test is actually testing kubectl, so an option is to split this into two issues. We can fix the test to avoid this problem, which would remove the flakiness for people working on other things, and then separately come up with a fix for the issue in the ReplicaSet controller.

I created a PR that updates the test to avoid creating replicasets with the same name: #69739

@lavalamp
Member

The replica set controller needs a fix. Thanks for the clear description, @ncdc.

/assign @kow3ns

@lavalamp
Member

Alternatively, perhaps we could fix this by adding UID to the key used in the delta fifo?
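For illustration only, a UID-qualified key might look like the hypothetical helper below; uidScopedKey is not an existing client-go function, just a sketch of the idea:

```go
package main

import (
	"fmt"

	appsv1 "k8s.io/api/apps/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// uidScopedKey is a hypothetical key function: it folds the UID into the usual
// namespace/name key, so a deleted object and a same-named replacement can no
// longer collapse into a single work item.
func uidScopedKey(obj metav1.Object) string {
	return obj.GetNamespace() + "/" + obj.GetName() + "/" + string(obj.GetUID())
}

func main() {
	oldRS := &appsv1.ReplicaSet{ObjectMeta: metav1.ObjectMeta{Namespace: "default", Name: "rs", UID: "111"}}
	newRS := &appsv1.ReplicaSet{ObjectMeta: metav1.ObjectMeta{Namespace: "default", Name: "rs", UID: "222"}}
	fmt.Println(uidScopedKey(oldRS)) // default/rs/111
	fmt.Println(uidScopedKey(newRS)) // default/rs/222
}
```

With keys like these, the delete of the old rs and the add of its replacement would enqueue distinct work items, though everything that parses keys would then also need to understand the UID suffix.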

@spiffxp
Member

spiffxp commented Oct 17, 2018

https://storage.googleapis.com/k8s-gubernator/triage/index.html?pr=1&job=integration#3e7dba9374ac0dbc9c11 @kow3ns so flakes are definitely down now that we're working around this (thanks @mortent!), but do we have an issue for the "proper" fix @lavalamp proposed?

@kow3ns
Member

kow3ns commented Nov 14, 2018

/assign janetkuo

@kow3ns
Member

kow3ns commented Nov 14, 2018

We should get a fix in for the next release

@AishSundar
Contributor

As per @kow3ns' comment, moving this to 1.14

/milestone v1.14

@k8s-ci-robot k8s-ci-robot modified the milestones: v1.13, v1.14 Nov 16, 2018
runyontr added a commit to runyontr/kubernetes that referenced this issue Jan 15, 2019
When the ReplicaSet controller fetches a fresh version of the ReplicaSet from the server, it validates that the UID of the fresh copy matches the one in the cache. If the UIDs do not match, the race condition outlined in this issue must have occurred, and the controller should interpret the call to syncReplicaSet as a delete.

Reverts the merge that temporarily fixed the test that would occasionally fail as a result of this bug.
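A rough sketch of the idea in that commit message, with illustrative names (isSameInstance, cachedUID) rather than the actual code from PR #72927: compare the UID of the freshly fetched ReplicaSet against the UID the controller has been tracking, and treat a mismatch as a deletion of the old object.

```go
package main

import (
	"fmt"

	appsv1 "k8s.io/api/apps/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
)

// isSameInstance reports whether the freshly listed ReplicaSet is the same object
// the controller has been tracking; a UID mismatch means the tracked rs was
// deleted and a new one with the same name was created in between.
func isSameInstance(fresh *appsv1.ReplicaSet, cachedUID types.UID) bool {
	return fresh.UID == cachedUID
}

func main() {
	cachedUID := types.UID("111") // UID of the rs the controller created pods for
	recreated := &appsv1.ReplicaSet{ObjectMeta: metav1.ObjectMeta{Namespace: "default", Name: "rs", UID: "222"}}

	if !isSameInstance(recreated, cachedUID) {
		// Treat this sync as the deletion of the old rs: its expectations should
		// be cleared before the new object is handled.
		fmt.Println("UID mismatch: the tracked rs was deleted and recreated")
	}
}
```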
@nikopen
Contributor

nikopen commented Mar 1, 2019

(tracking update - in progress #72927)

@nikopen
Contributor

nikopen commented Mar 11, 2019

This is an important bug but the fix in #72927 still needs work @runyontr

Shifting milestones
/milestone v1.15

@kow3ns @spiffxp

@k8s-ci-robot k8s-ci-robot modified the milestones: v1.14, v1.15 Mar 11, 2019
@timmycarr

Hi! We are starting the code freeze for 1.15 tomorrow EOD. Just checking in to see if this issue is still planned for the 1.15 cycle?

@lavalamp
Member

I think #72927 is pretty close; I left a few comments.

@soggiest

soggiest commented Jun 3, 2019

/milestone v1.16

@k8s-ci-robot k8s-ci-robot modified the milestones: v1.15, v1.16 Jun 3, 2019
@Pothulapati
Contributor

Hello! I'm part of the bug triage team for the 1.16 release cycle, and since this issue is tagged for 1.16 but hasn't been updated in a long time, I'd like to check its status. The code freeze starts on August 29th (about 1.5 weeks from now), which means there should be a PR ready (and merged) by then.

Do we still intend to fix this issue in 1.16? If not, please re-tag the issue with the planned milestone.

@xmudrii
Member

xmudrii commented Sep 5, 2019

@lavalamp @liggitt @janetkuo This issue and the relevant PR have been open for a long time now. Should we move this to the next milestone or should we remove it from the milestone entirely?

@liggitt
Member

liggitt commented Sep 5, 2019

A change this low-level needs more soak time. Moving to 1.17

/milestone v1.17

@tnozicka
Contributor

tnozicka commented Sep 12, 2019

I think fixing the handlers and handling expectations in them will fix this issue: #82572

/assign
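A minimal sketch of the direction #82572 describes, assuming a simplified expectations store and enqueue function rather than the real controller types (this is not the code in #82572): clear the expectations inside the delete handler itself, so the cleanup no longer depends on the sync handler observing a not-found ReplicaSet.

```go
package main

import (
	"fmt"
	"sync"

	appsv1 "k8s.io/api/apps/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/tools/cache"
)

// expectations is a simplified stand-in for the controller's expectations store,
// keyed by namespace/name.
type expectations struct {
	mu   sync.Mutex
	data map[string]struct{}
}

func (e *expectations) Delete(key string) {
	e.mu.Lock()
	defer e.mu.Unlock()
	delete(e.data, key)
}

// onDelete builds a DeleteFunc-style handler that clears expectations as soon as
// the deletion event is observed, instead of waiting for the sync handler to
// notice that the object is gone.
func onDelete(exp *expectations, enqueue func(string)) func(obj interface{}) {
	return func(obj interface{}) {
		// Deletions may arrive as tombstones when the watch missed the final state.
		if tombstone, ok := obj.(cache.DeletedFinalStateUnknown); ok {
			obj = tombstone.Obj
		}
		key, err := cache.MetaNamespaceKeyFunc(obj)
		if err != nil {
			return
		}
		exp.Delete(key)
		enqueue(key)
	}
}

func main() {
	exp := &expectations{data: map[string]struct{}{"default/rs": {}}}
	deleteFn := onDelete(exp, func(key string) { fmt.Println("enqueued", key) })

	// Simulate the informer delivering the deletion of default/rs.
	rs := &appsv1.ReplicaSet{ObjectMeta: metav1.ObjectMeta{Namespace: "default", Name: "rs", UID: "111"}}
	deleteFn(rs)
	fmt.Println("expectations left:", len(exp.data)) // 0
}
```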

@josiahbjorgaard
Contributor

josiahbjorgaard commented Oct 20, 2019

Bug triage for 1.17 here. This issue has been open for a significant amount of time, and since it is tagged for the 1.17 milestone, we want to let you know that the 1.17 code freeze is coming in less than one month, on Nov. 14th. Will this issue be resolved before then?

@tnozicka
Contributor

I hope it will; PR #82572 is there, it's just a matter of getting a review/tag.
