Prevent stranding of partial load balancer resources #7852
Conversation
+cc @smarterclayton (openshift)
openstack != openshift, Max :) cc @anguslees for openstack change
Passes LB e2es consistently
How embarrassing... that's what I get for attempting to triage during a meeting :-)
:) I'd like to pretend like some marketing guy probably didn't anticipate that when the name was chosen, but I know better ;)
@@ -66,7 +66,8 @@ type TCPLoadBalancer interface {
 	CreateTCPLoadBalancer(name, region string, externalIP net.IP, ports []int, hosts []string, affinityType api.AffinityType) (string, error)
 	// UpdateTCPLoadBalancer updates hosts under the specified load balancer.
 	UpdateTCPLoadBalancer(name, region string, hosts []string) error
-	// DeleteTCPLoadBalancer deletes a specified load balancer.
+	// DeleteTCPLoadBalancer deletes a specified load balancer. It should return
+	// nil if the load balancer specified already didn't exist.
 	DeleteTCPLoadBalancer(name, region string) error
I'd suggest renaming to something like EnsureTCPLoadBalancerDeleted() to better advertise its idempotence.
The main feedback is that you need to retry pool deletion. The other comments are cosmetic, and won't block merging.
Force-pushed from 5dc516a to 93a0574.
Rebased and fixed all your comments except the pool error one.
return err
// It's ok if the pool doesn't exist, as we may still need to delete the vip
// (although I don't believe the system should ever be in that state).
pool, poolErr := pools.Get(lb.network, vip.PoolID).Extract()
If the above returned ErrNotFound, then vip.PoolID will be undefined here.
@a-robinson let me know when you're ready for a re-review. Assigning to you until then. It looks like you still need to address some of @anguslees' openstack feedback. Also, it would be good to get someone to test this on openstack before merging. Perhaps @anguslees can do that for us?
Sorry about the delay, I had some other things float to the top of my priority queue this week. I think we should be fine with getting pools by name rather than ID, but have separated that out into issue #8352. In the meantime, I've modified this PR to effectively maintain current behavior, just returning nil rather than an error if we attempt to delete a VIP that doesn't exist, paralleling what would happen if we first attempted to get the load balancer before deleting it.
@quinton-hoole, are you alright with merging this?
Apologies for the delay. I'll review it now. Has this been tested on openstack yet, as per the above comment?
No, I don't believe it has been tested on openstack, but @anguslees seems to be happy with it?
} else if vipErr != nil {
	return vipErr
}
vipExists := (vipErr == nil)
I don't think that it's possible to get here without vipErr being nil, in which case vipExists is superfluous, right?
Just two minor things left to check/fix, as far as I can see.
to load balancers having already been deleted.
controller to rely on that, so that we won't strand partial resources from them anymore (target pools in GCE, pools in OpenStack, etc.).
Re-pushed the code with the vipExists logic torn out and TODOs clarified with the openstack issue number.
Have you performed a few runs of the services e2e tests against this code to confirm that:
Can anyone explain why the v1.0 candidate milestone tag was removed from this one? I don't think that we can launch without this, as all clusters that employ external load balancers on GCE will stop working properly within a few days, like our current soak test clusters.
Tentatively re-adding v1.0-candidate milestone until I understand why it's not v1.0 worthy. |
Prevent stranding of partial load balancer resources
Will likely fix #7753, although the proof is in the pudding. An explanation of what this is doing is in the comment there.
@quinton-hoole