
Need an API call for "teardown all external resources" #4630

Closed
zmerlynn opened this issue Feb 19, 2015 · 22 comments
Assignees
Labels
area/cloudprovider
area/teardown
lifecycle/frozen: Indicates that an issue or PR should not be auto-closed due to staleness.
priority/backlog: Higher priority than priority/awaiting-more-evidence.
sig/api-machinery: Categorizes an issue or PR as relevant to SIG API Machinery.
sig/cli: Categorizes an issue or PR as relevant to SIG CLI.
sig/cluster-lifecycle: Categorizes an issue or PR as relevant to SIG Cluster Lifecycle.

Comments

@zmerlynn
Member

See #4627 / #4530: these are both the wrong approach, as also noted in #4411 (comment). We need to delete these things on the master, prior to deleting the VM itself. For system add-ons, this is basically the API hook necessary for #3579 cleanup, but it's also required for any user services that were created.

@zmerlynn zmerlynn added the sig/cluster-lifecycle Categorizes an issue or PR as relevant to SIG Cluster Lifecycle. label Feb 19, 2015
@zmerlynn
Member Author

cc @roberthbailey @jlowdermilk

@fgrzadkowski
Contributor

cc @jszczepkowski

Can you please explain what this API would look like? I'm not sure that widening the API of a central component is the right approach.
If an external resource is tightly tied to a resource (e.g. a service), it would be strange to have a separate API call just to remove its external parts (e.g. the load balancer). It may also leave the resource (service) in an inconsistent state.
Additionally, you will still need something like kube-down.sh to clean up other things (e.g. remove machines), so what's the benefit?

@fgrzadkowski
Contributor

Just to be clear: I think that removal of internal services should be handled by the master itself, but I see no reason why we need an API call to delete external resources for user-defined services.

@zmerlynn
Member Author

First, the meta-point: kube-down.sh isn't the only setup and teardown "client". (GKE is another, and more might come along.) That was the meta-point for #3579 as well. To the extent that we can push this logic onto the server, we should.

The second meta-point: today ELBs are the biggest issue, but there are discussions around, say, firewall rules, and those would also need teardown. Or the xyz cloud provider's network widget. The point is, that shell script is going to keep creeping.

As for what the API looks like, we were envisioning something like "kubectl clusterteardown", which makes one API call to the master (call it teardown, or destroy, or destroyExternalResources, or whatever) that could even just be a loop in Go very similar to the rejected PR #4530. (And yes, we'd make one addition to kube-down.sh, right before the VM itself is annihilated.)

The API should probably have best-effort semantics, since the cluster is going down anyway.
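For illustration only, a rough sketch of what such a best-effort loop could look like, written against today's k8s.io/cloud-provider interface rather than anything from 2015 or from PR #4530; the function name and the idea of passing in a pre-fetched service list are assumptions, not anything that shipped:

```go
// Illustrative sketch: delete the cloud load balancer behind every
// LoadBalancer-type service, logging and skipping errors (best effort,
// since the cluster is going away anyway). Not the actual PR #4530 code.
package teardown

import (
	"context"
	"log"

	v1 "k8s.io/api/core/v1"
	cloudprovider "k8s.io/cloud-provider"
)

func teardownExternalLoadBalancers(ctx context.Context, cloud cloudprovider.Interface, clusterName string, services []*v1.Service) {
	lb, supported := cloud.LoadBalancer()
	if !supported {
		return // this provider has no load balancers to tear down
	}
	for _, svc := range services {
		if svc.Spec.Type != v1.ServiceTypeLoadBalancer {
			continue
		}
		if err := lb.EnsureLoadBalancerDeleted(ctx, clusterName, svc); err != nil {
			log.Printf("teardown: could not delete load balancer for %s/%s: %v", svc.Namespace, svc.Name, err)
		}
	}
}
```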

@zmerlynn
Member Author

I just saw your next comment. Why a distinction between user services and add-ons here? They both represent external resources owned by cluster services. Some might be owned by the user, so you could argue that we need policy bits like "don't delete on teardown", but that doesn't mean the client should delete them.

@zmerlynn
Member Author

Also, to be clear, the API can just outright delete the services, too. I don't see a reason it has to delete the underlying resources rather than the services themselves, since it's running in the shutdown path. If we really want to be delicate, we can terminate all objects (#1535) first.

@jszczepkowski jszczepkowski self-assigned this Feb 25, 2015
@jszczepkowski
Contributor

I'll be happy to work on this. I hope no one is working on it already.

@roberthbailey roberthbailey added this to the v1.0 milestone Mar 2, 2015
jszczepkowski added a commit to jszczepkowski/kubernetes that referenced this issue Mar 4, 2015
Implementation of master call "/teardown", which removes all external resources used by the kubernetes cluster (currently, external load balancers are removed).
 Related to kubernetes#4630.
@alex-mohr
Contributor

We need to support deletion of clusters, both for GKE and for e.g. e2e tests. The master will create and delete various CloudProvider objects, so it should own those objects. We need a way to (a) cease accepting new objects, (b) change the desired state of existing objects to does-not-exist, (c) block until the CloudProvider reconciler (or equivalent) finishes actually deleting all of those, then (d) the master can be deleted.

Given the master knows what it created and has code to delete such things (for whatever version of k8s it's running), we should use the master itself to clean up a cluster that needs to be deleted, not require some out-of-band tool to do so.
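Purely as a sketch, the (a) through (d) sequence above could be expressed as a hypothetical Go contract; none of these methods exist in Kubernetes, they only name the steps:

```go
// Hypothetical only: an interface naming the teardown steps described above.
// Nothing here exists in Kubernetes; it is a sketch of the proposed flow.
package teardown

import "context"

type ClusterTeardown interface {
	// (a) stop admitting new objects that would create external resources.
	EnterLameDuck(ctx context.Context) error
	// (b) set the desired state of existing external resources to does-not-exist.
	MarkExternalResourcesDeleted(ctx context.Context) error
	// (c) block until the CloudProvider reconciler has actually removed them.
	WaitForExternalResourcesGone(ctx context.Context) error
}

// RunTeardown executes steps (a) through (c); only after it returns without
// error is it safe for the caller to perform (d), deleting the master itself.
func RunTeardown(ctx context.Context, t ClusterTeardown) error {
	if err := t.EnterLameDuck(ctx); err != nil {
		return err
	}
	if err := t.MarkExternalResourcesDeleted(ctx); err != nil {
		return err
	}
	return t.WaitForExternalResourcesGone(ctx)
}
```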

@bgrant0607
Member

Discussion is occurring in #5025.

@brendandburns
Contributor

I don't think that this makes the 1.0 cut.

@brendandburns brendandburns modified the milestones: v1.0-bubble, v1.0 Mar 23, 2015
@alex-mohr
Contributor

I don't think that this makes the 1.0 cut.

@brendandburns Without this, we orphan resources in GCE on cluster delete. And if, e.g., a user spins up a new cluster with the same name, there will be all sorts of fun from dangling rules. I think this falls under operational reliability.

@lavalamp
Member

I don't think a 'protected' field is necessary for this. Today we have RBAC, finalizers, and GC. I think a client w/ super admin powers could delete all namespaces and then wait for the namespace count to go to 0. (There's probably a corner case or two around the default and kube-system namespaces that this would turn up.)
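A minimal client-go sketch of that approach, assuming a modern client-go (well after this thread) and an admin credential; it sidesteps the corner case mentioned above by skipping the namespaces that, as the following comments note, cannot actually be deleted:

```go
// Illustrative only: delete every non-protected namespace and poll until the
// rest are gone, relying on finalizers and GC to clean up their contents.
package teardown

import (
	"context"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// These namespaces cannot be deleted, which is the corner case noted above.
var protected = map[string]bool{"default": true, "kube-system": true, "kube-public": true}

func drainNamespaces(ctx context.Context, cs kubernetes.Interface) error {
	for {
		nsList, err := cs.CoreV1().Namespaces().List(ctx, metav1.ListOptions{})
		if err != nil {
			return err
		}
		remaining := 0
		for _, ns := range nsList.Items {
			if protected[ns.Name] {
				continue
			}
			remaining++
			// Delete is asynchronous; the namespace disappears once its
			// contents have been finalized and garbage collected.
			_ = cs.CoreV1().Namespaces().Delete(ctx, ns.Name, metav1.DeleteOptions{})
		}
		if remaining == 0 {
			return nil
		}
		select {
		case <-ctx.Done():
			return ctx.Err()
		case <-time.After(5 * time.Second):
		}
	}
}
```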

@zmerlynn
Member Author

I might be missing it, but I haven't found a way to delete default and kube-system. If you come up with one, I'll close the bug. :)

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 1, 2018
@roberthbailey
Contributor

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 8, 2018
@roberthbailey
Contributor

It still isn't possible to delete the following namespaces: kube-system, kube-public, default. So we can't rely on finalizers / GC for resources in those namespaces.

We also don't have a way to put the apiserver into a lame duck mode to prevent new namespaces from being created during cluster teardown.

@bgrant0607 bgrant0607 mentioned this issue Jan 22, 2018
@bgrant0607
Member

/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Jan 23, 2018
@neolit123
Member

Seems like an FR for api-machinery that can eventually land in kubectl (sig-cli).
sig-cluster-lifecycle tools can adopt it via client-go if they need to.

/sig cli api-machinery

@k8s-ci-robot k8s-ci-robot added sig/cli Categorizes an issue or PR as relevant to SIG CLI. sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. labels Sep 3, 2020
@thockin thockin closed this as completed Aug 20, 2022