Cluster shutdown "events" #16337

Closed
bprashanth opened this issue Oct 27, 2015 · 20 comments
Labels
area/teardown
lifecycle/frozen: Indicates that an issue or PR should not be auto-closed due to staleness.
priority/important-soon: Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
sig/api-machinery: Categorizes an issue or PR as relevant to SIG API Machinery.
sig/cluster-lifecycle: Categorizes an issue or PR as relevant to SIG Cluster Lifecycle.
sig/service-catalog: Categorizes an issue or PR as relevant to SIG Service Catalog.

Comments

@bprashanth
Contributor

I'd like to delete certain external resources (load balancers) only when the cluster is being torn down, not when the user deletes the pod or the scheduler preempts it. Currently, scripts like kube-down aren't Kubernetes-aware; they just start nuking cluster resources using gcloud, which means the following happens:

$ ./cluster/kube-up.sh

$ cat <<EOF | kubectl create -f -
> apiVersion: v1
> kind: Service
> metadata:
>   name: nginxsvc
>   labels:
>     app: nginx
> spec:
>   type: LoadBalancer
>   ports:
>   - port: 8080
>     targetPort: 80
>     protocol: TCP
>     name: http
>   - port: 443
>     protocol: TCP
>     name: https
>   selector:
>     app: nginx
> EOF

$ kubectl get svc
NAME         CLUSTER_IP    EXTERNAL_IP      PORT(S)            SELECTOR    AGE
kubernetes   10.0.0.1      <none>           443/TCP            <none>      2m
nginxsvc     10.0.104.32   104.154.47.133   8080/TCP,443/TCP   app=nginx   1m

$ gcloud compute forwarding-rules list 
NAME                              REGION       IP_ADDRESS      IP_PROTOCOL  TARGET
a5653a6897c4311e5bf0242010af0000  us-central1  104.154.47.133  TCP          us-central1/targetPools/a5653a6897c4311e5bf0242010af0000

$ ./cluster/kube-down.sh
Done

$ gcloud compute forwarding-rules list
NAME                              REGION       IP_ADDRESS      IP_PROTOCOL  TARGET
a5653a6897c4311e5bf0242010af0000  us-central1  104.154.47.133  TCP          us-central1/targetPools/a5653a6897c4311e5bf0242010af0000

Having at least the following guarantees when there's a pending shutdown will make life a little easier:

  1. Deny new resource create requests
  2. Notify existing services/controllers/pods
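
For completeness, the manual workaround today is to delete the LoadBalancer Services yourself before running kube-down, so the service controller gets a chance to release the cloud resources. A rough sketch (this is not something kube-down does; nginxsvc is just the Service from the transcript above):

$ kubectl delete svc nginxsvc            # service controller tears down the forwarding rule / target pool
$ gcloud compute forwarding-rules list   # should come back empty once the controller finishes
$ ./cluster/kube-down.sh                 # now safe to nuke the rest of the project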

@thockin

@thockin
Member

thockin commented Oct 27, 2015

@bgrant0607 @brendandburns @smarterclayton


@bgrant0607
Member

@bgrant0607 bgrant0607 added sig/cluster-lifecycle Categorizes an issue or PR as relevant to SIG Cluster Lifecycle. team/control-plane and removed team/cluster labels Oct 27, 2015
@bgrant0607
Member

It's the pod (LB plugin?) that creates the forwarding rule?

@bgrant0607
Member

Ideally there would be some Kubernetes resources that represented the underlying infrastructure resources, such that deleting the former would cause the latter to be deleted.

@bgrant0607
Member

See also #13515

@bprashanth
Contributor Author

My example was with a HEAD kube cluster, so it's the service controller (which runs as part of kube-controller-manager) that creates the forwarding rule for a Service of Type=LoadBalancer, not the L7 LB plugin pod.
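
One way to confirm that from the cluster side (a sketch; the event wording varies by release):

$ kubectl describe svc nginxsvc   # service-controller events, e.g. CreatingLoadBalancer / CreatedLoadBalancer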

@zmerlynn
Member

Yes, GKE still really wants this. It's a pretty big wart.

@bprashanth
Contributor Author

Ideally there would be some Kubernetes resources that represented the underlying infrastructure resources, such that deleting the former would cause the latter to be deleted.

Just receiving a delete is insufficient; in the case of a pod controller, I need to differentiate that SIGTERM from a preemption. A new field, as suggested in some of the other issues, might help, because I can use the grace period to check it in the apiserver.
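
To illustrate what I mean by checking the grace period (the sentinel convention here is entirely made up; nothing sets it today), the controller pod's SIGTERM handler could read back its own object and branch on the value:

$ kubectl get pod "$HOSTNAME" -o jsonpath='{.metadata.deletionGracePeriodSeconds}'
# ordinary delete / preemption -> the usual 30s default
# cluster teardown             -> a distinct value that only the shutdown path would set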

@bprashanth
Contributor Author

Another possible solution would be to map the "Steps" in the simple cluster setup proposal (https://docs.google.com/document/d/1v68yStV2O6aHuRuT3AlWnbe6vkUCtzyC5I7Elhgik3o/edit?ts=561ee618) to runlevels persisted through the apiserver, and to add a shutdown level just like init systems have, hence the title.
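
For instance (purely hypothetical; the annotation key is made up), the shutdown runlevel could be a marker that kube-down writes through the apiserver before touching any cloud resources, and that controllers watch:

$ kubectl annotate namespace kube-system example.com/runlevel=shutdown --overwrite
# controllers that see runlevel=shutdown release their external resources (forwarding
# rules, target pools, ...) instead of treating their own deletion as a preemption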

@thockin
Member

thockin commented Oct 27, 2015

Naively, deleting the Ingress object should trigger the controller Pod to clean up the backend resources. But that requires some heuristic like "pods are deleted last", and even then the ordering has to hold across namespaces.


@bgrant0607
Member

Re. pods deleted last: Another use case for finalizers. #3585
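
To make the ordering concrete, a sketch of how it could be expressed on the Service itself once finalizers exist (the finalizer name is made up; the idea is that the LB controller only removes it after the forwarding rule and target pool are gone, so the delete blocks until cleanup finishes):

$ cat <<EOF | kubectl create -f -
> apiVersion: v1
> kind: Service
> metadata:
>   name: nginxsvc
>   finalizers:
>   - example.com/cloud-loadbalancer
> spec:
>   type: LoadBalancer
>   ports:
>   - port: 80
>   selector:
>     app: nginx
> EOF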

@davidopp
Member

davidopp commented Nov 3, 2015

Also vaguely related to #7459

@davidopp davidopp added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Nov 3, 2015
@davidopp
Member

davidopp commented Nov 3, 2015

@bprashanth I'm marking this P2 but feel free to upgrade to P1 if you think it's higher priority.

@bgrant0607
Member

Also related to #10179

@bgrant0607
Member

Note that new resources cannot be created in namespaces that are being deleted:

return admission.NewForbidden(a, fmt.Errorf("unable to create new content in namespace %s because it is being terminated.", a.GetNamespace()))

And we already treat the default and kube-system namespaces specially (and openshift-infra in OS):

return NewLifecycle(client, sets.NewString(api.NamespaceDefault, api.NamespaceSystem))
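
That behavior is easy to see in practice (the namespace name and pod.yaml are placeholders; the error text is the one from the plugin quoted above):

$ kubectl delete namespace scratch        # namespace goes into Terminating
$ kubectl create -f pod.yaml -n scratch
# rejected by the lifecycle admission plugin: "unable to create new content in
# namespace scratch because it is being terminated."
# A cluster-wide analogue of this check during shutdown would give guarantee 1 above.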

@bgrant0607 bgrant0607 added priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. and removed priority/backlog Higher priority than priority/awaiting-more-evidence. team/control-plane labels Oct 17, 2016
@bgrant0607
Member

@bgrant0607 bgrant0607 added sig/service-catalog Categorizes an issue or PR as relevant to SIG Service Catalog. sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. and removed team/cluster area/kubectl labels Nov 3, 2016
@fejta-bot

Issues go stale after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with a /lifecycle frozen comment.

If this issue is safe to close now, please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 18, 2017
@tonglil
Contributor

tonglil commented Jan 10, 2018

Should this issue be consolidated with something else, or kept open for the teardown of GCLB resources?

@bgrant0607
Member

/remove-lifecycle stale
/lifecycle frozen

cc @roberthbailey

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 22, 2018
@roberthbailey
Contributor

/cc @k4leung4

@thockin thockin closed this as completed Feb 6, 2020