This repository has been archived by the owner on May 6, 2022. It is now read-only.

Service Instances cascading delete proposal #2734

Closed
mszostok opened this issue Oct 21, 2019 · 11 comments
Labels
lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness.

Comments

@mszostok
Contributor

mszostok commented Oct 21, 2019

Service Instances cascading delete proposal

This is the umbrella issue for the Service Instances cascading delete operation.
In this issue, we also track the implementation of this functionality from alpha to GA stage.

Motivation

Currently, the default behavior during deprovisioning is to fail the operation if there are bindings against the instance being deprovisioned.

Problems:

  • Users expect that deleting a Kubernetes API resource results in the total deletion of resources associated with the API resource
  • There is no going back once the deletion timestamp is set

    OSB API Spec states:

Platforms MUST delete all Service Bindings for a Service Instance prior to attempting to deprovision the Service Instance. This specification does not specify what a Service Broker is to do if it receives a deprovision request while there are still Service Bindings associated with it.

source: https://github.com/openservicebrokerapi/servicebroker/blob/v2.15/spec.md#deprovisioning

Discussions

  • Create consensus on deprovision #46

    Issue created on Nov 22, 2016

    remove the instance first, then remove the bindings.

  • Policy for handling broker deletion and deprovision #481

    F2F meeting in late January, 2017

    On a deprovision operation, we decided that the default case is to fail the deprovision if there are bindings against the instance being deprovisioned. A "force" or "cascade" flag can be explicitly set on the call to make all bindings be unbound, and then the deprovision be done.

  • Determine how to handle deprovision requests to an instance with bindings #820

    F2F meeting on May 9, 2017

    When a user attempts to delete an instance (i.e. kubectl delete instance), Catalog will check if any bindings are associated with the instance. If there are, the operation will fail with an error indicating that the instance is in use. The user can repeat the operation with a -force flag. In this case, the instance will be marked as soft deleted. Instances that don't have bindings will be deleted immediately, with or without the force flag.

  • Implement Matrix of Deleting Things - recursive namespace deletion decision from F2F #2229

    F2F meeting on Jul 23, 2018

    We decided that when an object that owns other objects is deleted, it should recursively delete everything that those objects own as well.

    Action: Delete Instance
    Scenario: Has bindings, broker allows it
    What we do / what will happen: "Depth First Deletion": cascade delete the bindings from the Broker. Once all bindings are deleted, delete the instance (blockOwnerDeletion, foreground deletion). The instance status should say something like "Deprovision will start after bindings are unbound" instead of the current "Can't deprovision until all bindings are unbound".

Proposed behaviors

Based on the discussions above, there are four proposed options:

  1. Use the kubectl --cascade flag

    --cascade=true: If true, cascade the deletion of the resources managed by this resource (e.g. Pods created by a
    ReplicationController). Default true.

    Under the hood, a propagationPolicy is applied to the given resource: with cascade=true it is propagationPolicy=foreground, and with cascade=false it is propagationPolicy=background. The propagationPolicy is handled by Kubernetes garbage collection.

    However, this option cannot be used because of the following behavior:

    The ownerReference mechanism cannot be used because of what happens when a user sets the --cascade=false option: in that case, Kubernetes requires propagationPolicy=background.

    In background cascading deletion, Kubernetes deletes the owner object immediately and the garbage collector then deletes the dependents in the background.

    source: https://kubernetes.io/docs/concepts/workloads/controllers/garbage-collection/#background-cascading-deletion

    In our case, the bindings must be deleted before the instance, so the background behavior is not an option for Service Catalog (see the client-go sketch after this list for how the two policies are expressed).

  2. Global flag for changing the behavior
    Add a disable-cascading-deletion=<true/false> flag to the Service Catalog controller-manager. This flag changes the behavior of the controller-manager globally.

  3. ServiceInstance spec property changes
    We can add a deleteBehavior field under the Instance spec. The field has two possible values: forceCascade or failOnBindings. In the former case, deleting an instance force-deletes all Bindings that reference it, without asking the person who deletes it for confirmation. In the latter, the deletion simply fails if there are bindings that reference the instance. This field will be optional and will default to failOnBindings.

  4. Always perform cascade deletion
    When a user deletes a ServiceInstance, the related ServiceBindings are deleted first (the unbind call is executed against the broker). When all ServiceBindings have been deleted successfully, the ServiceInstance is deleted as well (the deprovision call is executed against the broker).
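
For reference, the two propagation policies discussed in option 1 are expressed through DeleteOptions in the Kubernetes API. The following is a minimal client-go sketch, not part of the proposal: the deleteInstance helper, the GroupVersionResource, and the dynamic-client wiring are illustrative assumptions. It shows why background propagation (owner first, dependents later) conflicts with the binding-before-instance ordering required above.

```go
// Illustrative sketch only: how a foreground vs. background delete of a
// ServiceInstance would be issued with the dynamic client. With background
// propagation the owner object is removed first and its dependents are
// garbage-collected afterwards, i.e. the opposite of the ordering the OSB
// API requires for bindings.
package main

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/runtime/schema"
	"k8s.io/client-go/dynamic"
)

// Assumed GVR for ServiceInstance; adjust to the installed API version.
var serviceInstanceGVR = schema.GroupVersionResource{
	Group:    "servicecatalog.k8s.io",
	Version:  "v1beta1",
	Resource: "serviceinstances",
}

// deleteInstance issues the delete request with the chosen propagation policy.
func deleteInstance(ctx context.Context, c dynamic.Interface, ns, name string, foreground bool) error {
	policy := metav1.DeletePropagationBackground
	if foreground {
		policy = metav1.DeletePropagationForeground
	}
	return c.Resource(serviceInstanceGVR).Namespace(ns).Delete(ctx, name, metav1.DeleteOptions{
		PropagationPolicy: &policy,
	})
}
```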

Accepted solution

Based on client feedback and the OSB API specification, we know that we want to implement this behavior. To do it safely, we decided to go with the 2nd option: a global flag for changing the behavior.

Reasons:

  1. OSB API Spec:

    Platforms MUST delete all Service Bindings for a Service Instance prior to attempting to deprovision the Service Instance. This specification does not specify what a Service Broker is to do if it receives a deprovision request while there are still Service Bindings associated with it.

    source: https://github.com/openservicebrokerapi/servicebroker/blob/v2.15/spec.md#deprovisioning

  2. If users expect that deleting a Kubernetes API resource results in the total deletion of the resources associated with it, they can enable cascading deletion and the controller-manager will take care of it.

This solution was accepted and the alpha implementation has already been added in PR #2711. Starting from Service Catalog version 0.3.0, you can enable this option by setting the CascadingDeletion feature gate to true. For users who do not want cascading deletion, the controller-manager provides the disable-cascading-deletion flag, which blocks the feature even if cascading deletion becomes enabled by default in the future. A rough sketch of the resulting order of operations is shown below.
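
For illustration, when the CascadingDeletion behavior is enabled, the effective order of operations is "unbind everything that references the instance, then deprovision the instance". The sketch below is a simplification and not the controller-manager code; the cascadeDeleteInstance helper, the GVRs, and the spec.instanceRef.name lookup are assumptions based on the Service Catalog API shape.

```go
// Simplified sketch (not the actual controller-manager implementation) of the
// cascading order: delete every ServiceBinding that references the instance
// (each delete triggers an unbind call to the broker), then delete the
// ServiceInstance itself (which triggers the deprovision call).
package main

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/apis/meta/v1/unstructured"
	"k8s.io/apimachinery/pkg/runtime/schema"
	"k8s.io/client-go/dynamic"
)

var (
	bindingGVR  = schema.GroupVersionResource{Group: "servicecatalog.k8s.io", Version: "v1beta1", Resource: "servicebindings"}
	instanceGVR = schema.GroupVersionResource{Group: "servicecatalog.k8s.io", Version: "v1beta1", Resource: "serviceinstances"}
)

func cascadeDeleteInstance(ctx context.Context, c dynamic.Interface, ns, instance string) error {
	bindings, err := c.Resource(bindingGVR).Namespace(ns).List(ctx, metav1.ListOptions{})
	if err != nil {
		return err
	}
	for _, b := range bindings.Items {
		// Field path assumed from the ServiceBinding spec (spec.instanceRef.name).
		ref, found, err := unstructured.NestedString(b.Object, "spec", "instanceRef", "name")
		if err != nil || !found || ref != instance {
			continue // not bound to this instance
		}
		if err := c.Resource(bindingGVR).Namespace(ns).Delete(ctx, b.GetName(), metav1.DeleteOptions{}); err != nil {
			return fmt.Errorf("deleting binding %s: %w", b.GetName(), err)
		}
	}
	// The real controller waits until all bindings are actually gone (their
	// finalizers removed) before this step; the sketch just issues the delete.
	return c.Resource(instanceGVR).Namespace(ns).Delete(ctx, instance, metav1.DeleteOptions{})
}
```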

ACTION REQUIRED

Going with this behavior is backward compatible, and it was implemented as alpha.
We are waiting for your feedback and use-cases: if this approach happens to break your flow, that is unacceptable to us, so please let us know.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 13, 2020
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 14, 2020
@mszostok
Contributor Author

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Mar 23, 2020
@mszostok
Contributor Author

ICYMI:

The cascading binding deletion feature is officially available in the 0.3.0 release. The controller manager deletes all Service Bindings for a Service Instance before attempting to deprovision the Service Instance. This option can be enabled by setting the CascadingDeletion feature gate to true (#2711).

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 23, 2020
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Sep 22, 2020
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@jhvhs
Contributor

jhvhs commented Oct 27, 2020

/remove-lifecycle rotten
/reopen
/lifecycle frozen

@k8s-ci-robot
Contributor

@jhvhs: Reopened this issue.

In response to this:

/remove-lifecycle rotten
/reopen
/lifecycle frozen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot reopened this Oct 27, 2020
@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. labels Oct 27, 2020
@mrbobbytables

This project is being archived, closing open issues and PRs.
Please see this PR for more information: kubernetes/community#6632
