
add kep for better pruning in apply #810

Closed
wants to merge 1 commit into from

Conversation

Liujingfang1
Contributor

This KEP proposes a different approach to `kubectl apply --prune`.

fixes kubernetes/kubectl#572

@k8s-ci-robot k8s-ci-robot added kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory sig/architecture Categorizes an issue or PR as relevant to SIG Architecture. sig/cli Categorizes an issue or PR as relevant to SIG CLI. sig/pm cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Feb 4, 2019
@Liujingfang1
Contributor Author

/assign @pwittrock @soltysh @seans3

Contributor

@barney-s left a comment

  1. The conceptualized root object seems to be a grouping construct similar to the Application object (https://github.com/kubernetes-sigs/application), albeit with a different purpose. I would prefer it to be the Application object.

  2. Consider injecting a unique label when applying. Say apply introduces a "label-applied-context" UUID that is the same on all objects in that apply command. This could help with tracking objects that are removed from the directory but still exist on the server side. If the label exists for even one object on the server side, it is reused for all new objects as well.


`kubectl apply` operates on an aggregation of Kubernetes objects. Because `kubectl apply` is widely used in different workflows, users expect it to be able to sync the objects on the cluster with configuration saved in files, directories, or repos. That means that when `kubectl apply` is run on those configurations, the corresponding objects are expected to be correctly created, updated, and pruned.

Currently `kubectl apply --prune` is still in Alpha. It is based on label matching, and users need to be very careful with this command so that they don't delete objects unintentionally. There is a list of [issues](https://github.com/kubernetes/kubectl/issues/572) observed by users. The most noticeable issue happens when changing directories or working with directories that share the same labels: `kubectl apply --prune` from one directory will delete objects from the other directory.
Contributor

The link to the issues is broken. Would you please add the issues you think need to be addressed?

Contributor Author

Here is the issue: kubernetes/kubectl#572. I'll update the link.


`kubectl apply` operates on an aggregation of Kubernetes objects. Because `kubectl apply` is widely used in different workflows, users expect it to be able to sync the objects on the cluster with configuration saved in files, directories, or repos. That means that when `kubectl apply` is run on those configurations, the corresponding objects are expected to be correctly created, updated, and pruned.

Currently `kubectl apply --prune` is still in Alpha. It is based on label matching, and users need to be very careful with this command so that they don't delete objects unintentionally. There is a list of [issues](https://github.com/kubernetes/kubectl/issues/572) observed by users. The most noticeable issue happens when changing directories or working with directories that share the same labels: `kubectl apply --prune` from one directory will delete objects from the other directory.
Contributor

Is supporting working across directories an explicit goal?
If a user has conflicting labels across directories, isn't that a user error?

Contributor Author

Is supporting working across directories an explicit goal?

Yes

If a user has conflicting labels across directories, isn't that a user error?

It could be, but conflicting labels are sometimes not easy to discover. The point here is that we need to provide a solution in kubectl, which will avoid errors that are easy to make and hard to detect.


User error can be challenging to avoid. It is hard to differentiate whether an apparent error is really an error or user intent. IMO, we should at least clearly explain how prune works for some common user errors.



### Root Object
To solve the first challenge, the concept of a root object is proposed. A root object is a ConfigMap object that contains the GVKN information for all the objects that are to be applied. The root ConfigMap objects should be created by other tools such as kustomize or Helm. When generating resources to be consumed by kubectl apply, those tools are responsible for generating the associated root ConfigMap object to take advantage of better pruning.
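For illustration, here is a minimal sketch of what such a root ConfigMap could look like. The entry format follows the example shown later in this proposal; the names and namespaces are hypothetical placeholders rather than anything mandated by the KEP:

```
# Sketch only: a root ConfigMap emitted by a tool such as kustomize or Helm.
# Each data entry records one object to be applied, as namespace:group:version:kind:name.
apiVersion: v1
kind: ConfigMap
metadata:
  name: root-object-example   # a generating tool would pick a predictable name
  namespace: default
data:
  object1: namespace1:group1:version1:kind1:somename
  object2: namespace1:group2:version2:kind2:secondname
```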
Contributor

Could we leverage https://github.com/kubernetes-sigs/application for this?

Contributor

I would prefer a resource 100% of clusters are guaranteed to have installed.

Contributor

ack

Member

Another issue is that apply isn't just for applications. It's for arbitrary resource types and groupings.

The presence of a root object indicates that apply needs to handle pruning. When kubectl apply is given such a root object, it looks for the live root object with the same name.
- If a live root object is not found, kubectl will create the root object and the corresponding set of objects
- If a live root object is found, then the set of live objects is derived by reading the data field of the live root object. Kubectl compares the live set with the new set of objects and takes the following steps:
- set the root object data to the union of the two sets and update the root object
Contributor

Is this a client-side operation (kubectl) or a server-side one?

Contributor Author

This is kubectl. There are no server-side actions.



### Root Object
To solve the first challenge, the concept of a root object is proposed. A root object is a ConfigMap object that contains the GVKN information for all the objects that are to be applied. The root ConfigMap objects should be created by other tools such as kustomize or Helm. When generating resources to be consumed by kubectl apply, those tools are responsible for generating the associated root ConfigMap object to take advantage of better pruning.

ConfigMap is namespace-scoped; does that mean --prune can only be used for deleting namespace-scoped objects?

Contributor Author

No, --prune will be used for all objects. ConfigMap is chosen since it has a data field that can hold arbitrary data.

Member

I think the question is, what namespace will the ConfigMap live in for non-namespaced objects?

data:
  object1: namespace1:group1:version1:kind1:somename
  object2: namespace1:group2:version2:kind2:secondname
  object3: namespace3:group3:version3:kind3:thirdname

As ConfigMap is namespace-scoped, is the namespace still required in each entry? From the example, is it allowed to specify objects in a different namespace from the ConfigMap?

If we allow the root object to specify resources across multiple namespaces, or even cluster-scoped resources, it would be better to use a cluster-scoped CRD instead of a namespace-scoped ConfigMap. But using a cluster-scoped CRD requires additional care in setting up the correct RBAC to allow user tools to work properly (limiting access at the cluster level is more complicated and requires more care than within a namespace).

Contributor Author

The namespace of the root object itself can be different from the namespaces listed in data, so the namespace is required in each entry.

Using a CRD requires that the CRD is installed in the cluster. In addition, a CRD-based approach doesn't work with older cluster versions.

```

The presence of a root object indicates that apply needs to handle pruning. When kubectl apply is given such a root object, it looks for the live root object with the same name.
- If a live root object is not found, kubectl will create the root object and the corresponding set of objects
@easeway Feb 4, 2019

Does this imply the existing behavior of --prune when the root object is missing? Would it be safer if the root object were always required?

Contributor Author

@Liujingfang1 Feb 4, 2019

Does this imply the existing behavior of --prune when the root object is missing?

When the root object is missing, kubectl will skip the pruning step and no objects are deleted. Since the proposed solution is quite different from the existing behavior, I don't think it's good to keep both of them.

Would it be safer if the root object were always required?

It could be. If the root object is not present, no objects are deleted at apply time, and eventually the cluster will need a manual cleanup. If the user has a good cleanup strategy, it is safe for them to do so. Otherwise, using a root object every time is better.

- set the root object data to the union of the two sets and update the root object
- create new objects
- prune old objects
- remove the pruned objects from the root object's data field
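To make the handling of the new and old sets concrete, here is a small worked example with hypothetical entries (not taken from the KEP itself):

```
# Live root object data from the previous apply (old set): objects a and b.
# The new apply (new set) contains objects a and c.
#
#   1. Update the live root object data to the union {a, b, c}.
#   2. Create c, which exists only in the new set.
#   3. Prune b, which exists only in the old set.
#   4. Remove b from the root object data, leaving {a, c}:
data:
  object1: namespace1:group1:version1:kind1:a
  object2: namespace1:group2:version2:kind2:c
```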

The new objects and old objects are confusing to me (sorry if I missed something). My understanding is that the actual objects are used to construct one set, and those defined in the root object's data field are used to construct the other set. Which one is treated as the new set and which one as the old set?

Contributor Author

The new set is the set of objects to be applied. The old set is the set of objects running on the cluster, which were applied previously.

The new objects are the child objects that exist only in the new set; the old objects are the child objects that exist only in the old set.


I see why the resource name is needed for combining the new and old sets. If we want to integrate with the App CRD, will the extra listing operation (based on GK details, assuming all underlying resources are within the same namespace as the App CR) be expensive?

As synced in person, users may not know the names of all underlying resources, but we can leverage controllers to generate App CRs, so supporting names in the App CRD is a viable option too (i.e., I agree with you).

labels:
  kubectl.kubernetes.io/applyOwner: v1:ConfigMap:predictable-name-foo
```
The presence of this label confirms that a live object is safe to delete when it is selected for pruning.

So every object must have this label to be associated with the root object, and thus the tool is able to construct the new set and the old set? I guess I understand how the new set and old set are constructed.

But I still feel it would be safer if the root object had to be consistent with the actual objects in the system before --prune is allowed to proceed. The tools creating child objects are responsible for updating the root object to keep it consistent. Otherwise it's still possible to make mistakes.

Contributor Author

But I still feel it would be safer if the root object had to be consistent with the actual objects in the system before --prune is allowed to proceed. The tools creating child objects are responsible for updating the root object to keep it consistent. Otherwise it's still possible to make mistakes.

Under this proposal, kubectl will assume the root object is consistent with the actual objects. I agree with you that the tools are responsible for making sure this is true.

The objects passed to apply will have labels or annotations pointing back to the root object for the same apply.
```
labels:
  kubectl.kubernetes.io/applyOwner: v1:ConfigMap:predictable-name-foo
Contributor

Won't work: ConfigMap names can be longer than the allowed label length.

You’ll need to use an annotation if you want to do data like this.

Member

iiuc the plan would be to auto-generate a fixed-length name for the "root object" based on (I'm guessing) the hash of its contents and deal with collisions the same way we do with ReplicaSets. We could mandate that implementations ensure that the name of the ConfigMap for a "root object" is no greater than the allowable 63 chars.

Contributor Author

Thank you for pointing that out. I'll update it to use an annotation.
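For illustration only, a child object carrying the proposed owner reference as an annotation (per the thread above) might look like the following; the key and value format mirror the snippet in the diff, the Deployment itself is a hypothetical placeholder, and its spec is omitted:

```
apiVersion: apps/v1
kind: Deployment
metadata:
  name: some-child-object      # placeholder name
  namespace: namespace1
  annotations:
    # points back to the root object of the apply that created this object
    kubectl.kubernetes.io/applyOwner: v1:ConfigMap:predictable-name-foo
```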

- Find out the correct objects to delete. When a directory is applied after some updates, kubectl loses track of the objects that were previously applied from the same directory.
- Delete objects at the proper time. Even assuming the set of objects to be deleted is available, when to delete them is another challenge. Those objects could still be needed while other objects are being rolled out.

In this design, we focus on a solution for the first challenge. For the second challenge, we can blacklist certain types such as ConfigMaps and Secrets from deletion. In general, deleting objects at the proper time is better addressed by garbage collection.
Contributor

There are plenty of other extension types now like configmap and secret. How do they get blacklisted?

Member

iiuc the objects that will participate in the prune must also be whitelisted by having them select the root object and by including them in the root object by GVKN? Does this proposal allow users to create objects (e.g. ConfigMaps and Secrets) that will not participate in pruning? If so, are you really saying that you will (in addition to requiring whitelisting) allow for the implementation of explicit blacklisting in the future?

Contributor Author

We can also add the blacklisted types into data to allow customizing the blacklist:

data:
  object1: ...
  object2: ...
  object3: ...
  blacklist1: namespace1:group1:version1:kind1
  blacklist2: namespace2:group2:version2:kind2

Member

Instead of blacklisting, would it work to only perform deletion after the rollout has completed, if we could figure out how to detect that? I can imagine other resources being dependencies that shouldn't be deleted until a rollout is complete.

@smarterclayton

There are plenty of other extension types now like configmap and secret. How do they get blacklisted?

I believe the issue with ConfigMaps and Secrets that is being addressed is due to two factors: 1) some tools generate them based on the contents of another file, so they are likely to be replaced by a new instance without the user explicitly deleting them, and 2) the old instances will be referenced by Pods that may not have been rolled out during a rolling update. Pruning them would then delete them before these Pods stopped using them.

Are you thinking that this issue or other issues might apply to extension or other types?

Member

nm, I think we could do this without whitelisting.

Contributor

If someone has multiple applications (think of it as a term for grouping) in a namespace, will a prune of application A cause the assets for application B to be deleted? Unless they know about each other and have them blacklisted from pruning?

Contributor

Ah, never mind... I see the diff discussed elsewhere. Ignore the previous comment.

The objects passed to apply will have labels or annotations pointing back to the root object for the same apply.
```
labels:
  kubectl.kubernetes.io/applyOwner: v1:ConfigMap:predictable-name-foo
Member

iiuc the plan would be to auto-generate a fixed-length name for the "root object" based on (I'm guessing) the hash of its contents and deal with collisions the same way we do with ReplicaSets. We could mandate that implementations ensure that the name of the ConfigMap for a "root object" is no greater than the allowable 63 chars.

```
The presence of this label confirms that a live object is safe to delete when it is selected for pruning.

### Risks and Mitigations
Member

I think this operation is idempotent. If it is, you should call that out. If it isn't, you should explain where multiple applications may lead to a different result. I can't, off the top of my head, think of any circumstance where an intermittent failure can't be corrected by subsequent invocations of the command on the same input. This seems to be the desired intent of the design. If so, you should call this out specifically. If not, it deserves careful thought.

Contributor Author

I was considering a scenario where an object belonging to some root object was updated/applied by other means (either manually, or because its name conflicts with an object belonging to another root object). In that case, the label/annotation will be different. This gives us a signal that the object has been touched by others, or is owned by others, and it may not be safe to delete it.


Would a UUID be a solution here?

- Find out the correct objects to delete. When a directory is applied after some updates, kubectl loses track of the objects that were previously applied from the same directory.
- Delete objects at the proper time. Even assuming the set of objects to be deleted is available, when to delete them is another challenge. Those objects could still be needed while other objects are being rolled out.

In this design, we focus on a solution for the first challenge. For the second challenge, we can blacklist certain types such as ConfigMaps and Secrets from deletion. In general, deleting objects at the proper time is better addressed by garbage collection.
Member

iiuc the objects that will participate in the prune must also be whitelisted by having them select the root object and by including them in the root object by GVKN? Does this proposal allow users to create objects (e.g. ConfigMaps and Secrets) that will not participate in pruning? If so, are you really saying that you will (in addition to requiring whitelisting) allow for the implementation of explicit blacklisting in the future?



### Root Object
To solve the first challenge, the concept of a root object is proposed. A root object is a ConfigMap object that contains the GVKN information for all the objects that are to be applied. The root ConfigMap objects should be created by other tools such as kustomize or Helm. When generating resources to be consumed by kubectl apply, those tools are responsible for generating the associated root ConfigMap object to take advantage of better pruning.
Member

You don't need V. GKN is sufficient. The version of the resource can be selected by the client and available versions can be discovered dynamically.

Contributor Author

Thank you for pointing that out. I'll update it.

### Challenges
The challenges of pruning mainly come from two aspects.
- Find out the correct objects to delete. When a directory is applied after some updates, kubectl loses track of the objects that were previously applied from the same directory.
- Delete objects at the proper time. Even assuming the set of objects to be deleted is available, when to delete them is another challenge. Those objects could still be needed while other objects are being rolled out.
Contributor

Could you give an example of this scenario?

Contributor Author

One example is a ConfigMap object being referenced by a Deployment object. When a rolling update happens, new Pods refer to the new ConfigMap while old Pods are still running and referring to the old ConfigMap. The old ConfigMap can't be deleted until the rolling update successfully finishes.



### Child Objects
The objects passed to apply will have labels or annotations pointing back to the root object for the same apply.
Member

This seems to require higher-level tools to generate a unique string to use as the ConfigMap name. If you assume that such a string has already been generated and given to kubectl, couldn't you then add that unique string in a label/annotation on every object created during kubectl apply, and then do label-based pruning like in alpha, except that the user does not manually provide a label selector? That is, we instead generate a selector based on this unique string we've assumed to exist, and which identifies a particular "apply group" as defined by the user or user's higher-level tool.

What are the additional benefits to maintaining an explicit, denormalized list of live objects in a ConfigMap? Is it just concern over selecting matching objects across all resources and namespaces? That may be enough justification, but it would be good to make that trade-off clear if so.

Contributor Author

What are the additional benefits to maintaining an explicit, denormalized list of live objects in a ConfigMap? Is it just concern over selecting matching objects across all resources and namespaces? That may be enough justification, but it would be good to make that trade-off clear if so.

If we use a unique label to find the objects, as in alpha --prune, we have to query all namespaces and all types to find the matching objects. With a list of objects, the query step is no longer necessary.

- it can find the correct set of objects to delete
- it can avoid messing things up when changing directories

### Non-Goals
Member

For people who currently use apply without --prune, I imagine it might be common to do something like:

```
kubectl apply -f dir/
# later ...
cd dir
edit fileA.yaml
kubectl apply -f fileA.yaml
```

If the user wants to start using the new prune feature, is this workflow still supported? Or is it a non-goal to support this, and the user will need to always kubectl apply -f dir (the whole directory)?

Contributor Author

It will be supported.
By kubectl apply -f fileA.yaml, I guess the user only wants to update the resources in fileA.yaml.
Let's consider apply with pruning.

```
kubectl apply --prune -f dir/        # if there is a root object, it prunes obsolete objects
# later ...
cd dir
edit fileA.yaml
kubectl apply --prune -f fileA.yaml  # if there is a root object, it triggers pruning; assuming the root object doesn't change, no object is deleted
```

@Liujingfang1
Contributor Author

Liujingfang1 commented Feb 6, 2019

Thanks for all your comments. I'd like to address a common one here: why not use an Application object as the root object?

There are some overlaps between the purpose of a root ConfigMap object and an Application object:

  • Both are used as a root object and tend to link to a set of child objects
  • Both are aimed at supporting better cleanup
  • Both are supposed to work with other tools

Regardless of the difference between a core type and a CRD, or the existence of other spec fields or status, the major difference between a root ConfigMap object and an Application object is how they are connected to the child objects.

  • a root ConfigMap object lists all the child objects explicitly
    data:
     object1:  namespace1:group1:kind1:somename
     object2:  namespace1:group2:kind2:secondname
     object3:  namespace3:group3:kind3:thirdname
    
  • an Application object selects child objects by matching labels for specified types
    spec:
      selector:
        matchLabels:
         app.kubernetes.io/name: "wordpress-01"
      componentKinds:
      - group: core
        kind: Service
      - group: apps
        kind: Deployment
      - group: apps
        kind: StatefulSet
    

In this design, we avoid selecting objects by labels when dealing with prune. It relies on users/tools to produce the explicit list of child objects and write it to a root ConfigMap object, so using an Application object with a label selector doesn't fit this purpose. However, if Application objects can list the child objects explicitly, they can act the same as a ConfigMap root object. The API would need to be updated to include the child object names, such as

spec:
  selector:
    matchLabels:
     app.kubernetes.io/name: "wordpress-01"
  componentKinds:
  - group: core
    kind: Service
    name: app-service
  - group: apps
    kind: Deployment
    name: app-deployment
  - group: apps
    kind: StatefulSet
    name: app-statefulset

By looking at the Application object itself, kubectl can get the set of child objects. I noticed that the Application CRD may need owner references assigned to make sure garbage collection can clean up child objects, but that part needs to be handled by a controller. If the Application CRD can include child object names, kubectl apply --prune will clean up child objects from the client side by this design.

@barney-s, @kow3ns Let me know what you think about this. Thank you.

@k8s-ci-robot
Contributor

@Liujingfang1: Cannot trigger testing until a trusted user reviews the PR and leaves an /ok-to-test message.


Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

- add functions to identify the root object
- add a different code path to handle the case when a root object is present
- add unit tests
- add integration tests
@crimsonfaith91 Feb 7, 2019

Should we prioritize end-to-end tests rather than integration tests? The integration tests are a must-have, but I think they do not test the real-world situation.

Contributor Author

Yes, you're right. I'll update it.

  object3: namespace3:group3:version3:kind3:thirdname
```

The presence of a root object indicates that apply needs to handle pruning. When kubectl apply is given such a root object, it looks for the live root object with the same name.
@crimsonfaith91 Feb 7, 2019

Do you mean: "When kubectl apply is given the name of a root object"?

For example: kubectl apply -f --prune -r <some-root-object-name> (adding a -r flag for inputting the root object name)

Contributor Author

We don't introduce new flags for this. It will be kubectl apply --prune -f. As long as a ConfigMap object with the proper label is present in the resources passed by -f, kubectl will treat it as a root object.


Got it, thanks!


Deciding the correct set of objects to change is hard. `--prune` Alpha tried to solve that problem in kubectl itself. In this new approach, kubectl is not going to decide that set. Instead it delegates the job to users or tools. Those who need to use `--prune` will be responsible for creating a root object as a record of the correct set of objects. Kubectl consumes this root object and triggers the pruning action.

This approach requires users/tools to do more work than just specifying a label, which might discourage the usage of `--prune`. On the other hand, existing tools such as Kustomize and Helm can utilize this approach by generating a root object when expanding or generating the resources.


nit: uage to usage

@crimsonfaith91 Feb 7, 2019

IMO, requiring creation of a root object may discourage most users. Furthermore, users may set up the root object incorrectly (e.g., missing some intended objects).

Do Kustomize and/or Helm support old clusters? If so, we may direct users to them when they want to perform --prune.

Contributor Author

Kustomize doesn't talk to the server. It only expands the resources and passes the output YAMLs to kubectl apply. Helm has its own CLI, and I believe it supports older cluster versions.


Noted. Should we also consider integrating --prune with Helm? If not, is there any reason why not? Thanks!

Contributor Author

Not direct integration. We should provide a pruning solution in kubectl that can be easily leveraged by other tools such as Helm.

@pwittrock
Member

@Liujingfang1

We may need to restrict this to cases where all the resources are in the same namespace, or create 1 configmap per namespace.

An alternative to using a ConfigMap is to store the same data, but in an annotation on one of the objects. This object would need to be the "root object", and couldn't be deleted without changing the root object first.
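A hedged sketch of this alternative, with a purely illustrative annotation key and encoding (nothing here is proposed by the KEP itself): one of the applied objects doubles as the root object and carries the object list.

```
# Sketch only: the object list lives in an annotation on one applied object
# instead of in a separate ConfigMap; that object then acts as the root object.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: some-root-deployment            # placeholder; this object is the "root object"
  namespace: namespace1
  annotations:
    # hypothetical key; the value mirrors the data entries of the root ConfigMap
    kubectl.kubernetes.io/applySet: |
      object1: namespace1:group1:version1:kind1:somename
      object2: namespace1:group2:version2:kind2:secondname
```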

@Liujingfang1
Contributor Author

We may need to restrict this to cases where all the resources are in the same namespace, or create 1 configmap per namespace.

@pwittrock Creating one ConfigMap per namespace seems unnecessary. Is it to keep different namespaces isolated? Considering a kubectl apply, resources could be in the same namespace, in different namespaces, or even include non-namespaced resources. Which cases are more likely? If we restrict to one case, pruning won't support the other cases.
What about putting all ConfigMap root objects in a dedicated namespace such as kube-apply?

@Liujingfang1
Contributor Author

Liujingfang1 commented Feb 7, 2019

An alternative to using a ConfigMap is to store the same data, but in an annotation on one of the objects. This object would need to be the "root object", and couldn't be deleted without changing the root object first.

@pwittrock Storing data in annotations looks good, but I don't think we can drop the root ConfigMap object completely. It helps us find the live root object. This alternative works for Application, since the Application CR serves as the root object and it's not likely to be deleted while there are still other resources. For manifests without Application, we still need an extra object, such as a ConfigMap object, to act as the root object. If a user randomly picks one object from the resources as the root, it's easy to forget to reassign the root object label.

@ekimekim

ekimekim commented Feb 8, 2019

We're currently using prune in combination with some simple internal tooling for generating resources. Our tool automatically labels the resources with a unique label associated with that group of generated resources (similar to a helm "release name"), then uses kubectl apply --prune to update existing resources and delete any with the same release label that don't exist in the updated set of resources.
We also use this label for other tasks like finding the existing set of objects and displaying a diff.

Under this proposal, if I understand correctly, we would need to additionally create a root object Configmap listing all our resources, and for getting the set of existing objects we would have to parse the server's current copy of that same configmap then look up each object explicitly.

This isn't unreasonable, but it's a much more complex implementation than the previous "add a label", and is harder for operators to inspect by hand using kubectl commands (as they can no longer simply search on a label).
It's also much harder to reason about.

For example, suppose I have two root configmaps A and B, and they both list the same object. Perhaps due to user error, or perhaps because the object was previously generated in release A and we wish to migrate it to release B (this is a scenario that is admittedly rare, but we have faced in practice). If I first apply B, so the object's root-object label now points to configmap B, what happens if I subsequently apply A with that object removed? Under the old system, the answer is easy: The labels on the object have changed, so it's no longer part of the A release, so it's unaffected. But under the new system, I have no idea. It depends on the implementation.

You could get back the same behaviour as the old system by always checking that the object points at the root configmap you're applying, but my point is that now we're dealing with duplicated state that can get out of sync and you need to start adding caveats and explanations to what should be a simple operation: Delete any matching things that I don't explicitly list.

I realise that there's a desire to make pruning more useful when kubectl applying whole directories, but I don't understand the benefit to the configmap approach when we could backwards-compatibly tie it into the existing system simply by generating a unique label (which we would need to do anyway to produce a unique root configmap object name) and matching on it, as suggested in one of the comments above. The cost of doing a label query over all namespaces and types is mentioned, but I don't think the drastically increased complexity and number of edge cases is worth the minor performance benefit.

@hazcod

hazcod commented Mar 29, 2020

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 29, 2020
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 27, 2020
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 27, 2020
@hazcod

hazcod commented Jul 27, 2020

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Jul 27, 2020
@stefanprodan

In case someone needs an alternative to the current pruning implementation, I made a kubectl plugin that prunes builtin resources as well as custom resources: https://github.com/stefanprodan/kustomizer

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 25, 2020
@tonybenchsci

What's the latest on kubectl --prune? We'd like to use this in combination with kustomize.

@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Dec 3, 2020
@eestewart

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Dec 17, 2020
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 17, 2021
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Apr 16, 2021
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closed this PR.


Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
