Proposal: introduce opt-in support for orphaned node destruction #1268

Closed
alekc opened this issue Feb 3, 2022 · 6 comments · Fixed by #1934
Labels
feature New feature or request


@alekc
Contributor

alekc commented Feb 3, 2022

Tell us about your request
Introduce opt-in support for orphaned node destruction (covering nodes both still present in the cluster and outside it).

Tell us about the problem you're trying to solve. What are you trying to do, and why is it hard?
Right now, Karpenter's approach to provisioner deletion is "don't delete provisioned nodes", which is a sensible default in most cases.

However, I believe that some operators do want to ensure that once a provisioner is deleted, all nodes that belong to it are deleted as well.

For example, in our cluster bootstrapping pipeline the approach is as follows:
Create cluster with an ASG consisting of 1 node tainted for Karpenter -> install Karpenter -> install provisioner -> install ArgoCD -> install apps

On cluster destruction, the inverse process does not work, because Karpenter is sometimes deleted before it gets a chance to deal with the empty nodes.
It would be nice if:

  • the provisioner is annotated with a specific spec (something like deleteNodesOnDeletion)
  • the presence of that annotation puts a finalizer on the provisioner itself, and all nodes created from it are labelled with the cluster UID and a flag indicating that the node should not exist without its provisioner
  • during the sync process we also check whether there are any nodes with the labels above that do not have an existing provisioner, and drop them if there are. This should also help if a node deletion happened while Karpenter was down, thus creating an orphan.
  • on provisioner deletion we delete all related nodes before removing the finalizer, thus maintaining a proper destruction and dependency flow (a rough sketch of this flow follows the list)
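
To make the intent a bit more concrete, here is a minimal sketch of what the opt-in and the orphan check could look like from the operator's side. The deleteNodesOnDeletion annotation is purely hypothetical (it is part of this proposal, not an existing Karpenter API), while karpenter.sh/provisioner-name is the label Karpenter already applies to the nodes it launches:

```sh
# Hypothetical opt-in: mark the provisioner so that its nodes should not outlive it
# (the annotation key is part of this proposal, not an existing Karpenter API).
kubectl annotate provisioner default karpenter.sh/deleteNodesOnDeletion=true

# Rough equivalent of the proposed orphan check: for every node that carries
# the provisioner-name label, delete it if the referenced provisioner is gone.
for node in $(kubectl get nodes -l karpenter.sh/provisioner-name -o name); do
  prov=$(kubectl get "$node" -o jsonpath='{.metadata.labels.karpenter\.sh/provisioner-name}')
  if ! kubectl get provisioner "$prov" >/dev/null 2>&1; then
    # Karpenter's termination finalizer (if the controller is still running)
    # handles cordon/drain and instance termination once the node object is deleted.
    kubectl delete "$node"
  fi
done
```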

Are you currently working around this issue?
I have to run aws ec2 describe-instances --filters Name=instance-state-name,Values=running Name=tag:eks:cluster-name,Values=${TF_VAR_CLUSTER_NAME} Name=tag-key,Values=karpenter.sh/provisioner-name --query "Reservations[*].Instances[*].InstanceId" --output text | xargs aws ec2 terminate-instances --instance-ids || true to delete all orphaned instances

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment
@alekc alekc added the feature New feature or request label Feb 3, 2022
@ellistarn
Contributor

Thanks for writing this up!

My first concern:

When the cluster is being torn down, what happens if the Karpenter process becomes permanently unavailable because its ASG is already gone? In that case there is nothing left to execute the node termination logic (including the EC2 terminate-instances call).

@alekc
Contributor Author

alekc commented Feb 3, 2022

Isn't it kind of the same right now? (Assuming that if we do terminate the node Karpenter is running on, dependent nodes won't be deleted due to the presence of the finalizer, which prevents the destruction of the cluster.)

This feature would be opt-in, and I assume it would be used by people with a strong IaC pipeline (with great power, etc.), so hopefully they would have a proper order of execution during the destruction stage.

@ellistarn
Contributor

I assume that it would be used by people with strong IAC pipeline

Wouldn't it be possible to orchestrate cleanup against EC2 directly (e.g. your workaround command)?

I love the idea of Karpenter handling this on uninstall, but unless Karpenter install/uninstall is codified more explicitly and/or run outside of the cluster it operates on, I don't see us being able to provide a robust solution. I've mentioned elsewhere (k8s Slack, I think) that I think this is a great feature for Kubernetes installers (EKS/kops/etc.).

@olemarkus
Contributor

Completely agree this belongs with installers.

kops already deletes all instances on cluster deletion, including Karpenter-managed instances. Right now it doesn't delete instances on provisioner deletion, but that is a trivial thing to implement and an expected feature, given that we already support this with ASGs.

@ellistarn
Contributor

ellistarn commented Feb 3, 2022

but it's a trivial thing to implement and an expected feature given that we support this with ASG.

We intentionally don't delete nodes on provisioner deletion; we think of provisioners as forward-looking. It's the same reason we don't apply labels to existing nodes if you update the provisioner's labels after a node is launched.

@olemarkus
Contributor

Semantically, users would be deleting the instance group, so it's not entirely the same as deleting the provisioner resource itself. It would be a much more deliberate action with clearer intent.
