
Write proposal for controller pod management: adoption, orphaning, ownership, etc. (aka controllers v2) #14961

Open
bgrant0607 opened this issue Oct 2, 2015 · 37 comments
Labels
area/app-lifecycle lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. sig/apps Categorizes an issue or PR as relevant to SIG Apps.
Comments

@bgrant0607
Member

bgrant0607 commented Oct 2, 2015

Write a comprehensive proposal for how controllers should manage sets of pods. The main goal is to make controller APIs more usable and less error-prone.

We've discussed a number of changes:

We may want to split the following into separate issues:

Changes that would facilitate static work/role assignment:

Long-standing idea to improve security and reusability around templates:

Reusability could also be addressed by:

Also need to make it easier to update existing pods:

@bgrant0607 bgrant0607 added priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. kind/documentation Categorizes issue or PR as related to documentation. team/ux labels Oct 2, 2015
@bgrant0607 bgrant0607 self-assigned this Oct 2, 2015
@bgrant0607
Member Author

A wrinkle: updating a selector in Deployment #14894

@bgrant0607
Member Author

Some use cases for orphaning and/or adoption of pods:

  • Stop the controller and restart it later. For instance, that's the way we pause Deployment at the moment.
  • Update some attributes of the controller that can't be updated by deleting and re-creating it. For instance, renaming the controller, which we do in "simple rolling update".
  • Adoption by a non-backward-compatible API resource.
  • Update labels on existing pods in a way that would be incompatible with the controller: delete the controller, change the pod labels, then create a new controller with a new selector.
  • Bootstrapping. Create pods to start the control plane, then create their controllers.
  • System exploration/learning. Create a pod, then a controller to manage it.
  • Debugging. Change a pod's labels to orphan it so that it can be replaced and debugged out of the critical path.
  • Replacement. Replace a pod generated from the template with a special one, perhaps with special support for profiling, tracing, audit, monitoring, etc.
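The label-selector mechanics behind several of these use cases (debugging, orphaning, replacement) can be sketched in a few lines. This is a toy model using the simplified equality-based form of selectors; all names are illustrative, not API fields:

```python
def matches(selector, labels):
    """Equality-based selector: every selector key/value must appear in the labels."""
    return all(labels.get(k) == v for k, v in selector.items())

def select_pods(selector, pods):
    """The pods a controller would manage: those whose labels match its selector."""
    return [p for p in pods if matches(selector, p["labels"])]

# A controller managing pods labeled app=web.
selector = {"app": "web"}
pods = [
    {"name": "web-1", "labels": {"app": "web"}},
    {"name": "web-2", "labels": {"app": "web"}},
]

# Debugging use case: relabel web-2 to orphan it out of the critical path.
pods[1]["labels"] = {"app": "web-debug"}

managed = select_pods(selector, pods)
# web-2 is now orphaned; the controller sees only one pod and would create a replacement.
```

The same relabeling move also models the "replacement" use case: the special pod simply carries the matching labels while the original is relabeled away.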

@bgrant0607 bgrant0607 changed the title Write proposal for controller pod management: adoption, orphaning, ownership, etc. Write proposal for controller pod management: adoption, orphaning, ownership, etc. (aka controllers v2) Oct 2, 2015
@bgrant0607
Member Author

If we had an imperative API, a controllerRef backpointer could be manipulated to transfer ownership: transfer pods from controller X to controller Y. Orphaning would be somewhat weird in that a dangling controllerRef pointer would need to be left. Otherwise, there would be no attribute to select on for re-adoption. Or one could require transferring ownership before deleting the previous controller.
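An imperative ownership transfer of the kind just described could be modeled like this. This is a hedged toy sketch of the semantics, not the eventual API; the `controller_ref` field name is illustrative:

```python
class Pod:
    def __init__(self, name, labels, controller_ref=None):
        self.name = name
        self.labels = labels
        self.controller_ref = controller_ref  # name of the owning controller, or None

def transfer(pods, src, dst):
    """Imperative transfer: move every pod owned by src to dst."""
    for p in pods:
        if p.controller_ref == src:
            p.controller_ref = dst

def orphan(pod):
    """Orphaning clears the backpointer. Note the problem described above:
    once cleared, nothing but labels identifies this pod for re-adoption."""
    pod.controller_ref = None

pods = [
    Pod("web-1", {"app": "web"}, controller_ref="rc-x"),
    Pod("web-2", {"app": "web"}, controller_ref="rc-x"),
]
transfer(pods, "rc-x", "rc-y")  # pods now belong to rc-y
```

The alternative mentioned above, leaving a dangling `controller_ref` instead of clearing it, would keep a selectable attribute but at the cost of a pointer to a controller that no longer exists.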

With our declarative API, we can't get rid of the labels and selector since those identify which pods should be adopted -- there needs to be some known unique set of attributes to select on.

Will also include nominal services #260, splitting out the template #170, and nominal jobs #14188 in the proposal. See also #12450.

Ref #8190, since this is a "next-gen" API proposal.

@pmorie
Member

pmorie commented Oct 3, 2015

@kubernetes/rh-cluster-infra

@davidopp
Member

davidopp commented Oct 5, 2015

If we had an imperative API, a controllerRef backpointer could be manipulated to transfer ownership: transfer pods from controller X to controller Y. Orphaning would be somewhat weird in that a dangling controllerRef pointer would need to be left. Otherwise, there would be no attribute to select on for re-adoption. Or one could require transferring ownership before deleting the previous controller.

With our declarative API, we can't get rid of the labels and selector since those identify which pods should be adopted -- there needs to be some known unique set of attributes to select on.

Maybe it's a semantic quibble, but I don't think the issue is imperative vs. declarative -- it's implicit vs. explicit encoding of which controller owns a pod. I see them both as declarative.

Do we have any examples/use cases of adoption yet?

I think it would be great if we could allow people to write controllers that only support the explicit model, i.e. don't have a Selector. (The controller would fill in controllerRef for the pods it creates.)
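The explicit, selector-free model suggested here reduces a controller's pod lookup to filtering on the backpointer it stamped onto pods at creation time. A minimal sketch, assuming a `controller_ref` field (an illustrative name, not a real API field):

```python
def owned_pods(controller_name, pods):
    # No selector at all: ownership is exactly the set of pods whose
    # backpointer names this controller.
    return [p for p in pods if p.get("controller_ref") == controller_name]

pods = [
    {"name": "a", "controller_ref": "rc-1"},
    {"name": "b", "controller_ref": "rc-2"},
    {"name": "c", "controller_ref": None},  # orphan: no owner at all
]
mine = owned_pods("rc-1", pods)
```

In this model adoption and accidental selector overlap simply cannot happen, since two controllers can never both "match" the same pod.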

@bgrant0607
Member Author

By imperative, I meant operations like "transfer pods from controller X to controller Y".

Adoption requests: #11209
Other scenarios are described above.

@nikhiljindal
Contributor

/sub

@bgrant0607
Member Author

cc @smarterclayton

@soltysh
Contributor

soltysh commented Nov 9, 2015

/sub

@davidopp
Member

davidopp commented Dec 1, 2015

Having a backpointer from pod to controller will greatly simplify scheduler code; consider, for example, the hoops we jump through to determine whether a pod is controlled by a particular RC in CalculateSpreadPriority() and CalculateAntiAffinityPriority() (both in selector_spreading.go). A backpointer to the service would also be nice (it would simplify CalculateSpreadPriority(), which spreads on both RC and service).
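The simplification being described: with a backpointer, counting a controller's pods per node for spreading becomes a single equality check per pod, instead of matching every pod's labels against every controller's selector. A toy sketch (not the scheduler's actual code; field names are illustrative):

```python
from collections import Counter

def pods_per_node(controller, pods):
    """Count this controller's pods on each node via the backpointer alone."""
    return Counter(p["node"] for p in pods if p.get("controller_ref") == controller)

pods = [
    {"name": "a", "node": "n1", "controller_ref": "rc-1"},
    {"name": "b", "node": "n1", "controller_ref": "rc-1"},
    {"name": "c", "node": "n2", "controller_ref": "rc-1"},
    {"name": "d", "node": "n2", "controller_ref": "rc-2"},
]
counts = pods_per_node("rc-1", pods)  # rc-1 has fewer pods on n2, so prefer n2
```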

@smarterclayton
Contributor

Back pointer to service has other undesirable characteristics -- services would have to mutate pods to reference them, which makes the service controller another thing with effective root access to the cluster. Is the benefit high enough?


@bgrant0607
Member Author

Template proposal: #18215

PetSet proposal: #18016

@bgrant0607 bgrant0607 removed this from the next-candidate milestone Apr 28, 2016
@bgrant0607 bgrant0607 removed their assignment Apr 28, 2016
@bgrant0607
Member Author

For 1.3, we're working on:

  • controllerRef
  • cascading deletion
  • finalizers
  • generation/observedGeneration
  • PetSet
  • Templates
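The generation/observedGeneration item in that list is a small but important convention: the API server bumps metadata.generation on each spec change, and the controller records the generation it last acted on in status.observedGeneration, so clients can tell whether reported status is stale. A minimal sketch of the check, using plain dicts in place of API objects:

```python
def status_is_current(obj):
    """True once the controller has observed (acted on) the latest spec."""
    return obj["status"].get("observedGeneration", 0) >= obj["metadata"]["generation"]

deploy = {"metadata": {"generation": 3}, "status": {"observedGeneration": 2}}
# A spec edit bumped generation to 3, but the controller has only seen 2,
# so any status it currently reports still describes the old spec.
```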

@bgrant0607
Member Author

Note that unique label/selector generation, similar to what we do for Job, will require changes to kubectl expose if we want users to be able to do a rolling update on an RC/RS with auto-generated labels.

#17902
https://groups.google.com/forum/?utm_medium=email&utm_source=footer#!msg/kubernetes-dev/WbqQVNkDZUE/1f5WZK2zCAAJ
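The Job-style approach referenced here stamps a unique, controller-derived label (Job uses its UID under a controller-uid label) into both the selector and the pod template, so each controller selects exactly its own pods without the user inventing labels. A sketch of that generation step, with illustrative helper names:

```python
import uuid

def with_generated_selector(template_labels):
    """Derive a unique selector the way Job does with its controller-uid label."""
    uid = str(uuid.uuid4())
    selector = {"controller-uid": uid}
    # The pod template must carry the generated label too, or the
    # selector would never match the pods created from it.
    pod_labels = dict(template_labels, **selector)
    return selector, pod_labels

selector, pod_labels = with_generated_selector({"app": "batch-worker"})
# selector now matches only pods created from this template, and no others
```

The kubectl expose problem mentioned above follows directly: a user-facing tool has to discover the generated label rather than assume the user-supplied ones.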

@0xmichalis
Contributor

Make controllers aware of namespace termination: #38612

@bgrant0607
Member Author

v1 plan: #42752

cc @kow3ns

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with a /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 4, 2018
@soltysh
Contributor

soltysh commented Jan 16, 2018

/remove-lifecycle stale
/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 16, 2018
@kow3ns kow3ns added this to Backlog in Workloads Mar 1, 2018