Feature Request: enable user-managed Pod Migration #43405

Open
erictune opened this issue Mar 20, 2017 · 9 comments
Labels
lifecycle/frozen: Indicates that an issue or PR should not be auto-closed due to staleness.
sig/apps: Categorizes an issue or PR as relevant to SIG Apps.
sig/scheduling: Categorizes an issue or PR as relevant to SIG Scheduling.

Comments

@erictune
Member

A user wants to extend Kubernetes to allow for application-specific migration in response to pod deletion events, whenever possible.

  1. Normally, there should be 1 instance of a pod -- call it pod-1.
  2. However, when something (usually the system, e.g. the rescheduler or a node upgrade) wants to delete pod-1, a replica, pod-2, should be created.
  3. Before pod-1 is actually terminated, it will discover pod-2 and they will do an application-level handoff of state.
  4. After handoff, scale down to just 1 pod, for economy.

This issue is created to suggest possible ways to implement this pattern.

@erictune
Member Author

Possible approach 1:

  • Use a StatefulSet for this pod, initial size 1, plus a headless Service.
  • When pod-1 gets a graceful termination notice, it should call back to the API and scale its StatefulSet up to size 2, then wait to discover pod-2 via DNS, then sync with it, then exit (see the sketch at the end of this comment).
  • When a new instance of pod-1 is later created, sync the other way, and scale back down.

Advantages of this approach:

  • Can be implemented today without Kubernetes changes, using basically any script or program that can scale up, scale down, and discover the peer via DNS.

Drawbacks to this approach:

  • Requires migrating twice in order to get the StatefulSet back down to size 1, when in principle only one migration is needed.
  • Requires authorizing the pod to scale its own controller. Might be undesirable for some security-sensitive applications.
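A minimal sketch of the scale-up-on-termination step, assuming client-go, in-cluster credentials, and placeholder names (StatefulSet "myapp" in namespace "default", headless Service "myapp-headless", so the peer replica would be "myapp-1"). The actual state handoff is application-specific and only marked with a comment here:

```go
// Sketch only: scale our own StatefulSet to 2 on SIGTERM, then wait for the
// peer to show up in DNS before handing off state and exiting.
package main

import (
	"context"
	"log"
	"net"
	"os"
	"os/signal"
	"syscall"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func main() {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		log.Fatal(err)
	}
	client, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		log.Fatal(err)
	}

	// Block until the kubelet sends the graceful-termination signal.
	sigs := make(chan os.Signal, 1)
	signal.Notify(sigs, syscall.SIGTERM)
	<-sigs

	ctx := context.Background()
	sts := client.AppsV1().StatefulSets("default")

	// Scale the StatefulSet from 1 to 2 so a peer replica gets created.
	scale, err := sts.GetScale(ctx, "myapp", metav1.GetOptions{})
	if err != nil {
		log.Fatal(err)
	}
	scale.Spec.Replicas = 2
	if _, err := sts.UpdateScale(ctx, "myapp", scale, metav1.UpdateOptions{}); err != nil {
		log.Fatal(err)
	}

	// Discover the peer via the headless Service's per-pod DNS record.
	peer := "myapp-1.myapp-headless.default.svc.cluster.local"
	for {
		if _, err := net.LookupHost(peer); err == nil {
			break
		}
		time.Sleep(2 * time.Second)
	}

	// Application-level state handoff to the peer goes here; exit afterwards.
	log.Printf("peer %s is up; hand off state, then exit", peer)
}
```

For this to work the pod needs RBAC permission on statefulsets/scale (the second drawback above) and a terminationGracePeriodSeconds long enough for the handoff to finish.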

@erictune
Member Author

erictune commented Mar 20, 2017

Possible approach 2:

  • Write an "operator", in the style of https://github.com/upmc-enterprises/elasticsearch-operator, that implements a "migratingPod" concept with a pod template.
  • It makes one pod with a random name, say pod-32jdg
  • When that pod gets a graceful termination notice, it waits for a peer to appear.
  • When the operator sees a deletion timestamp on pod-32jdg, it creates a replacement such as pod-m2k87, using the same labels (see the sketch at the end of this comment).
  • Both pods discover each other, either by watching DNS + using a headless service, or by direct message from the operator. They initiate migration.
  • pod-32jdg exits gracefully when migration is done.
  • A regular service in front of both of them can provide a stable name as the pods go through different random names. The operator could even manage endpoints directly if it wants to control the moment of handoff between the two instances (modulo pre-existing connections).

Advantages of this approach:

  • Can be implemented today without Kubernetes changes.
  • Only one migration is needed; a regular service in front provides a stable name throughout, as noted above.

Drawbacks to this approach:

  • Writing an operator is more complex than Approach 1, but not too bad.
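A minimal sketch of the operator's core loop, using a bare client-go watch rather than an operator framework. The namespace, the "app=migrating" label selector, and the GenerateName prefix are placeholders; peer discovery and the actual handoff stay inside the application as described above:

```go
// Sketch only: when a labeled pod gets a deletion timestamp, create a
// successor pod with the same labels and a new random name.
package main

import (
	"context"
	"log"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/watch"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func main() {
	cfg, err := rest.InClusterConfig()
	if err != nil {
		log.Fatal(err)
	}
	client, err := kubernetes.NewForConfig(cfg)
	if err != nil {
		log.Fatal(err)
	}

	ctx := context.Background()
	pods := client.CoreV1().Pods("default")

	// Watch only the pods this hypothetical "migratingPod" operator owns.
	w, err := pods.Watch(ctx, metav1.ListOptions{LabelSelector: "app=migrating"})
	if err != nil {
		log.Fatal(err)
	}

	replaced := map[string]bool{} // terminating pods we already replaced

	for ev := range w.ResultChan() {
		pod, ok := ev.Object.(*corev1.Pod)
		if !ok || ev.Type != watch.Modified {
			continue
		}
		if pod.DeletionTimestamp == nil || replaced[pod.Name] {
			continue
		}

		// The old pod is terminating gracefully: create its successor now,
		// reusing the labels so both sit behind the same Service.
		spec := *pod.Spec.DeepCopy()
		spec.NodeName = "" // let the scheduler place the successor
		successor := &corev1.Pod{
			ObjectMeta: metav1.ObjectMeta{
				GenerateName: "migrating-",
				Labels:       pod.Labels,
			},
			Spec: spec,
		}
		if _, err := pods.Create(ctx, successor, metav1.CreateOptions{}); err != nil {
			log.Printf("creating successor for %s: %v", pod.Name, err)
			continue
		}
		replaced[pod.Name] = true
		log.Printf("created successor for terminating pod %s", pod.Name)
	}
}
```

A real operator would use an informer with resync instead of a bare watch, own a "migratingPod" custom resource holding the pod template, and handle restarts of its own process; this only shows the replace-on-deletion-timestamp step.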

@davidopp added the sig/apps and sig/scheduling labels Mar 21, 2017
@0xmichalis
Contributor

0xmichalis commented Mar 23, 2017

Approach 1 seems like a custom strategy: #14510

cc: @kubernetes/sig-apps-feature-requests

@dhilipkumars

This proposal can simplify both approaches. I believe future operators can become lightweight if we allow a more elaborate cleanup mechanism.

@bgrant0607
Member

Ref #3949

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot added the lifecycle/stale label Jan 29, 2018
@kow3ns added this to Backlog in Workloads Feb 27, 2018
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten
/remove-lifecycle stale

@k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label Feb 28, 2018
@kow3ns added the lifecycle/frozen label and removed the lifecycle/rotten label Mar 3, 2018
@kow3ns removed this from Backlog in Workloads Mar 3, 2018
@kow3ns added this to Backlog in Workloads via automation Mar 3, 2018
@kow3ns moved this from Backlog to Freezer in Workloads Mar 3, 2018
@xtchenhui

@erictune have you tried approach 2? I'm actually investigating a similar method now, to migrate runv containers from one node to another.

@krmayankk added this to Enterprise Readiness in Technical Debt Research Oct 21, 2019
@ashish-billore
Contributor

Related: #3949
