Author a node e2e test that verifies live-restore functionality in docker #38303
I did some verification on docker 1.12 rc-x a while back, and ran into some issues.
@dchen1107 We recently got live-restore working even when nested within another container, so let us know if you run into any issues. We had an issue where the shim process was in the same cgroup as the docker daemon, so a reload was taking it down. The workaround was to put it in another cgroup. For k8s it would make sense to keep the shim processes in the pod cgroup.
@dchen1107 -- would love to know more about what you found if you are able to share/recall. We plan to enable the feature and do some of our own testing soon, and will report back what we find.
@mrunalp If not the dind case, then we don't need to put the shim in the pod cgroup, right? I'm not sure the shim belongs in the pod cgroup... And which QoS tier do you think the shim belongs to? Always the same as the pod?
@hodovska on my team is going to get some testing in place with this feature enabled to see where things break down. I think an option to node e2e that runs an optional test for the scenario where docker has this configured will help the broader community.
/sig node
FWIW live restore makes a big positive impact on reliability for us. It'd be nice to prioritise making it official for k8s. In the past we had to restrict node sizes and such due to the instability of docker. We've been on live restore for the past 6mo or so, and I can't imagine living without it. We haven't noticed anything majorly bad (apart from kubelet complaining a lot after docker restarts).
/close
Newer versions of docker have the ability to keep containers alive during daemon downtime.
See: https://docs.docker.com/engine/admin/live-restore/
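For reference, live restore is enabled through the daemon configuration — a minimal sketch, assuming the default config path `/etc/docker/daemon.json`:

```json
{
  "live-restore": true
}
```

The daemon picks this up on a configuration reload (e.g. `sudo systemctl reload docker`, which sends SIGHUP) without restarting running containers.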
For kubelets that integrate with the docker runtime, we should author a node e2e test that verifies that enabling this feature works as expected in kubelet, and that pods remain alive and well after a daemon restart.
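As a rough illustration of what such a test would automate, here is a hedged sketch. The container name and exact commands are assumptions, not from this issue; the core assertion is factored into a function so the logic can be exercised without a real daemon:

```shell
# Hypothetical manual version of the check (commands assumed, not verified here):
#
#   docker run -d --name live-restore-test busybox sleep 3600
#   before=$(docker inspect -f '{{.State.Running}}' live-restore-test)
#   sudo systemctl restart docker
#   after=$(docker inspect -f '{{.State.Running}}' live-restore-test)
#
# Core assertion: the container must report Running both before and after
# the daemon restart for live restore to be considered working.
assert_survived_restart() {
  local before="$1" after="$2"
  if [ "$before" = "true" ] && [ "$after" = "true" ]; then
    echo "live restore OK"
  else
    echo "container did not survive daemon restart" >&2
    return 1
  fi
}
```

A real node e2e test would additionally verify that the kubelet reconciles cleanly with the restored containers rather than restarting the pods.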
/cc @kubernetes/sig-node @sjenning @mrunalp