rkt: lifecycle management #6

Closed
philips opened this issue Nov 12, 2014 · 9 comments

@philips
Contributor

philips commented Nov 12, 2014

We need to define how rkt knows that all of the processes running in a given stage1 have been destroyed and that the root filesystem can be cleaned up.

@philips philips added the core label Nov 12, 2014
@philips philips added this to the v0.1.0 milestone Nov 12, 2014
@jonboulle
Contributor

@vcaputo so braindumping here:

  • rkt containers run in a standard location, $RKTDIR/containers/$UUID, where RKTDIR = the --dir flag to rkt (defaults to /var/lib/rkt) and UUID is generated at rkt run time (this is all the case in master)
  • stage1 creates each app as a systemd service with Type=simple (i.e. not oneshot), potentially with Restart= policies depending on AppManifest
  • stage1 creates (via the stage1 rootfs tarball and/or the stage1 app -> systemd conversion process) shutdown hooks so that when the app(s) finally exit (through failure or success), the shutdown target triggers a process that collects status information (effectively for app in $apps; do systemctl status $app; done) in a known location in the stage1 rootfs (my thinking is having a dedicated rkt dir/file in the stage1, so for example /var/lib/rkt/containers/abc1234def/stage1/rkt/status); see the sketch at the end of this comment
  • rkt status $UUID is implemented to first check if the given container is running (and if so, effectively do a systemd-nspawn systemctl status $apps), and if not, it parses this status output file

Then rkt gc is implemented separately to garbage collect old containers based on an age policy, etc.
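
Very roughly, something along these lines (the unit contents, hook script, and paths here are illustrative guesses, not a spec):

    # Hypothetical per-app unit written by the stage1 app -> systemd conversion,
    # e.g. .../stage1/usr/lib/systemd/system/myapp.service:
    #
    #   [Service]
    #   Type=simple
    #   ExecStart=/opt/stage2/myapp/rootfs/bin/myapp
    #   Restart=no            # or whatever the AppManifest asks for
    #
    # Hypothetical hook triggered by the shutdown target once the apps exit,
    # collecting per-app status into the known location in the stage1 rootfs:
    status_dir=/rkt/status    # i.e. $RKTDIR/containers/$UUID/stage1/rkt/status as seen from the host
    mkdir -p "$status_dir"
    for app in $apps; do      # $apps = the apps listed in the container manifest
        systemctl status "$app" > "$status_dir/$app"
    done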

@pierrebeaucamp
Contributor

Excuse me if I'm wrong, but I think I'm missing something here.

stage1 creates [...] shutdown hooks so that when the app(s) finally exit (through failure or success), the shutdown target triggers a process that collects status information

As far as I can tell, reaper.service is only called OnFailure, so the systemctl status loop is never launched on success. Therefore nothing is created in stage1/rkt/status.

rkt status $UUID is implemented to first check if the given container is running [...]

rkt/status.go doesn't do anything so far

@vcaputo
Contributor

vcaputo commented Dec 5, 2014

See exit-watcher.service for how stage1/reaper.sh gets invoked on success.

Have you observed nothing being created in stage1/rkt/status on successful exit? That would be a bug, and inconsistent with my observations.

We're still sorting out the status/gc/lifecycle side of rkt, the big question being: How does one distinguish active from inactive containers?

An exclusive advisory lock on the container's /var/lib/rkt/containers/$uuid directory bound to the rkt process' lifetime is the current direction we're exploring. The issue of the moment on this path is systemd-nspawn closes fds not in LISTEN_FDS so our directory lock fd is closed prematurely in stage1.

We've been using nspawn in stage1 out of convenience but it's increasingly becoming less so as things mature. In the course of fleshing out the lifecycle details nspawn may end up being replaced entirely by something specialized and minimal.
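
For illustration only (not how stage1 actually wires this up), the lock scheme we're exploring boils down to something like this with flock(1):

    # rkt run side: take an exclusive advisory lock on the container directory
    # and hold it for the lifetime of the rkt process. The kernel releases the
    # lock when the fd is closed, which is why nspawn closing fds not in
    # LISTEN_FDS breaks it.
    dir=/var/lib/rkt/containers/$uuid
    exec 9< "$dir"
    flock -x 9
    # ... launch stage1 while fd 9 stays open ...

    # status/gc side: a non-blocking probe. If a shared lock can be taken, no
    # live rkt process holds the directory, i.e. the container has exited.
    if flock -n -s "$dir" true; then
        echo "$uuid has exited"
    else
        echo "$uuid is still running"
    fi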

@jonboulle
Contributor

An exclusive advisory lock on the container's /var/lib/rkt/containers/$uuid directory bound to the rkt process' lifetime is the current direction we're exploring. The issue of the moment on this path is systemd-nspawn closes fds not in LISTEN_FDS so our directory lock fd is closed prematurely in stage1.

Mentioned briefly with code in #35 fwiw if anyone wants to dig into this further :-) (currently distracted by other priorities)

@pierrebeaucamp
Contributor

@vcaputo

See exit-watcher.service for how stage1/reaper.sh gets invoked on success.

Sorry, I didn't catch that one.

Have you observed nothing being created in stage1/rkt/status on successful exit? That would be a bug, and inconsistent with my observations.

I was under the impression that stage1 would be terminated after stage2 was launched, and would therefore create something in this directory. But it appears that no status is created either when stage2 starts (and the app gets executed) or when you kill the process.

We're still sorting out the status/gc/lifecycle side of rkt, the big question being: How does one distinguish active from inactive containers?

I see now why this is a problem, as I thought status would work in a different way. What about copying machinectl to stage1 as well so it can be used to get the status of the containers?

@vcaputo
Contributor

vcaputo commented Dec 9, 2014

@pierrebeaucamp

I see now why this is a problem, as I thought status would work in a different way. What about copying machinectl to stage1 as well so it can be used to get the status of the containers?

Since rkt run is intended to function on non-systemd hosts as well we're not relying on systemd-specific facilities in the host for general functionality. We do have systemctl in stage1, but that's limited to interacting with the container's stage1 systemd instance. In the future we'll probably register with the host's systemd when available for improved integration, but that doesn't preclude the need for good general solutions.

I've put together a hacky PR which gives us both working advisory locks and a recorded container pid here: #244

This is not an attractive long-term solution but it does facilitate primitives for gc, list, status, etc. I think it's a reasonable intermediate step enabling movement on the other pieces.
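
As a sketch of the kind of primitive this enables (the file layout and names here are illustrative, not necessarily what the PR does):

    # Walk the containers, combining the recorded pid with a non-blocking lock
    # probe to tell live containers from exited ones.
    for dir in /var/lib/rkt/containers/*; do
        uuid=$(basename "$dir")
        pid=$(cat "$dir/pid" 2>/dev/null || echo '?')
        if flock -n -s "$dir" true; then
            state=exited      # lock free: no rkt process holds this container
        else
            state=running     # lock held by a live rkt process
        fi
        printf '%s\tpid=%s\t%s\n' "$uuid" "$pid" "$state"
    done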

@philips philips modified the milestones: v0.1.2, v0.1.1 Dec 11, 2014
@philips
Contributor Author

philips commented Dec 31, 2014

@vcaputo Is there a document on the current state of rkt gc/etc that we can point users at?

@jonboulle jonboulle modified the milestones: v0.1.2, v0.2.0 Jan 17, 2015
@vcaputo
Contributor

vcaputo commented Jan 23, 2015

#414 is a first stab at an explanation

@jonboulle
Contributor

Fixed by #414, thanks @vcaputo
