generate systemd: do not set `KillMode` #8889

vrothberg · 2021-01-05T14:57:37Z

KillMode=none has been deprecated in systemd and is now throwing big
warnings when being used. Users have reported the issues upstream
(see #8615) and on the mailing list.

This deprecation was mainly motivated by an abusive use of third-party
vendors causing all kinds of undesired side-effects. For instance, busy
mounts that delay reboot.

After talking to the systemd team, we came up with the following plan:

Short term: we can use TimeoutStopSec and remove KillMode=none which
will default to cgroup.

Long term: we want to change the type to sdnotify. The plumbing for
Podman is done but we need it for conmon. Once sdnotify is working, we
can get rid of the pidfile handling etc. and let Podman handle it.
Michal Seklatar came up with a nice idea that Podman increase the time
out on demand. That's a much cleaner way than hard-coding the time out
in the unit as suggest in the short-term solution.

This change is executing the short-term plan and sets a minimum timeout
of 60 seconds. User-specified timeouts are added to that.

Fixes: #8615
Signed-off-by: Valentin Rothberg rothberg@redhat.com

vrothberg · 2021-01-05T14:57:51Z

@giuseppe @msekletar PTAL

rhatdan

LGTM

vrothberg · 2021-01-05T15:17:05Z

Marked as WIP as I want to perform some manual tests.

giuseppe

LGTM

openshift-ci-robot · 2021-01-05T15:25:38Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: giuseppe, rhatdan, vrothberg

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [giuseppe,rhatdan,vrothberg]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

`KillMode=none` has been deprecated in systemd and is now throwing big warnings when being used. Users have reported the issues upstream (see containers#8615) and on the mailing list. This deprecation was mainly motivated by an abusive use of third-party vendors causing all kinds of undesired side-effects. For instance, busy mounts that delay reboot. After talking to the systemd team, we came up with the following plan: **Short term**: we can use TimeoutStopSec and remove KillMode=none which will default to cgroup. **Long term**: we want to change the type to sdnotify. The plumbing for Podman is done but we need it for conmon. Once sdnotify is working, we can get rid of the pidfile handling etc. and let Podman handle it. Michal Seklatar came up with a nice idea that Podman increase the time out on demand. That's a much cleaner way than hard-coding the time out in the unit as suggest in the short-term solution. This change is executing the short-term plan and sets a minimum timeout of 60 seconds. User-specified timeouts are added to that. Fixes: containers#8615 Signed-off-by: Valentin Rothberg <rothberg@redhat.com>

vrothberg · 2021-01-05T17:17:26Z

@containers/podman-maintainers PTAL

rhatdan · 2021-01-05T19:08:22Z

/lgtm

sshnaidm · 2021-10-13T10:22:29Z

pkg/systemd/generate/containers_test.go

 ExecStartPre=/bin/rm -f %t/jadda-jadda.pid %t/jadda-jadda.ctr-id
 ExecStart=/usr/bin/podman run --conmon-pidfile %t/jadda-jadda.pid --cidfile %t/jadda-jadda.ctr-id --cgroups=no-conmon -d --replace --name jadda-jadda --hostname hello-world awesome-image:latest command arg1 ... argN "foo=arg \"with \" space"
-ExecStop=/usr/bin/podman stop --ignore --cidfile %t/jadda-jadda.ctr-id -t 42
+ExecStop=/usr/bin/podman stop --ignore --cidfile %t/jadda-jadda.ctr-id -t 10


If we have TimeoutStopSec now, what's the point of leaving ExecStop then?

We want Podman to take care of stopping and to clean up properly. TimeoutStopSec is the big hammer of systemd to nuke everything in the cgroup and may leave resources behind.

Currently with KillMode=none, if podman -t hangs, the command repeats leaving the process and its cgroup around. But if a stop/start command hangs, we should not start another one. Instead time it out properly via TimeoutStopSec set to the 2x of the timeout given to the managed podman action. Then if it expires, kill its cgroup all the way (KillMode=control-group is a default) before rerunning the same operation. Also note that using KilMode process is not recommended by systemd man pages. Increase the grace stop timeout defaults 10->42s to align it with: containers/podman#8889 Closes-bug: #1945791 Change-Id: Iefe861f91cefe2a9cf773cae98b2440566ae8b5e Signed-off-by: Bogdan Dobrelya <bdobreli@redhat.com>

* Update tripleo-ansible from branch 'master' to badd5a3c10f0af9e1c69a9b88064e743a36d4f53 - Wrap stopping podman -t with systemd timeouts Currently with KillMode=none, if podman -t hangs, the command repeats leaving the process and its cgroup around. But if a stop/start command hangs, we should not start another one. Instead time it out properly via TimeoutStopSec set to the 2x of the timeout given to the managed podman action. Then if it expires, kill its cgroup all the way (KillMode=control-group is a default) before rerunning the same operation. Also note that using KilMode process is not recommended by systemd man pages. Increase the grace stop timeout defaults 10->42s to align it with: containers/podman#8889 Closes-bug: #1945791 Change-Id: Iefe861f91cefe2a9cf773cae98b2440566ae8b5e Signed-off-by: Bogdan Dobrelya <bdobreli@redhat.com>

* Update tripleo-ansible from branch 'master' to fdfc59cd1d1024d132e6046f3ba4746fed23c8c2 - Remove execpost and align systemd timeouts Align systemd timeouts to podman[1]. Remove stop command with ExecStopPost command, because it won't help much to run same command twice if it's stuck. Set default time for stop command to 50 if not defined otherwise. In case it's stuck, systemd will nuke everything (TimeoutStopSec=60) 60 seconds = 50 seconds of trying to stop + 10 seconds for avoiding the race. [1] containers/podman#8889 Change-Id: Ife90b6c45b7f4a3fd829b60f9aa785fc580a31b8

Align systemd timeouts to podman[1]. Remove stop command with ExecStopPost command, because it won't help much to run same command twice if it's stuck. Set default time for stop command to 50 if not defined otherwise. In case it's stuck, systemd will nuke everything (TimeoutStopSec=60) 60 seconds = 50 seconds of trying to stop + 10 seconds for avoiding the race. [1] containers/podman#8889 Change-Id: Ife90b6c45b7f4a3fd829b60f9aa785fc580a31b8

Currently with KillMode=none, if podman -t hangs, the command repeats leaving the process and its cgroup around. But if a stop/start command hangs, we should not start another one. Instead time it out properly via TimeoutStopSec set to the 2x of the timeout given to the managed podman action. Then if it expires, kill its cgroup all the way (KillMode=control-group is a default) before rerunning the same operation. Also note that using KilMode process is not recommended by systemd man pages. Increase the grace stop timeout defaults 10->42s to align it with: containers/podman#8889 Closes-bug: #1945791 Change-Id: Iefe861f91cefe2a9cf773cae98b2440566ae8b5e Signed-off-by: Bogdan Dobrelya <bdobreli@redhat.com> (cherry picked from commit badd5a3)

Currently with KillMode=none, if podman -t hangs, the command repeats leaving the process and its cgroup around. But if a stop/start command hangs, we should not start another one. Instead time it out properly via TimeoutStopSec set to the 2x of the timeout given to the managed podman action. Then if it expires, kill its cgroup all the way (KillMode=control-group is a default) before rerunning the same operation. Also note that using KilMode process is not recommended by systemd man pages. Increase the grace stop timeout defaults 10->42s to align it with: containers/podman#8889 Closes-bug: #1945791 Change-Id: Iefe861f91cefe2a9cf773cae98b2440566ae8b5e Signed-off-by: Bogdan Dobrelya <bdobreli@redhat.com> (cherry picked from commit badd5a3c10f0af9e1c69a9b88064e743a36d4f53)

1) Currently with KillMode=none, if podman -t hangs, the command repeats leaving the process and its cgroup around. But if a stop/start command hangs, we should not start another one. Instead time it out properly via TimeoutStopSec set to the 2x of the timeout given to the managed podman action. Then if it expires, kill its cgroup all the way (KillMode=control-group is a default) before rerunning the same operation. Also note that using KilMode process is not recommended by systemd man pages. Increase the grace stop timeout defaults 10->42s to align it with: containers/podman#8889 2) Make KillMode configurable for service units Libvirt container is a special beast that needs custom killmode value for its tripleo-manager service unit. Add kill_mode for the container values that defaults to control-group. 3) Fix TimeoutStopSec to integer convertion Related: rhbz#2010135 Closes-bug: #1945791 Change-Id: Iefe861f91cefe2a9cf773cae98b2440566ae8b5e Signed-off-by: Bogdan Dobrelya <bdobreli@redhat.com> (cherry picked from commit badd5a3)

openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jan 5, 2021

rhatdan approved these changes Jan 5, 2021

View reviewed changes

vrothberg changed the title ~~generate systemd: do not set KillMode~~ WIP - generate systemd: do not set KillMode Jan 5, 2021

openshift-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jan 5, 2021

giuseppe approved these changes Jan 5, 2021

View reviewed changes

vrothberg force-pushed the run-1138 branch from f8f363f to 219c69e Compare January 5, 2021 15:37

vrothberg changed the title ~~WIP - generate systemd: do not set KillMode~~ generate systemd: do not set KillMode Jan 5, 2021

openshift-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jan 5, 2021

openshift-ci-robot assigned rhatdan Jan 5, 2021

openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Jan 5, 2021

openshift-merge-robot merged commit 1f59276 into containers:master Jan 5, 2021

sshnaidm reviewed Oct 13, 2021

View reviewed changes

github-actions bot added the locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. label Sep 22, 2023

github-actions bot locked as resolved and limited conversation to collaborators Sep 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

generate systemd: do not set `KillMode` #8889

generate systemd: do not set `KillMode` #8889

vrothberg commented Jan 5, 2021

vrothberg commented Jan 5, 2021

rhatdan left a comment

vrothberg commented Jan 5, 2021 •

edited

Loading

giuseppe left a comment

openshift-ci-robot commented Jan 5, 2021

vrothberg commented Jan 5, 2021

rhatdan commented Jan 5, 2021

sshnaidm Oct 13, 2021

vrothberg Oct 13, 2021

generate systemd: do not set KillMode #8889

generate systemd: do not set KillMode #8889

Conversation

vrothberg commented Jan 5, 2021

vrothberg commented Jan 5, 2021

rhatdan left a comment

Choose a reason for hiding this comment

vrothberg commented Jan 5, 2021 • edited Loading

giuseppe left a comment

Choose a reason for hiding this comment

openshift-ci-robot commented Jan 5, 2021

vrothberg commented Jan 5, 2021

rhatdan commented Jan 5, 2021

sshnaidm Oct 13, 2021

Choose a reason for hiding this comment

vrothberg Oct 13, 2021

Choose a reason for hiding this comment

generate systemd: do not set `KillMode` #8889

generate systemd: do not set `KillMode` #8889

vrothberg commented Jan 5, 2021 •

edited

Loading