podman-restart.service causes shutdown to hang #14434

andrin55 · 2022-05-31T15:48:02Z

Is this a BUG REPORT or FEATURE REQUEST? (leave only one on its own line)

/kind bug

Description

Enabling podman-restart.service causes shutdown to hang until the containers are killed after the timeout.

Steps to reproduce the issue:

Create container with restart-policy=always
Enable podman-restart.service
Restart and observe log

Describe the results you received:
Systemd waits 1m 30s for "libcrun container" until it kills it.

Describe the results you expected:
Graceful shutdown of containers on shutdown

Additional information you deem important (e.g. issue happens only occasionally):

Output of podman version:

4.0.2

Output of podman info --debug:

host:
  arch: amd64
  buildahVersion: 1.24.1
  cgroupControllers:
  - cpuset
  - cpu
  - io
  - memory
  - hugetlb
  - pids
  - rdma
  - misc
  cgroupManager: systemd
  cgroupVersion: v2
  conmon:
    package: conmon-2.1.0-1.el9.x86_64
    path: /usr/bin/conmon
    version: 'conmon version 2.1.0, commit: 3a898eb433ae426e729088ccdc2bdae44a3164da'
  cpus: 2
  distribution:
    distribution: '"rhel"'
    version: "9.0"
  eventLogger: journald
  hostname: localhost
  idMappings:
    gidmap: null
    uidmap: null
  kernel: 5.14.0-70.13.1.el9_0.x86_64
  linkmode: dynamic
  logDriver: journald
  memFree: 3225972736
  memTotal: 3865698304
  networkBackend: netavark
  ociRuntime:
    name: crun
    package: crun-1.4.4-2.el9_0.x86_64
    path: /usr/bin/crun
    version: |-
      crun version 1.4.4
      commit: 6521fcc5806f20f6187eb933f9f45130c86da230
      spec: 1.0.0
      +SYSTEMD +SELINUX +APPARMOR +CAP +SECCOMP +EBPF +CRIU +YAJL
  os: linux
  remoteSocket:
    exists: true
    path: /run/podman/podman.sock
  security:
    apparmorEnabled: false
    capabilities: CAP_NET_RAW,CAP_CHOWN,CAP_DAC_OVERRIDE,CAP_FOWNER,CAP_FSETID,CAP_KILL,CAP_NET_BIND_SERVICE,CAP_SETFCAP,CAP_SETGID,CAP_SETPCAP,CAP_SETUID,CAP_SYS_CHROOT
    rootless: false
    seccompEnabled: true
    seccompProfilePath: /usr/share/containers/seccomp.json
    selinuxEnabled: true
  serviceIsRemote: false
  slirp4netns:
    executable: /usr/bin/slirp4netns
    package: slirp4netns-1.1.12-4.el9.x86_64
    version: |-
      slirp4netns version 1.1.12
      commit: 7a104a101aa3278a2152351a082a6df71f57c9a3
      libslirp: 4.4.0
      SLIRP_CONFIG_VERSION_MAX: 3
      libseccomp: 2.5.2
  swapFree: 4202688512
  swapTotal: 4202688512
  uptime: 3m 52.77s
plugins:
  log:
  - k8s-file
  - none
  - passthrough
  - journald
  network:
  - bridge
  - macvlan
  volume:
  - local
registries:
  search:
  - registry.fedoraproject.org
  - registry.access.redhat.com
  - registry.centos.org
  - quay.io
  - docker.io
store:
  configFile: /etc/containers/storage.conf
  containerStore:
    number: 4
    paused: 0
    running: 4
    stopped: 0
  graphDriverName: overlay
  graphOptions:
    overlay.mountopt: nodev,metacopy=on
  graphRoot: /var/lib/containers/storage
  graphStatus:
    Backing Filesystem: xfs
    Native Overlay Diff: "false"
    Supports d_type: "true"
    Using metacopy: "true"
  imageCopyTmpDir: /var/tmp
  imageStore:
    number: 7
  runRoot: /run/containers/storage
  volumePath: /var/lib/containers/storage/volumes
version:
  APIVersion: 4.0.2
  Built: 1652984291
  BuiltTime: Thu May 19 20:18:11 2022
  GitCommit: ""
  GoVersion: go1.17.7
  OsArch: linux/amd64
  Version: 4.0.2

Package info (e.g. output of rpm -q podman or apt list podman):

podman-4.0.2-7.el9_0.x86_64

Have you tested with the latest version of Podman and have you checked the Podman Troubleshooting Guide? (https://github.com/containers/podman/blob/main/troubleshooting.md)

Yes (RHEL Podman)

Additional environment details (AWS, VirtualBox, physical, etc.):

Running latest RHEL9 Podman.

Due to podman-restart.service not having a ExecStop procedure, it fails when stopping:

systemd[1]: Stopping Podman Start All Containers With Restart Policy Set To Always...
systemd[1]: podman-restart.service: State 'stop-sigterm' timed out. Killing.
systemd[1]: podman-restart.service: Killing process 970 (conmon) with signal SIGKILL.
systemd[1]: podman-restart.service: Killing process 972 (gmain) with signal SIGKILL.
systemd[1]: podman-restart.service: Failed with result 'timeout'.
systemd[1]: Stopped Podman Start All Containers With Restart Policy Set To Always.

However this does not seem to stop the container, which then gets killed by systemd after the timeout.
Adding ExecStop to the podman-restart.service solves the issue (due to podman stop not supporting the "--filter" flag, I had to use this workarround):
ExecStop=/bin/sh -c '/usr/bin/podman $LOGGING stop $(/usr/bin/podman container ls --filter restart-policy=always -q)'

The text was updated successfully, but these errors were encountered:

mheon · 2022-05-31T17:55:57Z

Restart-policy doesn't actually use or require podman-restart.service - are you sure the two are related?

andrin55 · 2022-05-31T19:08:52Z

Since I use the podman-restart.service to start all containers with restart-policy=always I added the ExecStop with the same filter just to stop the same containers. The problem does not come from the restart-policy, it comes from the fact, that without ExecStop added to the podman-restart.service, systemd needs to kill the running containers forcefully in order to shut down (since nothing else seem to stop them gracefully).

rhatdan · 2022-05-31T19:10:30Z

Makes sense to me. @vrothberg WDYT?

vrothberg · 2022-06-01T08:48:00Z

Makes sense to me. @vrothberg WDYT?

Sounds good to me. Adding a new --filter flag to podman stop would be nice to make the ExecStop more elegant.

vrothberg · 2022-06-01T08:51:09Z

@andrin55, interested in opening a PR to fix the issue? We can add the --filter flag at some later point.

andrin55 · 2022-06-01T14:06:53Z

@vrothberg I made a new pull request. I messed something up with the signoff with the previous one.

openshift-ci bot added the kind/bug Categorizes issue or PR as related to a bug. label May 31, 2022

This was referenced Jun 1, 2022

podman-restart.service: Add ExecStop and dependencies to fix shutdown #14442

Closed

podman-restart.service: Add ExecStop and dependencies to fix shutdown #14446

Merged

openshift-merge-robot closed this as completed in #14446 Jun 2, 2022

urbenlegend mentioned this issue Aug 28, 2023

podman-restart.service doesn't like x-systemd.automount #19766

Closed

github-actions bot added the locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. label Sep 20, 2023

github-actions bot locked as resolved and limited conversation to collaborators Sep 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

podman-restart.service causes shutdown to hang #14434

podman-restart.service causes shutdown to hang #14434

andrin55 commented May 31, 2022

mheon commented May 31, 2022

andrin55 commented May 31, 2022

rhatdan commented May 31, 2022

vrothberg commented Jun 1, 2022

vrothberg commented Jun 1, 2022

andrin55 commented Jun 1, 2022

podman-restart.service causes shutdown to hang #14434

podman-restart.service causes shutdown to hang #14434

Comments

andrin55 commented May 31, 2022

mheon commented May 31, 2022

andrin55 commented May 31, 2022

rhatdan commented May 31, 2022

vrothberg commented Jun 1, 2022

vrothberg commented Jun 1, 2022

andrin55 commented Jun 1, 2022