regression: can't kill and delete the container with shared(host) pid ns when the init process has dead #4047

lifubang · 2023-10-02T08:41:39Z

Description

After merge #3825 , if we create a container without PID namespace, and then exec some processes to this container, after the init process has dead, we can't kill and delete this container anymoe.
I think this is introduced by the commit f8ad20f .

Steps to reproduce the issue

create a container test without PID namespace, with the entry point is sleep 20;
runc exec -d test sleep infinity
wait 20 seconds, the init process exited;

Describe the results you received and expected

runc kill test KILL
received:
ERRO[0000] container not running

expected:
It should have the same effect like runc kill -a test KILL with runc 1.1.*

runc delete -f test
received:

ERRO[0000] Failed to remove paths: map[:/sys/fs/cgroup/unified/test blkio:/sys/fs/cgroup/blkio/user.slice/test cpu:/sys/fs/cgroup/cpu,cpuacct/user.slice/test cpuacct:/sys/fs/cgroup/cpu,cpuacct/user.slice/test cpuset:/sys/fs/cgroup/cpuset/test devices:/sys/fs/cgroup/devices/user.slice/test freezer:/sys/fs/cgroup/freezer/test hugetlb:/sys/fs/cgroup/hugetlb/test memory:/sys/fs/cgroup/memory/user.slice/user-1000.slice/session-8.scope/test misc:/sys/fs/cgroup/misc/test name=systemd:/sys/fs/cgroup/systemd/user.slice/user-1000.slice/session-8.scope/test net_cls:/sys/fs/cgroup/net_cls,net_prio/test net_prio:/sys/fs/cgroup/net_cls,net_prio/test perf_event:/sys/fs/cgroup/perf_event/test pids:/sys/fs/cgroup/pids/user.slice/user-1000.slice/session-8.scope/test rdma:/sys/fs/cgroup/rdma/test]

expected:
The container can be removed successfuly.

What version of runc are you using?

The main branch

Host OS information

NAME="Ubuntu"
VERSION="20.04.5 LTS (Focal Fossa)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu 20.04.5 LTS"
VERSION_ID="20.04"
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
VERSION_CODENAME=focal
UBUNTU_CODENAME=focal

Host kernel information

Linux acmcoder 5.15.0-84-generic #93~20.04.1-Ubuntu SMP Wed Sep 6 16:15:40 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux

The text was updated successfully, but these errors were encountered:

kolyshkin · 2023-11-07T00:52:15Z

So, there is a third scenario which is not described in the description. It is runc delete (without -f).

runc 1.1 did kill all the cgroup processes for a shared pid ns container with no init, and #3825 broke it.

Two ways to do when runc delete is called on such a container:

Kill the remaining processes (the logic behind this is "container init is dead, thus the container is dead, so runc delete should remove all its bits and pieces, including the leftover processes"). This is also backward-compatible with older runc.
Warn that container cgroup is not empty (and suggest to use runc kill or runc delete -f). The logic behind this is, this situation is not normal, and this container state is wrong (a stopped container should not have any leftover processes), so we want the user to know about it.

I'm not sure which one is better. For backward compatibility, (1) is a good choice. Logically, I like (2) more.

If we are to implement (2), I think we should also implement runc exec --ignore-stopped for such containers (so that a user can do something about it rather than killing those processes).

lifubang added the regression label Oct 2, 2023

lifubang changed the title ~~regression: can't kill and delete the container without pid ns when the init process has dead~~ regression: can't kill and delete the container with shared(hosted) pid ns when the init process has dead Oct 2, 2023

lifubang mentioned this issue Oct 2, 2023

Fix a regression when killing and deleting a container with shared(host) pid namespace #4048

Closed

lifubang changed the title ~~regression: can't kill and delete the container with shared(hosted) pid ns when the init process has dead~~ regression: can't kill and delete the container with shared(host) pid ns when the init process has dead Oct 2, 2023

kolyshkin mentioned this issue Oct 3, 2023

RFC: treat host pidns container with no init process as running if some processes exist in cgroup #4049

Closed

This was referenced Oct 8, 2023

HostPID Pod Container Cgroup path was residual after container restarts #4040

Closed

clarify kill and delete operation for shared pid namespace container opencontainers/runtime-spec#1234

Open

lifubang added this to the 1.2.0 milestone Oct 23, 2023

lifubang added the release-block This one should be resolved before draft an new release! label Oct 23, 2023

This was referenced Oct 31, 2023

Fix runc kill and runc delete for containers with no init and no private PID namespace #4102

Merged

Blockers for v1.2.0 #4114

Open

cyphar closed this as completed in #4102 Nov 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

regression: can't kill and delete the container with shared(host) pid ns when the init process has dead #4047

regression: can't kill and delete the container with shared(host) pid ns when the init process has dead #4047

lifubang commented Oct 2, 2023

kolyshkin commented Nov 7, 2023 •

edited

regression: can't kill and delete the container with shared(host) pid ns when the init process has dead #4047

regression: can't kill and delete the container with shared(host) pid ns when the init process has dead #4047

Comments

lifubang commented Oct 2, 2023

Description

Steps to reproduce the issue

Describe the results you received and expected

What version of runc are you using?

Host OS information

Host kernel information

kolyshkin commented Nov 7, 2023 • edited

kolyshkin commented Nov 7, 2023 •

edited