Automatically set `may_detach_mounts=1` on startup #34886

cpuguy83 · 2017-09-18T13:51:45Z

This is kernel config available in RHEL7.4 based kernels that enables
mountpoint removal where the mountpoint exists in other namespaces.
In particular this is important for making this pattern work:

umount -l /some/path
rm -r /some/path

Where /some/path exists in another mount namespace.
Setting this value will prevent device or resource busy errors when
attempting to the removal of /some/path in the example.

This setting is the default, and non-configurable, on upstream kernels
since 3.15.

runcom · 2017-09-18T13:57:48Z

Looks good, ping @rhvgoyal

rhvgoyal · 2017-09-20T13:07:53Z

LGTM.

We already do this with the help of sysctl interface. So over a boot, all the sysctl settings are applied and this is already enabled by the time docker runs.

thaJeztah

minor nits, but LGTM otherwise :)

thaJeztah · 2017-09-20T13:52:04Z

daemon/daemon_unix.go

+		if os.IsNotExist(err) {
+			return nil
+		}
+		return errors.Wrap(err, "error opening may_deatch_mounts kernel config file")


oh! typo: s/may_deatch_mounts/may_detach_mounts/

thaJeztah · 2017-09-20T13:53:44Z

daemon/daemon_unix.go

+		// unprivileged container. Ignore the error, but log
+		// it if we appear not to be in that situation.
+		if !rsystem.RunningInUserNS() {
+			logrus.Debugf("Permission denied writing %q to /proc/sys/fs/may_detach_mounts", "1")


Looks like there's no need to use %q here, because "1" is hardcoded

Well, it quotes it, which is pretty ugly to do otherwise.

backticks for the string?

`Permission denied writing "1" to ....`

This is kernel config available in RHEL7.4 based kernels that enables mountpoint removal where the mountpoint exists in other namespaces. In particular this is important for making this pattern work: ``` umount -l /some/path rm -r /some/path ``` Where `/some/path` exists in another mount namespace. Setting this value will prevent `device or resource busy` errors when attempting to the removal of `/some/path` in the example. This setting is the default, and non-configurable, on upstream kernels since 3.15. Signed-off-by: Brian Goff <cpuguy83@gmail.com>

thaJeztah

LGTM

kolyshkin · 2017-09-21T00:29:14Z

Wonderful! I was just going to work on a similar PR, but for the devmapper graph driver only (as its "deferred deletion" feature only works in case fs.may_detach_mounts=1). I did not realize it might be helpful in non-devmapper case, too. This patch will also help in case of autodetection whether deferred deletion is working or not (see mbentley/docker-devicemapper-setup#4).

The alternative approach is to add a /usr/lib/sysctl.d/90-docker.conf file to docker rpm spec, containing fs.may_detach_mounts=1, and adding a kludge to the .spec so that the setting is also applied upon rpm installation. This would be cleaner (and this is what Red Hat's runc rpm does) -- the only downside is anyone who's not installing from an rpm would not get it.

Yet another alternative is something like:

// -q: not display the value set, -e: ignore non-existent key
err := exec.Command("sysctl", "-q", "-e", "fs.may_detach_mounts=1").Run()
...

While exec sn ugly in general, this is only done once upon daemon (re)start so it's OK.

thaJeztah · 2017-09-21T11:07:55Z

Yet another alternative is something like:

I do like the alternative approach (cleaner); would the permissions denied error still be properly returned with that code?

kolyshkin · 2017-09-21T19:07:37Z

I do like the alternative approach (cleaner); would the permissions denied error
still be properly returned with that code?

Well, what we'll get is an exit code (most probably a generic 1) and a text from stderr (which we can parse for "permission denied" but it's a slippery slope as the locale might change that). I suggest treating any error, not just EPERM, in the same way, like:

s := "fs.may_detach_mounts=1"
// -q: not display the value set, -e: ignore non-existent key
err := exec.Command("sysctl", "-q", "-e", s).Run()
// ignore the error if the daemon is inside userns
if err != nil && !rsystem.RunningInUserNS() {
	logrus.Warnf("Error setting %s: %v", s, err)
}

Or maybe even not try setting this at all if we're in userns:

// do not try to set sysctls from inside userns
if !rsystem.RunningInUserNS() {
	s := "fs.may_detach_mounts=1"
	// -q: not display the value set, -e: ignore non-existent key
	if err := exec.Command("sysctl", "-q", "-e", s).Run(); err != nil {
		logrus.Warnf("Error setting %s sysctl: %v", s, err)
}

thaJeztah · 2017-09-21T19:10:18Z

(If @cpuguy83 agrees) feel free to open a PR with that change

cpuguy83 · 2017-09-21T19:28:03Z

Why use exec here?

cpuguy83 · 2017-09-21T19:29:16Z

I'd be fine with setting this in the systemd conf, however that's really a packaging issue which we don't really deal with in this repo (though there are some init scripts in contrib).

kolyshkin · 2017-09-21T19:29:50Z

Why use exec here?

As I said earlier, While exec sn ugly in general, this is only done once upon daemon (re)start so it's OK.

The benefits I see are:

using more standard interface
less code

moby/moby#22260 moby/moby#34886

GordonTheTurtle added the status/0-triage label Sep 18, 2017

cpuguy83 mentioned this pull request Sep 18, 2017

Unable to remove a stopped container: device or resource busy #22260

Closed

cpuguy83 requested review from runcom and mlaventure September 18, 2017 13:54

cpuguy83 added status/2-code-review and removed status/0-triage labels Sep 18, 2017

thaJeztah requested changes Sep 20, 2017

View reviewed changes

cpuguy83 force-pushed the may_detach_mount branch from b4a87a9 to 83c2152 Compare September 20, 2017 13:57

runcom approved these changes Sep 20, 2017

View reviewed changes

thaJeztah approved these changes Sep 20, 2017

View reviewed changes

thaJeztah added impact/changelog status/4-merge and removed status/2-code-review labels Sep 20, 2017

cpuguy83 added the rebuild/janky label Sep 20, 2017

GordonTheTurtle removed the rebuild/janky label Sep 20, 2017

yongtang merged commit 7d70d0f into moby:master Sep 20, 2017

cpuguy83 deleted the may_detach_mount branch September 21, 2017 00:17

opera443399 added a commit to opera443399/ops that referenced this pull request Sep 26, 2017

doc(docker): issue #22260 fixed in CentOS 7.4

2f65285

moby/moby#22260 moby/moby#34886

kolyshkin mentioned this pull request Sep 29, 2017

Multiple run of docker run --rm hangs in a host docker/for-linux#115

Open

3 tasks

cpuguy83 restored the may_detach_mount branch September 30, 2017 15:31

cpuguy83 deleted the may_detach_mount branch September 30, 2017 17:17

jamiejackson mentioned this pull request Oct 20, 2017

Kernel parameter fs.may_detach_mounts is necessary even if mount flag is set to slave haxorof/ansible-role-docker-ce#13

Closed

ndegory mentioned this pull request Oct 24, 2017

AMP on centos/redhat appcelerator-archive/amp#1672

Closed

robnagler mentioned this pull request Dec 11, 2017

"dead" probably appearing with overlay2 radiasoft/containers#83

Closed

nayihz mentioned this pull request Aug 31, 2023

Containerd service not restarting (and many pods stuck in Terminating status) containerd/containerd#6953

Open

thaJeztah added area/volumes area/storage labels Jun 22, 2024

thaJeztah mentioned this pull request Jul 22, 2024

daemon: remove setMayDetachMounts (set may_detach_mounts=1 on startup) #48210

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automatically set `may_detach_mounts=1` on startup #34886

Automatically set `may_detach_mounts=1` on startup #34886

cpuguy83 commented Sep 18, 2017

runcom commented Sep 18, 2017

rhvgoyal commented Sep 20, 2017

thaJeztah left a comment

thaJeztah Sep 20, 2017

thaJeztah Sep 20, 2017

cpuguy83 Sep 20, 2017

thaJeztah Sep 20, 2017

thaJeztah left a comment

kolyshkin commented Sep 21, 2017 •

edited

Loading

thaJeztah commented Sep 21, 2017

kolyshkin commented Sep 21, 2017

thaJeztah commented Sep 21, 2017

cpuguy83 commented Sep 21, 2017

cpuguy83 commented Sep 21, 2017

kolyshkin commented Sep 21, 2017 •

edited

Loading

Automatically set may_detach_mounts=1 on startup #34886

Automatically set may_detach_mounts=1 on startup #34886

Conversation

cpuguy83 commented Sep 18, 2017

runcom commented Sep 18, 2017

rhvgoyal commented Sep 20, 2017

thaJeztah left a comment

Choose a reason for hiding this comment

thaJeztah Sep 20, 2017

Choose a reason for hiding this comment

thaJeztah Sep 20, 2017

Choose a reason for hiding this comment

cpuguy83 Sep 20, 2017

Choose a reason for hiding this comment

thaJeztah Sep 20, 2017

Choose a reason for hiding this comment

thaJeztah left a comment

Choose a reason for hiding this comment

kolyshkin commented Sep 21, 2017 • edited Loading

thaJeztah commented Sep 21, 2017

kolyshkin commented Sep 21, 2017

thaJeztah commented Sep 21, 2017

cpuguy83 commented Sep 21, 2017

cpuguy83 commented Sep 21, 2017

kolyshkin commented Sep 21, 2017 • edited Loading

Automatically set `may_detach_mounts=1` on startup #34886

Automatically set `may_detach_mounts=1` on startup #34886

kolyshkin commented Sep 21, 2017 •

edited

Loading

kolyshkin commented Sep 21, 2017 •

edited

Loading