OSDOCS12626 GA User Name Space in OpenShift 4.20 by mburke5678 · Pull Request #84966 · openshift/openshift-docs

mburke5678 · 2024-11-14T14:47:45Z

https://issues.redhat.com/browse/OSDOCS-12626

Preview: Running pods in Linux user namespaces -- New assembly and included module.

QE review:

QE has approved this change.

ocpdocs-vale-bot · 2024-11-14T15:16:07Z

modules/nodes-pods-user-namespaces-configuring.adoc

-<3> Specifies the UID the container is run with.
-<4> Specifies which primary GID the containers is run with.
-<5> Requests that the pod is to be run in a user namespace. If `true`, the pod runs in the host user namespace. If `false`, the pod runs in a new user namespace that is created for the pod. The default is `true`.
+<3> Specifies the type of proc mount to use for the containers. The `unmasked` value ensures that a container's `/proc` is mounted as read-write by the container process. This bypasses the default masking behavior of the container runtime, and should only be used with an SCC that sets `hostUsers` to `false`. The default is `Default`.


🤖 [error] RedHat.TermsErrors: Use 'read/write' rather than 'read-write'. For more information, see RedHat.TermsErrors.

mburke5678 · 2024-11-26T23:18:41Z

@haircommander Can you PTAL?

modules/nodes-pods-user-namespaces-configuring.adoc

haircommander · 2024-11-27T15:15:23Z

modules/nodes-pods-user-namespaces-configuring.adoc

-After you save the changes, new machine configs are created, the machine config pools are updated, and scheduling on each node is disabled while the change is being applied.
+Also, configuring workloads as `procMount: unmasked` is generally considered as safe within a user namespace. Setting `procMount` to `unmasked` has benefits that are beyond the scope of this documentation. 
+
+To ensure that the namespace functionality exists on all nodes that you want to run in a user namespace, you can configure the minimum version of kubelet that is required for the nodes in your cluster. If the kubelet version in your cluster is lower than this version, new nodes are not scheduled and existing nodes are marked as degraded. For existing nodes with a lower version, the kubelet can only read the node object by using `oc get` or `oc update` commands, and the actions that the node can perform, by using a `SelfSubjectAccessReview`. The node is not allowed to gain access to any other API objects.     


If the kubelet version in your cluster is lower than this version, new nodes are not scheduled and existing nodes are marked as degraded

actually minimumKubeletVersion cannot be set if the current nodes in the cluster are too old. minimumKubeletVersion, once set, is only for guranteeing sure new nodes in the cluster are new enough

For existing nodes with a lower version, the kubelet can only read the node object by using oc get or oc update commands

the kubelet doesn't use oc , you can just say the kubelet can only read and update its own node object

@haircommander

actually minimumKubeletVersion cannot be set if the current nodes in the cluster are too old.

Does "too old" suggest v1.29 and lower?

older than the defined MinimumKubeletVersion

@haircommander

For existing nodes with a lower version, the kubelet can only read its node object.

Is this statement true only if I set a minimum kubelet version? Are there any negative ramifications that the user who runs lower kubelet version(s) should be aware of?

Is this statement true only if I set a minimum kubelet version

yeah

Are there any negative ramifications that the user who runs lower kubelet version(s) should be aware of?

There are two cases:

if the lower kubleet version exists in the cluster at the time min kubelet version is attempted to be set, the validation rejects the min kubelet update

if the node is being added after the min kubelet version is established, that node will not be able to connect to the cluster meaningfully. It would take manual intervention to fix

modules/nodes-pods-user-namespaces-configuring.adoc

nodes/pods/nodes-pods-user-namespaces.adoc

modules/nodes-pods-user-namespaces-configuring.adoc

mburke5678 · 2024-12-09T20:52:39Z

@lyman9966 When appropriate, can you please review this PR for QE? I believe you have not started testing the feature itself.

bergerhoffer · 2025-02-24T19:08:38Z

The branch/enterprise-4.19 label has been added to this PR.

This is because your PR targets the main branch and is labeled for enterprise-4.18. And any PR going into main must also target the latest version branch (enterprise-4.19).

If the update in your PR does NOT apply to version 4.19 onward, please re-target this PR to go directly into the appropriate version branch or branches (enterprise-4.x) instead of main.

openshift-bot · 2025-08-13T01:00:17Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

modules/nodes-pods-user-namespaces-configuring.adoc

mburke5678 · 2025-09-05T14:20:41Z

@lyman9966 Can you PTAL for QE?

lyman9966 · 2025-09-12T03:45:17Z

nodes/pods/nodes-pods-user-namespaces.adoc

-
-:FeatureName: Support for Linux user namespaces
-include::snippets/technology-preview.adoc[]
+You can configure Linux user namespace use by setting the `hostUsers` parameter to `false` in the pod spec, and a few other configurations, as shown in the following procedure.


I think we can changeYou can configure Linux user namespace use to You can configure Linux user namespace

lyman9966 · 2025-09-12T07:22:35Z

modules/nodes-pods-user-namespaces-configuring.adoc

+<2> Specifies whether the pod is to be run in a user namespace. If `false`, the pod runs in a new user namespace that is created for the pod. If `true`, the pod runs in the host user namespace. The default is `true`.
+<3>  `capabilities` permit privileged actions without giving full root access. Technically, setting capabilities inside of a user namespace is safer than setting them outside, as the scope of the capabilities are limited by being inside user namespace, and can generally be considered to be safe. However, giving pods capabilities like `CAP_SYS_ADMIN` to any untrusted workload could increase the potential kernel surface area that a containerized process has access to and could find exploits in. Thus, capabilities inside of a user namespace are allowed at `baseline` level in pod security admission.
+<4> Specifies that processes inside the container run with a user that has any UID other than 0.
+<5> Optional: Specifies the type of proc mount to use for the containers. The `unmasked` value ensures that a container's `/proc` file system is mounted as read/write by the container process. The default is `Default`.


[The default is Default] should be [The default is masked]?

@haircommander Can you help with this ^^?

@lyman9966 This is from the help text in the web console:

procMount denotes the type of proc mount to use for the containers. The default value is Default which uses the container runtime defaults for readonly paths and masked paths. This requires the ProcMountType feature flag to be enabled. Note that this field cannot be set when spec.os.name is windows. Possible enum values: - `"Default"` uses the container runtime defaults for readonly and masked paths for /proc. Most container runtimes mask certain paths in /proc to avoid accidental security exposure of special devices or information. - `"Unmasked"` bypasses the default masking behavior of the container runtime and ensures the newly created /proc the container stays in tact with no modifications. Allowed Values: Default Unmasked

I created a pod without specifying a procMount. But I am not sure where the parameter is shown.

I tried with the following:

set procMount: Default

not specify procMount
The both can lead to the same results as procMount: Unmasked. (i.e. rw permission). I'm not sure if there is a bug exist.
% oc exec nested-podman -- mount | grep "/proc type proc"
proc on /proc type proc (rw,nosuid,nodev,noexec,relatime)

@haircommander Can you take a look? Thanks

@lyman9966 can you show me the pod spec? I used

apiVersion: v1 kind: Pod metadata: name: nested-podman-1 annotations: io.kubernetes.cri-o.Devices: "/dev/net/tun" spec: hostUsers: false # user namespace containers: - name: nested-podman image: docker.io/lyman9966/baseline-nested-container:v1.0 args: - sleep - "1000000" securityContext: runAsUser: 0 # procMount: Unmasked capabilities: add: - "SETGID" - "SETUID"

and got

$ oc exec -ti pod/nested-podman-1 -- /bin/sh sh-5.2# ls ^C sh-5.2# sh-5.2# mount | grep proc proc on /proc type proc (rw,nosuid,nodev,noexec,relatime) tmpfs on /proc/acpi type tmpfs (ro,relatime,context="system_u:object_r:container_file_t:s0:c228,c594",size=0k,uid=3725918208,gid=3725918208,inode64) devtmpfs on /proc/interrupts type devtmpfs (ro,nosuid,seclabel,size=4096k,nr_inodes=1998457,mode=755,inode64) devtmpfs on /proc/kcore type devtmpfs (ro,nosuid,seclabel,size=4096k,nr_inodes=1998457,mode=755,inode64) devtmpfs on /proc/keys type devtmpfs (ro,nosuid,seclabel,size=4096k,nr_inodes=1998457,mode=755,inode64) tmpfs on /proc/scsi type tmpfs (ro,relatime,context="system_u:object_r:container_file_t:s0:c228,c594",size=0k,uid=3725918208,gid=3725918208,inode64) devtmpfs on /proc/timer_list type devtmpfs (ro,nosuid,seclabel,size=4096k,nr_inodes=1998457,mode=755,inode64) proc on /proc/bus type proc (ro,nosuid,nodev,noexec,relatime) proc on /proc/fs type proc (ro,nosuid,nodev,noexec,relatime) proc on /proc/irq type proc (ro,nosuid,nodev,noexec,relatime) proc on /proc/sys type proc (ro,nosuid,nodev,noexec,relatime) proc on /proc/sysrq-trigger type proc (ro,nosuid,nodev,noexec,relatime)

as expected

setting to default gives me the same

@haircommander I think I got the same results as you, but I misunderstand the meaning of masked procMount. I thought the "rw" mount of /proc should not appear in command "mount | grep proc" :
proc on /proc type proc (rw,nosuid,nodev,noexec,relatime)
But it's ok indeed, because there is "ro" mount for sub /proc.

yeah unmasked means one rw mount, rather than the default which has many ro mounts (basically it's a limited view into proc)

lyman9966 · 2025-09-12T10:21:40Z

modules/nodes-pods-user-namespaces-configuring.adoc

----
-<1> Specifies the machine config pool label.
-<2> Specifies the container runtime to deploy.
+To require a specific SCC for a workload, set the `openshift.io/required-scc` annotation in the object specification. For more information, see "Configuring a workload to require a specific SCC". Alternatively, you can add an SCC to a specific user or group by using the `oc adm policy add-scc-to-user` or `oc adm policy add-scc-to-group` command. For more information, see the "OpenShift CLI administrator command reference".


In my test, only set the openshift.io/required-scc annotation in the object specification can't pass pod admission.
For a non-admin user, if I only set the openshift.io/required-scc annotation, it will prompt:
unable to validate against any security context constraint: provider "restricted-v3": Forbidden: not usable by user or serviceaccount

@haircommander Can you help with this ^^?

you can remove the required-scc part I think

lyman9966 · 2025-09-16T05:06:14Z

/lgtm

haircommander · 2025-09-16T13:23:28Z

/lgtm

lahinson

@mburke5678 Looks pretty good. I caught one tiny thing to fix (a missing word) and made a couple of suggestions to handle in a future PR. Feel free to merge this when you're ready.

modules/nodes-pods-user-namespaces-configuring.adoc

openshift-ci · 2025-09-17T17:40:14Z

New changes are detected. LGTM label has been removed.

openshift-ci · 2025-09-17T17:52:49Z

@mburke5678: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

mburke5678 · 2025-09-17T19:35:51Z

/cherrypick enterprise-4.20

openshift-cherrypick-robot · 2025-09-17T19:35:55Z

@mburke5678: once the present PR merges, I will cherry-pick it on top of enterprise-4.20 in a new PR and assign it to you.

Details

In response to this:

/cherrypick enterprise-4.20

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

mburke5678 · 2025-09-17T19:52:20Z

/cherrypick enterprise-4.20

openshift-cherrypick-robot · 2025-09-17T19:52:30Z

@mburke5678: new pull request created: #99308

Details

In response to this:

/cherrypick enterprise-4.20

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2025-09-17T19:53:18Z

@mburke5678: new pull request could not be created: failed to create pull request against openshift/openshift-docs#enterprise-4.20 from head openshift-cherrypick-robot:cherry-pick-84966-to-enterprise-4.20: status code 422 not one of [201], body: {"message":"Validation Failed","errors":[{"resource":"PullRequest","code":"custom","message":"A pull request already exists for openshift-cherrypick-robot:cherry-pick-84966-to-enterprise-4.20."}],"documentation_url":"https://docs.github.com/rest/pulls/pulls#create-a-pull-request","status":"422"}

Details

In response to this:

/cherrypick enterprise-4.20

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

mburke5678 added the branch/enterprise-4.18 label Nov 14, 2024

mburke5678 added this to the Planned for 4.18 GA milestone Nov 14, 2024

openshift-ci bot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Nov 14, 2024

ocpdocs-vale-bot reviewed Nov 14, 2024

View reviewed changes

openshift-ci bot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Nov 22, 2024

haircommander reviewed Nov 27, 2024

View reviewed changes

modules/nodes-pods-user-namespaces-configuring.adoc Outdated Show resolved Hide resolved