
Attacher and provisioner may need privileged too #32

Closed
wongma7 opened this issue Mar 29, 2019 · 21 comments
Labels
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@wongma7
Contributor

wongma7 commented Mar 29, 2019

It seems that on SELinux-enabled systems, the provisioner & attacher pod can't access the socket created by the privileged plugin pod. Previously: kubernetes/kubernetes#69215. I am not sure of the exact reason, hopefully @bertinatto can explain :))

@wongma7
Contributor Author

wongma7 commented Mar 29, 2019

I think it is because /var/lib/kubelet/plugins/csi-hostpath needs to be relabelled to allow random containers to read it. The hostpath plugin will not relabel it for us https://github.com/kubernetes/kubernetes/blob/7f0e04a089125901dce18c7d96507f2b60560e18/pkg/volume/host_path/host_path.go#L213

@bertinatto
Contributor

I think it is because /var/lib/kubelet/plugins/csi-hostpath needs to be relabelled to allow random containers to read it

@wongma7, that's correct! On SELinux-enabled systems, --privileged containers add certain labels to the files they create. If I'm not mistaken, those labels vary from namespace to namespace. Because of that, when a non-privileged container tries to access the socket file created by the privileged one, it gets a permission error due to mismatched labels.

Recently, I found a better way to handle this. We can assign a specific SELinux label to csi-hostpath, like this:

SecurityContext: &v1.SecurityContext{
        Privileged: true,
        SELinuxOptions: &v1.SELinuxOptions{
                Level: "s0:c0,c1",
        },
},

With that, all files created by this container will have the label above. Then we can assign the same labels to the attacher container:

SecurityContext: &v1.SecurityContext{
        SELinuxOptions: &v1.SELinuxOptions{
                Level: "s0:c0,c1",
        },
},

This should prevent the mismatch and thus the permission error.
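For reference, the same idea expressed directly in the pod manifests would look roughly like this (my own sketch; the level value is only an example):

# privileged plugin container: files it creates get this level
securityContext:
  privileged: true
  seLinuxOptions:
    level: "s0:c0,c1"

# non-privileged attacher container: same level so it can read the socket
securityContext:
  seLinuxOptions:
    level: "s0:c0,c1"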

@pohly
Contributor

pohly commented Apr 1, 2019

What does s0:c0,c1 mean? Are these the right values for all SELinux-enabled systems or just for the one you tried this on?

@bertinatto
Contributor

s0:c0,c1 was just a random value that I tried.

The file /etc/selinux/targeted/setrans.conf contains a translation for these values. Note that it may vary from system to system as these values can be configured by the admin through the semanage tool.

@pohly
Contributor

pohly commented Apr 1, 2019

s0:c0,c1 was just a random value that I tried. ... Note that it may vary from system to system

That's what I feared. This is a problem for the example deployment and for E2E testing, because we cannot simply put the sections above in our .yaml files.

Is there a certain set of commands that can be used to look up these values? Or is this simply something that a cluster admin needs to know and provide as parameter to the deployment script?

@wongma7
Contributor Author

wongma7 commented Apr 1, 2019

I think it is up to the cluster admin to know and assign the meaning of categories (c0, c1) on their machines, then provide them. They would reserve categories for their CSI driver deployment and then enforce that other pods use other categories via PodSecurityPolicies.
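As a rough sketch of how an admin could pin ordinary workloads to a different level (the level value below is just a placeholder), a PodSecurityPolicy along these lines would do it:

apiVersion: policy/v1beta1
kind: PodSecurityPolicy
metadata:
  name: workload-selinux-level
spec:
  seLinux:
    rule: MustRunAs
    seLinuxOptions:
      level: "s0:c100,c200"   # placeholder; not the level reserved for the CSI driver
  runAsUser:
    rule: RunAsAny
  fsGroup:
    rule: RunAsAny
  supplementalGroups:
    rule: RunAsAny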

That covers SELinux levels (MLS/multi-level security), but what about SELinux types? On my system /var/lib/kubelet/plugins is system_u:object_r:var_lib_t:s0, so containers with system_u:object_r:container_t:s0 can't read anything there. So in this case the cluster admin must also relabel the socket file?

cc @jsafrane

@bertinatto
Contributor

That covers SELinux levels (MLS/multi-level security), but what about SELinux types? On my system /var/lib/kubelet/plugins is system_u:object_r:var_lib_t:s0, so containers with system_u:object_r:container_t:s0 can't read anything there.

I missed the fact that the driver creates the socket under /var/lib/kubelet. By default, new files inherit the type of their parent directories (even if the file was created by a --privileged container), so my solution above wouldn't work for this case.

So in this case the cluster admin must also relabel the socket file?

That's a possible solution, but it can be cumbersome, especially on worker nodes.

@bertinatto
Contributor

To summarize:

Regarding the provisioner and attacher, I believe they don't need to be privileged, because the driver shipped in the same pod (which should implement the Controller Service set of RPC calls) doesn't need to be privileged. (In kubernetes/kubernetes#69215 I also made the provisioner and attacher privileged because the same (privileged) driver object was used with all sidecars. To prevent that we should have different driver objects, each tailored to one of the 3 services/sidecars.)

As for the Node Service, the driver does need to run as privileged because it formats and mounts volumes. As a result, the socket file it exposes to the registrar has a SELinux context that's not accessible by non-privileged containers.

@bertinatto
Contributor

So my question is: do we really want to allow a non-privileged container to access the socket file created by the driver (Node Service)?

If we do, we might have a security problem, because:

  1. I could start a non-privileged container that accesses the socket file exposed by the (privileged) driver
  2. Connect to the RPC server
  3. And format a volume on the host

Whatever solution we find, we must make sure that we don't allow this to happen.

@jsafrane
Contributor

jsafrane commented Apr 2, 2019

I triple-checked on a SELinux machine.

  1. Attacher and provisioner need to be privileged to access the driver's socket. We can either:

    • Run attacher and provisioner as privileged. HostPath is the only driver that needs it and it's experimental anyway.
    • Run attacher and provisioner with the same SELinux context as privileged containers. The containers themselves are still not privileged: they can't mount() or do other privileged stuff; however, SELinux won't block access to /var/lib/kubelet. On RHEL/CentOS/Fedora, spc_t is the right context.
      kind: StatefulSet
      spec:
        template:
          spec:
            securityContext:
              seLinuxOptions:
                type: spc_t
            serviceAccountName: csi-provisioner
            containers:
      ...
      

    Since SELinux context names are distro specific, I'd prefer to just run all the containers as privileged (HostPath driver only!); a sketch of that option follows at the end of this comment.

  2. (Non-privileged) node registrar can access socket created by privileged driver in the same pod. They both run with system_u:system_r:spc_t:s0 context.

    • Beware, with SELinuxOptions: &v1.SELinuxOptions{ Level: "s0:c0,c1", }, the registrar runs as container_t:s0:c0,c1, while the driver runs as spc_t:s0, and the registrar can't read the driver's socket! So any SELinuxOptions is actually harmful in this case! I spent half a day debugging that.

Edit: tested with cri-o as container runtime.
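A minimal sketch of the first option for the hostpath deployment, simply marking the sidecar container privileged (provisioner shown; the attacher would be analogous):

kind: StatefulSet
spec:
  template:
    spec:
      serviceAccountName: csi-provisioner
      containers:
      - name: csi-provisioner
        securityContext:
          privileged: true
...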

@bertinatto
Contributor

bertinatto commented Apr 2, 2019

(Non-privileged) node registrar can access socket created by privileged driver in the same pod. They both run with system_u:system_r:spc_t:s0 context.

For the record, I just tested with:

  • Centos 7 host (SELinux enabled)
  • EBS CSI driver
    • privileged driver container
    • non-privileged node registrar container
  • docker/containerd runtime

The registrar was able to connect to the driver without problems. The non-privileged registrar used to be a problem, but I suppose the SELinux context of containers in the same pod was fixed at some point recently.
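For reference, a node-side pod along the lines of that test would look roughly like this (my sketch, not the exact manifest; images, args and the registration-dir volume are omitted, and the socket path is driver specific):

kind: DaemonSet
spec:
  template:
    spec:
      containers:
      - name: ebs-plugin                # CSI driver, privileged (formats and mounts)
        securityContext:
          privileged: true
        volumeMounts:
        - name: socket-dir
          mountPath: /csi
      - name: node-driver-registrar     # not privileged, reads the same socket
        volumeMounts:
        - name: socket-dir
          mountPath: /csi
      volumes:
      - name: socket-dir
        hostPath:
          path: /var/lib/kubelet/plugins/ebs.csi.aws.com/
          type: DirectoryOrCreate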

@pohly
Contributor

pohly commented Apr 2, 2019

Run attacher and provisioner as privileged. HostPath is the only driver that needs it and it's experimental anyway.

Why does only the hostpath driver need this? Because only this deployment runs the driver in a separate pod? There are other deployments which might do the same, for whatever reasons.

@jsafrane
Contributor

jsafrane commented Apr 2, 2019

Why does only the hostpath driver need this? Because only this deployment runs the driver in a separate pod? There are other deployments which might do the same, for whatever reasons.

Attacher + provisioner need to access a socket in /var/lib/kubelet/plugins/* on the host instead of EmptyDir. Any CSI driver that's going to use HostPath instead of EmptyDir will face the same issue.

Basically, SELinux does not like any HostPath volumes: we don't want processes that escaped their container messing up the host, even if they run as root. So either the admin (or a package) labels special directories as allowed to be used by containers, or the cluster admin runs these special pods with a special policy.

This special policy will be either distro specific or even cluster specific and is then hard to configure from an e2e test.

@pohly
Contributor

pohly commented Apr 2, 2019

So "HostPath is the only driver" isn't about the csi-driver-host-path? You meant the builtin hostpath storage driver?

@jsafrane
Contributor

jsafrane commented Apr 3, 2019

It's so confusing. It affects only csi-driver-host-path, because that's the only one that uses in-tree HostPath volume in attacher/provisioner to get to driver socket created by a privileged container in another pod.

@pohly
Contributor

pohly commented Apr 3, 2019

It affects only csi-driver-host-path, because that's the only one that uses in-tree HostPath volume in attacher/provisioner to get to driver socket created by a privileged container in another pod.

In other words, this:

volumes:
- hostPath:
    path: /var/lib/kubelet/plugins/csi-hostpath
    type: DirectoryOrCreate
  name: socket-dir

I can see how that is a bit special. Other CSI driver deployments probably have attacher/provisioner/driver all bundled up in a single pod.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 2, 2019
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Aug 1, 2019
@msau42
Collaborator

msau42 commented Aug 2, 2019

Can we put all the sidecars in the same pod as the driver? Bundling them all together is our recommended way, and the fact that our sample driver is not doing that is confusing to driver devs.
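A sketch of what that bundled layout could look like for the controller-side sidecars, sharing the socket over an emptyDir so no hostPath volume is needed (images and args omitted; for the Node Service the socket still has to live under /var/lib/kubelet/plugins/ so kubelet can reach it):

kind: StatefulSet
spec:
  template:
    spec:
      containers:
      - name: csi-provisioner
        volumeMounts:
        - name: socket-dir
          mountPath: /csi
      - name: csi-attacher
        volumeMounts:
        - name: socket-dir
          mountPath: /csi
      - name: hostpath-plugin
        securityContext:
          privileged: true
        volumeMounts:
        - name: socket-dir
          mountPath: /csi
      volumes:
      - name: socket-dir
        emptyDir: {}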

@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
