udev: add default group for sgx enclave access #18944

keszybz · 2021-03-09T18:15:28Z

This creates a "well known" for sgx_enclave ownership. By doing this here we
avoid the risk that various projects making use of the device will provide
similar-but-slightly-incompatible installation instructions, in particular
using different group names.

ACLs are actually a better approach to grant access to users, but not in all
cases, so we want to provide a standard group anyway.

Mode is 0o660, not 0o666 because this is very new code and distributions are
likely to not want to give full access to all users. This might change in the
future, but being conservative is a good default in the beginning.

Rules for /dev/sgx_provision will be provided by libsg-ae-pce:
intel/linux-sgx#678.

bluca

lgtm

poettering · 2021-03-10T14:13:32Z

hmm, what's the point of the SYMLINK+= part? who would use that? Can you explain?

jethrogb · 2021-03-10T14:22:21Z

Symlink looks like backcompat with previous out of tree kernel drivers. Now that we have a stable driver to build from I think it's better to force a “reset” of the ecosystem.

keszybz · 2021-03-10T14:31:18Z

@haitaohuang, ptal.

woju · 2021-03-10T15:02:57Z

Yeah, pretty much looks like it. For reference here's a table of (header, device path) and a comment for a particular we use for driver autodetection: https://github.com/oscarlab/graphene/blob/cea083122941c52b8e9b21dfd327364ffbc3f0cb/Pal/src/host/Linux-SGX/link-intel-driver.py#L8-L19. This particular symlink is aimed at DCAP driver, which historically closely followed the repeated patchsets (they even have a nice table: https://github.com/intel/SGXDataCenterAttestationPrimitives/tree/master/driver/linux#compatibility-with-intelr-sgx-psw-releases).

With graphene hat: either way works for us. Thank you!

jethrogb · 2021-03-10T15:14:06Z

@woju I think you missed one https://github.com/fortanix/rust-sgx/blob/master/sgxs-loaders/src/isgx/mod.rs#L443-L446

Closes systemd#18669. This creates a "well known" for sgx_enclave ownership. By doing this here we avoid the risk that various projects making use of the device will provide similar-but-slightly-incompatible installation instructions, in particular using different group names. ACLs are actually a better approach to grant access to users, but not in all cases, so we want to provide a standard group anyway. Mode is 0o660, not 0o666 because this is very new code and distributions are likely to not want to give full access to all users. This might change in the future, but being conservative is a good default in the beginning. Rules for /dev/sgx_provision will be provided by libsg-ae-pce: intel/linux-sgx#678.

keszybz · 2021-03-10T16:17:01Z

Symlink looks like backcompat with previous out of tree kernel drivers. Now that we have a stable driver to build from I think it's better to force a “reset” of the ecosystem.

I dropped the symlink again.

poettering · 2021-03-10T17:01:29Z

lgtm

mbiebl · 2021-03-11T19:13:20Z

Hm, why not use uaccess here? The less special system groups we have, the better.

keszybz · 2021-03-11T19:28:28Z

uaccess is about logged in users. The set of such users might overlap with the set of users who shall have access to this, but it doesn't have to.

mbiebl · 2021-03-11T19:39:54Z

Sure, but do we have a need/use-case for such a non-logged in user?
Wouldn't it be better, if we added the uaccess tag first, so stuff works OOTB without requiring explicit mangling of group memberships. If the need arises for the non-logged in user case, we can still add the explicit sgx group later.

My fear is, that sgx is very much a specialized use case requiring a certain hardware and software combination. So I don't see the need (yet) to add such a group to everyones system. Who knows if sgx is still a thing a couple of years down the line.
Getting rid of static system groups once introduced is hard.

keszybz · 2021-03-11T19:56:50Z

There isn't any particular reason to tie this to uaccess. If the local policy is to use uaccess, this is very simple to configure. But such policy clearly isn't universal or recommended by existing packages.

My main goal with merging this here is to provide a standardized experience on all systems. Introducing a group like this is really cheap, the whole patch is two lines. Intel cpus are very popular, so even if only some fraction of users can use this, this fraction can still be a lot of people. If we at some point decide that it's not useful any more, we can drop it again. Existing systems will keep the group, no harm done. This is certainly better then a proliferation of scripts and instructions to add the group in every project that wants to make use of it.

mbiebl · 2021-03-11T19:59:04Z

Well, no. Removing existing system groups isn't cheap. Once introduced it's usually hard to get rid of them, as you might have files/directories using that group.
It's still not sufficiently justified, why uaccess is not suitable here.

mbiebl · 2021-03-11T20:03:00Z

Well, no. Removing existing system groups isn't cheap.

Which is why we should be careful introducing new system groups...

bluca · 2021-03-11T20:52:15Z

I guess if someone wants to use uaccess, then anyone who does will be compatible with each other out of the box, as you don't need a specific keyword. With group access, you do need to use the exact same name, and that's where standardisation at the top level helps.

Also, I definitely see use cases for this feature in service-level software, which won't run from a logged in user, and thus won't benefit from uaccess, right?

mbiebl · 2021-03-11T20:59:38Z

Also, I definitely see use cases for this feature in service-level software

Such as?

mbiebl · 2021-03-11T21:00:45Z

Honestly, I'm quite puzzled. For years we have been advocating that static group member ship is probably not such a great idea.
Now we make a 180 u-turn?

poettering · 2021-03-11T21:01:23Z

uaccess doesn't sounds like an appropriate thing I'd say. That#s really more about desktopish stuff, and SGX is entirely generic stuff. I mean, I'd expect sshd do use it and http servers and such that need crypto, and there's really no relevance to the desktop there.

As you might have seen with the AMD SEV stuff I generally try to push away stuff that is too special purpose, and where there's another package that is pretty much "the" maintainer of that subsystem. But I was told that SGX doesn't really fit into that much, it's pretty generic stuff that is useful outside of the virtualization space, and apparently is supposed to be pretty commonly available in intel CPUs.

I mean, we have to draw the line somewhere: we have the group concept and it makes sense for system-level stuff like this. Hence we should use it. Because otherwise there's no point in the groups at all, if we never use them.

The other options are only to say "meh, not our problem, let people fight for device node ownership", or to open up the device node. I doubt this is really in our interest.

We try to gently push distros to come to a common groups database, and we can do this via the sysusers stuff pretty OK. Of course it's open source, people can patch it out downstream again.

mbiebl · 2021-03-11T21:32:08Z

backup services that want tape access. or bitcoin miners that want render access. or printer servers that want lp access. They all may run (at least partially) unprivileged, with access to select hw and nothing else.

huh? you missed the point apparently

mbiebl · 2021-03-11T21:36:42Z

So, the responses so far convinced me even more to revert this change. But I'm open to actual use cases.

poettering · 2021-03-11T21:40:42Z

Unless there is a proper justification for this group, it will be patched out in Debian/Ubuntu (which would defeat the point of making it "standard")

I am not sure what a "proper" justification is supposed to be, beyond what we already gave you: we add groups for device nodes for reasonably common/generic device types, where system components have a good reason to get access to (while others shall not) and which are accessed by multiple projects directly as opposed to having a single userspace subsystem owner. That's all there is to it.

of course, these criteria are not binary decision points — "common" and "generic" are subjective and a continous scale. we came to the conclusion that these criteria apply so we merged it. If you disagree with that, it's your freedom.

So for the next time (iirc we had the same discussion about the "render" nodes stuff), let's semi formalize things:

must be a driver class for reasonably generic and common hw
must have more than one consumer in userspace, as opposed to have a single "primary" subsystem package (which should own the rules/group in that case)
must be something select system services shall get access to. i.e. if only users are desktop "human" users, or when world accessible is OK too, this doesn't apply.

poettering · 2021-03-11T21:43:15Z

So, the responses so far convinced me even more to revert this change. But I'm open to actual use cases.

I mean, knock yourself out, but we listed plenty usecases, don't claim otherwise. You just decided to ignore or discount them as irrelevant.

poettering · 2021-03-11T21:44:34Z

huh? you missed the point apparently

then explain it to me!

bluca · 2021-03-11T21:47:34Z

So, the responses so far convinced me even more to revert this change. But I'm open to actual use cases.

I'm not sure what you mean by "actual use case" - if you mean software that today can use enclaves, then the various ssl libraries have been mentioned. For example, you can run apache or nginx with wolfssl with sgx support. You know perfectly well you don't want to run web servers as root. That's a use case as concrete as it could possibly be, available today.
Then again, the feature is brand new. More will come.

mbiebl · 2021-03-11T21:58:12Z

I mean, knock yourself out, but we listed plenty usecases, don't claim otherwise.

Not really, no. But thanks for being so constructive.

You just decided to ignore or discount them as irrelevant.

Hm, that feels familiar.

mbiebl · 2021-03-11T22:05:28Z

You know perfectly well you don't want to run web servers as root

The web servers I know start off as root and only drop privileges after they've setup everything (like binding to privileged ports)

bluca · 2021-03-11T22:07:29Z

You know perfectly well you don't want to run web servers as root

The web servers I know start off as root and only drop privileges after they've setup everything (like binding to privileged ports)

I'm not an expert but I don't think that works here, as access is required at any time

mbiebl · 2021-03-11T22:07:38Z

huh? you missed the point apparently

then explain it to me!

you are talking about tape recorders. Not sure what that is supposed to mean when I was asking for real use cases.

mbiebl · 2021-03-11T22:09:02Z

I'm not an expert but I don't think that works here, as access is required at any time

It's an fd that can be passed to the children, no?

poettering · 2021-03-11T22:09:07Z

@mbiebl i listed two more cases, and iirc backups in big data centers is still all tapes, they even build new tape drivers pretty regularly.

poettering · 2021-03-11T22:10:01Z

The web servers I know start off as root and only drop privileges after they've setup everything (like binding to privileged ports)

Yeah, on sysvinit this is how things worked, but we try to move people away from schemes like that and just drop privs for them.

mbiebl · 2021-03-11T22:11:05Z

Yeah, on sysvinit this is how things worked, but we try to move people away from schemes like that and just drop privs for them.

that's still true today, even under systemd, afaics.

mbiebl · 2021-03-11T22:14:33Z

@mbiebl i listed two more cases, and iirc backups in big data centers is still all tapes, they even build new tape drivers pretty regularly.

how tape recoders are related to sgx still eludes me.

jethrogb · 2021-03-11T22:39:38Z

Any network service is a candidate for use of SGX, which they may use to protect sensitive information such as TLS keys or application data. However, not every network service requires root.

bluca · 2021-03-11T22:44:45Z

I'm not an expert but I don't think that works here, as access is required at any time

It's an fd that can be passed to the children, no?

AFAIK none of the libraries/implementations work like that, and the nodes are accessed directly. Another real, existing use case is Graphene, and this is how it instructs users to set up:

https://graphene.readthedocs.io/en/latest/sgx-intro.html#linux-kernel-drivers

iow, exactly what this PR does, as suggested in the ticket by #18669 (comment)

woju · 2021-03-11T23:22:34Z

@mbiebl: We (Graphene) and Fortanix (@jethrogb's project) both use /dev/sgx_enclave interface (and there are other projects who don't speak up here). Both of the frameworks are (should be) perfectly fine to install alongside each other. What should each of us suggest to our users? 0666 is generally a bad advice, because some people prefer to restrict access to SGX for various reasons (auditability is one concern, other people don't trust anything remotely related to Intel ME, which SGX depends upon, ...). So we'd need 0660 with a common group, which is how such problems are traditionally solved, and those examples with render, dialup and even tape served to prove the point.

We aim for as easy installation as possible (optimally apt install && gpasswd -a), because many users are not proficient linux admins (Graphene is widely used in ML scenarios by people who are data scientists by trade). The lack of this common group would make packaging our projects for Debian/Ubuntu problematic.

poettering · 2021-03-13T12:53:09Z

As it appears Signal (the IM program) can do its crypto in SGX. Not sure if on linux yet though. See https://signal.org/blog/secure-value-recovery/

mbiebl · 2021-03-13T14:02:43Z

As it appears Signal (the IM program) can do its crypto in SGX. Not sure if on linux yet though. See https://signal.org/blog/secure-value-recovery/

Perfect example that uaccess would be a better choice here.

bluca · 2021-03-13T15:33:11Z

As it appears Signal (the IM program) can do its crypto in SGX. Not sure if on linux yet though. See https://signal.org/blog/secure-value-recovery/

Perfect example that uaccess would be a better choice here.

Uhm, how so? It is used server-side, not by the user app - ie, a practical example of the "securely handle customer data" that I mentioned the other day. From the article:

However, we can invert the traditional SGX relationship to run a secure enclave on the server. An SGX enclave on the server would enable a service to perform computations on encrypted client data without learning the content of the data or the result of the computation.

mbiebl · 2021-03-13T16:15:45Z

As it appears Signal (the IM program) can do its crypto in SGX. Not sure if on linux yet though. See https://signal.org/blog/secure-value-recovery/

Perfect example that uaccess would be a better choice here.

Uhm, how so? It is used server-side, not by the user app -

So "Signal (the IM program)" is not referring to the client?

bluca · 2021-03-13T17:56:01Z

As it appears Signal (the IM program) can do its crypto in SGX. Not sure if on linux yet though. See https://signal.org/blog/secure-value-recovery/

Perfect example that uaccess would be a better choice here.

Uhm, how so? It is used server-side, not by the user app -

So "Signal (the IM program)" is not referring to the client?

No, it refers to the server, it's explained in the blog post, it's quite an interesting read

haitaohuang · 2021-03-18T19:03:32Z

Thanks @keszybz for submitting the patch.
Sorry I'm late for responding (I was off grid for a few days).

I would still prefer 0666 as default:

it is the easiest for user to install enclave apps without worrying about joining any special groups.
Access to sgx_enclave node only gives user permission to run code inside enclave. ISA guarantees only subset of instructions available inside enclave, e.g. no syscall, cpuid, io, etc. In other words, enclaves can only do a subset of things that the user can already do without it.
Enclaves does consume special EPC memory which is limited . So we do need mitigate DOS attack on EPC. Maybe a specific group would be helpful in that regard but probably not adequate . In future kernel, EPC usage will be monitored and controlled by cgroups. Once cgroups are supported, the extra group is redundant for this purpose?

BTW, SGX itself does not depend on ME as some may deduce from some implementations for client use cases that enclaves need use service from ME, such as trusted counters.

bluca added good-to-merge/waiting-for-ci 👍 PR is good to merge, but CI hasn't passed at time of review. Please merge if you see CI has passed udev labels Mar 9, 2021

bluca approved these changes Mar 9, 2021

View reviewed changes

poettering mentioned this pull request Mar 10, 2021

Add default udev rules and group for sgx_* dev nodes for new SGX support in kernel 5.11 #18669

Closed

poettering added reviewed/needs-rework 🔨 PR has been reviewed and needs another round of reworks and removed good-to-merge/waiting-for-ci 👍 PR is good to merge, but CI hasn't passed at time of review. Please merge if you see CI has passed labels Mar 10, 2021

keszybz force-pushed the sgx-enclave branch from f324768 to 7f82f97 Compare March 10, 2021 16:16

keszybz added good-to-merge/waiting-for-ci 👍 PR is good to merge, but CI hasn't passed at time of review. Please merge if you see CI has passed and removed reviewed/needs-rework 🔨 PR has been reviewed and needs another round of reworks labels Mar 10, 2021

keszybz removed the good-to-merge/waiting-for-ci 👍 PR is good to merge, but CI hasn't passed at time of review. Please merge if you see CI has passed label Mar 10, 2021

keszybz merged commit c9c4899 into systemd:main Mar 10, 2021

keszybz deleted the sgx-enclave branch March 11, 2021 19:48

guillemj mentioned this pull request May 14, 2021

New sgx group seems overly generic and prone to collision #19610

Closed

woju mentioned this pull request Sep 9, 2021

Packaging gramineproject/gramine#9

Closed

24 tasks

This was referenced Jan 10, 2022

sgx-sdk, sgx-psw: improve samples NixOS/nixpkgs#153237

Merged

Rely on /dev/sgx_{enclave,provision} instead of /dev/sgx/{enclave,provision} intel/linux-sgx#772

Closed

rvolosatovs mentioned this pull request Aug 9, 2022

AMD SEV device node improvements #15593

Closed

woju mentioned this pull request Apr 4, 2024

[Docs] Add more documentation about SGX gramineproject/gramine#1827

Merged

udev: add default group for sgx enclave access #18944

udev: add default group for sgx enclave access #18944

Conversation

keszybz commented Mar 9, 2021

bluca left a comment

Choose a reason for hiding this comment

poettering commented Mar 10, 2021

jethrogb commented Mar 10, 2021

keszybz commented Mar 10, 2021

woju commented Mar 10, 2021

jethrogb commented Mar 10, 2021

keszybz commented Mar 10, 2021

poettering commented Mar 10, 2021

mbiebl commented Mar 11, 2021 • edited

keszybz commented Mar 11, 2021

mbiebl commented Mar 11, 2021 • edited

keszybz commented Mar 11, 2021

mbiebl commented Mar 11, 2021

mbiebl commented Mar 11, 2021

bluca commented Mar 11, 2021

mbiebl commented Mar 11, 2021

mbiebl commented Mar 11, 2021

poettering commented Mar 11, 2021

mbiebl commented Mar 11, 2021 • edited

mbiebl commented Mar 11, 2021

poettering commented Mar 11, 2021

poettering commented Mar 11, 2021

poettering commented Mar 11, 2021

bluca commented Mar 11, 2021

mbiebl commented Mar 11, 2021

mbiebl commented Mar 11, 2021

bluca commented Mar 11, 2021

mbiebl commented Mar 11, 2021

mbiebl commented Mar 11, 2021

poettering commented Mar 11, 2021

poettering commented Mar 11, 2021

mbiebl commented Mar 11, 2021

mbiebl commented Mar 11, 2021 • edited

jethrogb commented Mar 11, 2021

bluca commented Mar 11, 2021

woju commented Mar 11, 2021

poettering commented Mar 13, 2021

mbiebl commented Mar 13, 2021

bluca commented Mar 13, 2021

mbiebl commented Mar 13, 2021

bluca commented Mar 13, 2021

haitaohuang commented Mar 18, 2021

mbiebl commented Mar 11, 2021 •

edited

mbiebl commented Mar 11, 2021 •

edited

mbiebl commented Mar 11, 2021 •

edited

mbiebl commented Mar 11, 2021 •

edited