NVIDIA/Xorg segfaults with hardened profile #61489

ivan · 2019-05-14T11:48:29Z

Issue description

With the hardened profile imported, Xorg on NVIDIA always segfaults on startup with:

[    43.702] (EE) Backtrace:
[    43.703] (EE) 0: /nix/store/aq6widncxzcr8kndsrr42a9c61r39qll-xorg-server-1.20.4/bin/X (OsSigHandler+0x29) [0x59af79]
[    43.703] (EE) 1: /nix/store/xqs95fqkjb1kd102yjv5h5q57gcsafb3-glibc-2.27/lib/libpthread.so.0 (funlockfile+0x50) [0x7bdefc8598af]
[    43.704] (EE) 2: /nix/store/xqs95fqkjb1kd102yjv5h5q57gcsafb3-glibc-2.27/lib/libc.so.6 (__memmove_avx_unaligned_erms+0x211) [0x7bdefc7e3bf1]
[    43.705] (EE) 3: /nix/store/db8dmfz95lk5pfamvfl9sjg71wnilj7f-malloc-provider-graphene-hardened/lib/libhardened_malloc.so (realloc+0x35d) [0x7bdefced970d]
[    43.705] (EE) 4: /nix/store/aq6widncxzcr8kndsrr42a9c61r39qll-xorg-server-1.20.4/bin/X (XNFreallocarray+0x9) [0x59dc29]
[    43.705] (EE) 5: /nix/store/aq6widncxzcr8kndsrr42a9c61r39qll-xorg-server-1.20.4/bin/X (xf86AllocateEntityPrivateIndex+0x58) [0x475a38]
[    43.705] (EE) 6: /nix/store/xdj3r6d4ci73d65adh57hk6b03qg81lr-libglvnd-1.0.0/lib/libGLdispatch.so.0 (nvidiaAddDrawableHandler+0x4f72da) [0x721f8650e084]
[    43.705] (EE) 
[    43.705] (EE) Segmentation fault at address 0x723e5a4544c0

Steps to reproduce

Use nixpkgs master on a machine with an NVIDIA card.

In the system configuration:

Use

services.xserver = { videoDrivers = [ "nvidia" ]; };
hardware.opengl = { enable = true; driSupport = true; };
services.xserver = { enable = true; displayManager.startx.enable = true; };

Import <nixpkgs/nixos/modules/profiles/hardened.nix>
Keep the default environment.memoryAllocator.provider = "graphene-hardened"

startx and observe a segfault.

Technical details

Please run nix-shell -p nix-info --run "nix-info -m" and paste the
results.

system: "x86_64-linux"
host os: Linux 4.19.42-hardened, NixOS, 19.09.git.11851bd (Loris)
multi-user?: yes
sandbox: yes
version: nix-env (Nix) 2.2.2
channels(root): "nixos-18.09.1922.97e0d53d669"
nixpkgs: /nix/var/nix/profiles/per-user/root/channels/nixos

Maybe environment.memoryAllocator.provider = "libc"; should remain the default in hardened, or maybe graphene-hardened should be marked incompatible with nvidia?

The text was updated successfully, but these errors were encountered:

joachifm · 2019-05-14T13:22:40Z

In an ideal world, crashes caused by legitimate bugs (assuming this is one) would be fixed upstream, but I suppose that's out of the question in this case.

I prefer to keep the default as-is for now and improve docs instead, but I can see how it will be quite annoying for people when things start crashing for no obvious reason ... I'm open to changing the default, obviously, but I'd like to hold off on doing so just a little bit. (To elaborate: the hardened defaults are expected to break things, under the assumption that the admin relaxes settings to suit their specific needs).

delroth · 2019-05-30T13:08:46Z

I would suggest we repurpose this issue for general brokenness with the Graphene hardened allocator picked by default in the hardened profile.

According to #62238 (comment) it seems like the Graphene hardened allocator can't even run sshd properly. Nix also segfaults / runs out of memory when using that allocator. I think it's probably an OK choice for select applications but definitely not for systemwide use at this point.

delroth · 2019-05-30T13:15:43Z

(I'm also fine if you want to roll back #62238 temporarily to damage-control before making a decision on this.)

delroth · 2019-05-30T13:22:42Z

I will slightly amend my statement "can't even run sshd properly" -- the issue seems to be that openssh's seccomp sandbox does not allow mprotect, and Graphene's allocator makes copious use of that syscall (for guard pages). This could be addressed through disabling the seccomp sandbox or applying a custom patch.

joachifm · 2019-05-30T14:05:54Z

Another possibility is to add service specific overrides to the malloc module to paper over known incompatibilities.

delroth · 2019-05-30T14:08:31Z

Having to whitelist sshd from these hardening measures would be a pretty sad long term situation though.

Izorkin · 2019-05-30T19:23:06Z

Service opessh worked with memoryAllocator provider jemalloc and scudo.
With memoryAllocator provider scudo not worked services - nginx unit and dovecot2

joachifm · 2019-05-30T20:21:56Z

@Izorkin do you think those are fixable in a maintainable way? I'm not too concerned about Nix or proprietary gfx drivers not working, but if things like ssh, httpd &c break, then enabling this by default may be too much.

Izorkin · 2019-05-30T20:37:15Z

I do not have normal working ideas(

yotann · 2019-06-12T17:01:53Z

FWIW, I got the same crash with the open-source radeon driver. I solved it by changing environment.memoryAllocator.provider. The problem seems to be in xf86AllocateEntityPrivateIndex:

(EE) Backtrace:
(EE) 0: /nix/store/.../bin/X (OsSigHandler+0x29) [0x59aef9]
(EE) 1: /nix/store/.../lib/libpthread.so.0 (funlockfile+0x50) [0x7f4f176368af]
(EE) 2: /nix/store/.../lib/libc.so.6 (__memmove_avx_unaligned_erms+0x211) [0x7f4f175bebf1]
(EE) 3: /nix/store/.../lib/libhardened_malloc.so (realloc+0x35d) [0x7f4f17cb470d]
(EE) 4: /nix/store/.../bin/X (XNFreallocarray+0x9) [0x59dba9]
(EE) 5: /nix/store/.../bin/X (xf86AllocateEntityPrivateIndex+0x58) [0x4759a8]
(EE) 6: /nix/store/.../lib/xorg/modules/drivers/radeon_drv.so (radeon_platform_probe+0x185) [0x75f31c6fbe85]
(EE) 7: /nix/store/.../bin/X (doPlatformProbe+0x194) [0x49b3b4]
(EE) 8: /nix/store/.../bin/X (probeSingleDevice+0x9a) [0x49b4aa]
(EE) 9: /nix/store/.../bin/X (xf86platformProbeDev+0x18d) [0x49be8d]
(EE) 10: /nix/store/.../bin/X (xf86CallDriverProbe+0xa5) [0x474be5]
(EE) 11: /nix/store/.../bin/X (xf86BusConfig+0x45) [0x475505]
(EE) 12: /nix/store/.../bin/X (InitOutput+0x944) [0x482e04]
(EE) 13: /nix/store/.../bin/X (dix_main+0x18f) [0x4461ff]
(EE) 14: /nix/store/.../lib/libc.so.6 (__libc_start_main+0xee) [0x7f4f1748eb8e]
(EE) 15: /nix/store/.../bin/X (_start+0x2a) [0x43010a]
(EE) 
(EE) Segmentation fault at address 0x75ffd0c12890

This reverts commit 48ff4f1. Causes too much breakage to be enabled by default [1][2]. [1]: #61489 [2]: #65000

joachifm · 2019-08-21T21:31:20Z

For future reference regarding incompatibilities with hardened_malloc https://github.com/GrapheneOS/hardened_malloc/issues/89#issuecomment-522236117

ivan · 2019-09-01T10:14:14Z

#66687 was merged, so this should no longer happen.

joachifm · 2019-09-18T18:41:31Z

@delroth ssh should work with graphene after openssh/openssh-portable@f6906f9

joachifm self-assigned this May 14, 2019

joachifm mentioned this issue May 29, 2019

nixos: configurable system-wide malloc #60187

Merged

joachifm added this to the 19.09 milestone May 30, 2019

eadwu mentioned this issue Jul 2, 2019

Nix doesn't like the hardened profile #64078

Closed

joachifm mentioned this issue Aug 15, 2019

Revert "nixos/hardened: use graphene-hardened malloc by default" #66687

Merged

mmahut pushed a commit that referenced this issue Aug 19, 2019

Revert "nixos/hardened: use graphene-hardened malloc by default"

4ead3d2

This reverts commit 48ff4f1. Causes too much breakage to be enabled by default [1][2]. [1]: #61489 [2]: #65000

ivan closed this as completed Sep 1, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NVIDIA/Xorg segfaults with hardened profile #61489

NVIDIA/Xorg segfaults with hardened profile #61489

ivan commented May 14, 2019 •

edited

joachifm commented May 14, 2019

delroth commented May 30, 2019

delroth commented May 30, 2019

delroth commented May 30, 2019

joachifm commented May 30, 2019

delroth commented May 30, 2019

Izorkin commented May 30, 2019

joachifm commented May 30, 2019

Izorkin commented May 30, 2019

yotann commented Jun 12, 2019

joachifm commented Aug 21, 2019

ivan commented Sep 1, 2019

joachifm commented Sep 18, 2019

NVIDIA/Xorg segfaults with hardened profile #61489

NVIDIA/Xorg segfaults with hardened profile #61489

Comments

ivan commented May 14, 2019 • edited

Issue description

Steps to reproduce

Technical details

joachifm commented May 14, 2019

delroth commented May 30, 2019

delroth commented May 30, 2019

delroth commented May 30, 2019

joachifm commented May 30, 2019

delroth commented May 30, 2019

Izorkin commented May 30, 2019

joachifm commented May 30, 2019

Izorkin commented May 30, 2019

yotann commented Jun 12, 2019

joachifm commented Aug 21, 2019

ivan commented Sep 1, 2019

joachifm commented Sep 18, 2019

ivan commented May 14, 2019 •

edited