Support C++ exceptions for rtld-c18n using otypes. #2003

dstolfa · 2024-02-07T00:10:02Z

Opening this mainly to look for feedback. This should not be merged, especially not into CheriBSD directly. The PR is here so that the code can easily be built for testing. The approach is similar to the one taken by @dpgao, and the differences are summarized below.

This PR implements an initial version of DWARF unwinding for Morello with the c18n runtime linker. The implementation uses otypes, and therefore might not be compatible with RISC-V.

This implementation is under #ifdef _LIBUNWIND_SANDBOX_OTYPES, as we will likely want to explore different designs for this.

Since libunwind's unw_step() is a part of the public API, adding a new return code when a compartment boundary is encountered is not feasible as it would break third party consumers. Furthermore, I have tried to be careful about introducing any compile-time ABI changes outside of ones under __CHERI_PURE_CAPABILITY__ so that we don't need a secondary libgcc_s for c18n in CheriBSD. This all seems to work and doesn't seem to break any third party software in my testing, but there are probably still some edge cases to catch.
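To make the ABI concern concrete, here is a minimal sketch of how a typical third-party consumer drives a step function under the usual contract (positive means another frame, 0 means the end, negative means an error). It is a plain C++ model, not libunwind: `count_frames`, the scripted return values, and `kCompartmentBoundary` are all hypothetical names invented for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical extra return code for "compartment boundary reached".
// Nothing like this exists in libunwind; it is here only for illustration.
constexpr int kCompartmentBoundary = -100;

// A typical frame walker written against the documented contract:
// positive = stepped to another frame, 0 = no more frames, negative = error.
// `script` stands in for the values a step function would return on
// successive calls.
// Returns the number of frames walked, or -1 if the walk was abandoned.
int count_frames(const std::vector<int> &script) {
  std::size_t next = 0;
  auto fake_step = [&]() -> int {
    return next < script.size() ? script[next++] : 0;
  };

  int frames = 0;
  int rc;
  while ((rc = fake_step()) > 0)
    ++frames;
  return rc < 0 ? -1 : frames;
}
```

With the script `{1, 1, 1, 0}` the walker counts three frames. With `{1, kCompartmentBoundary, 1, 0}` it abandons the walk at the boundary, which is how an existing consumer would treat any new negative code. A new positive code would instead be counted as an ordinary frame.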

Because libunwind has to be thread-safe, and its main way of achieving that is to keep state in a context, the executive stack pointer was placed into the context. However, the pointer never leaves libunwind without being sealed. The same is true for all frame and callee-saved registers that are not already sealed, either with an otype or as sentries. Unfortunately, this still means that we might leak sentries in the context, and I do not have a good way of addressing that right now.
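The sealing policy described above can be sketched in plain C++. This is a toy model and not CHERI code: `ModelCap`, the otype values, and `export_context` are hypothetical, and a real implementation would use capability registers and seal instructions rather than a struct field.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Model values only; these are not the architectural encodings.
constexpr std::uint32_t kUnsealed = 0;
constexpr std::uint32_t kSentry = 1;
constexpr std::uint32_t kUnwindOtype = 42; // hypothetical otype for the unwinder

// Toy stand-in for a capability register.
struct ModelCap {
  std::uint64_t address;
  bool tag;            // true if this is a valid capability
  std::uint32_t otype; // kUnsealed, kSentry, or some other otype
};

// Seal a register before it leaves the unwinder. Only valid, unsealed
// capabilities can be sealed; anything else is passed through unchanged.
ModelCap seal_for_export(ModelCap c) {
  if (c.tag && c.otype == kUnsealed)
    c.otype = kUnwindOtype;
  return c;
}

// Apply the policy to a whole register set and report how many sentries
// remain visible afterwards (the leak mentioned in the text).
template <std::size_t N>
std::size_t export_context(std::array<ModelCap, N> &regs) {
  std::size_t visible_sentries = 0;
  for (auto &r : regs) {
    r = seal_for_export(r);
    if (r.tag && r.otype == kSentry)
      ++visible_sentries;
  }
  return visible_sentries;
}
```

In this model a sentry is already sealed, so it cannot be sealed again with the unwinder's otype and stays in the context as-is.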

We still call into the rtld to fetch and restore the executive stack pointer, but I don't see a way around this using this kind of design. Furthermore, the CHERI-specific defines are mostly there as a hack and should probably not live in libunwind.

dpgao · 2024-02-07T14:10:50Z

contrib/subrepo-cheri-libunwind/src/Registers.hpp

  return false;
 }

+#ifdef _LIBUNWIND_SANDBOX_OTYPES
+inline uintcap_t
+Registers_arm64::getSealedExecutiveStack(uintcap_t sealer) const {


This function does not seem to be used anywhere.

dpgao · 2024-02-07T14:13:35Z

contrib/subrepo-cheri-libunwind/src/Registers.hpp

@@ -1970,7 +2015,14 @@ inline void Registers_arm64::setRegister(int regNum, uintptr_t value) {
 #ifdef __CHERI_PURE_CAPABILITY__
  else if ((regNum >= UNW_ARM64_C0) && (regNum <= UNW_ARM64_C31))
    _registers.__x[regNum - UNW_ARM64_C0] = value;
-#endif
+#ifdef _LIBUNWIND_SANDBOX_OTYPES
+  else if (regNum == UNW_ARM64_ECSP) {


Should getRegister also support this new regNum?
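One way to keep the getter and setter from drifting apart is to route both through a single register-number mapping. The following is a hedged sketch in plain C++, not the `Registers_arm64` code: `ModelRegisters`, the `kModel*` numbering, and the flat `uint64_t` storage are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <stdexcept>

// Hypothetical register numbering for this sketch only; the real
// UNW_ARM64_* values are defined by libunwind and are not reproduced here.
enum ModelRegNum { kModelC0 = 0, kModelC31 = 31, kModelECSP = 32 };

class ModelRegisters {
public:
  bool validRegister(int regNum) const { return slotFor(regNum) >= 0; }

  void setRegister(int regNum, std::uint64_t value) {
    regs_[checkedSlot(regNum)] = value;
  }

  std::uint64_t getRegister(int regNum) const {
    return regs_[checkedSlot(regNum)];
  }

private:
  // Single source of truth for which register numbers are supported,
  // shared by validRegister, setRegister and getRegister.
  static int slotFor(int regNum) {
    if (regNum >= kModelC0 && regNum <= kModelC31)
      return regNum - kModelC0;
    if (regNum == kModelECSP)
      return 32;
    return -1;
  }

  static int checkedSlot(int regNum) {
    int slot = slotFor(regNum);
    if (slot < 0)
      throw std::out_of_range("unsupported register number");
    return slot;
  }

  std::uint64_t regs_[33] = {};
};
```

With this shape, adding a register number in one place makes it readable, writable, and valid at the same time.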

dpgao · 2024-02-07T14:19:41Z

libexec/rtld-elf/aarch64/rtld_c18n_asm.S

+	ldr	c2, sealer_unwbuf
+	ldr	x10, [csp]
+	scvalue	c1, csp, x10
+	cseal	c1, c1, c2


It seems that this is the only place where sealer_unwbuf is used in RTLD. Would it be a good idea to move sealer_unwbuf to libunwind and let it perform this cseal?

We can use linkage policy to ensure that only libunwind can call _rtld_unw_getcontext.

dpgao · 2024-02-07T14:22:53Z

contrib/subrepo-cheri-libunwind/src/AddressSpace.hpp

+  capability_t        getCapability(pint_t addr) { return get<capability_t>(addr); }
+#if defined(__CHERI_PURE_CAPABILITY__) && defined(_LIBUNWIND_SANDBOX_OTYPES)
+  static uintcap_t    getUnwindSealer();
+  capability_t        getSealedCapability(pint_t addr) {


This function does not seem to be used anywhere.

dstolfa · 2024-02-08T00:08:10Z

After some discussion with @dpgao, a few issues were identified:

The current _rtld_unw_setcontext doesn't actually unwind the restricted stack pointers correctly. This needs to be done in the RTLD itself, which requires a modification in how rtld-c18n treats the next trusted frame in the return trampoline (work in progress by @dpgao). Alternatively, _rtld_unw_setcontext could be renamed to _rtld_unw_resume and implemented as a tail call in libunwind. However, that raises some questions about the ABI of that interface and is probably better avoided in favor of the first solution.
In order to make the interfaces consistent, all the valid capabilities should always be sealed when leaving libunwind, and unsealed before we restore the register context and resume in the handler.
I've attempted to move the sealer into libunwind itself, however it fails to build because libunwind is not built with -fPIC and I can't access the sealer symbol from the assembly files. However, with the incoming changes, RTLD itself will expect to get a sealed executive stack pointer and will have to unseal it in order to keep the invariant of capabilities not leaving these interfaces unsealed.
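The invariant in the last two points (seal on the way out, and have the receiving side unseal and reject anything that is not sealed with the expected otype) can be sketched as follows. This is a plain C++ model under stated assumptions: `ModelCap`, `model_seal`, `model_unseal`, `model_rtld_resume`, and the otype value are hypothetical and do not correspond to the actual RTLD interface.

```cpp
#include <cassert>
#include <cstdint>

// Toy stand-in for a capability; not the architectural representation.
struct ModelCap {
  std::uint64_t address;
  bool tag;
  std::uint32_t otype;
};

constexpr std::uint32_t kUnsealed = 0;     // model value
constexpr std::uint32_t kUnwindOtype = 42; // hypothetical unwinder otype

// Sealing an invalid or already sealed capability yields an invalid result.
ModelCap model_seal(ModelCap c, std::uint32_t otype) {
  if (!c.tag || c.otype != kUnsealed)
    c.tag = false;
  else
    c.otype = otype;
  return c;
}

// Unsealing succeeds only for a valid capability sealed with `otype`.
bool model_unseal(const ModelCap &in, std::uint32_t otype, ModelCap &out) {
  if (!in.tag || in.otype != otype)
    return false;
  out = in;
  out.otype = kUnsealed;
  return true;
}

// Stand-in for the receiving side of the interface: it refuses to resume
// unless the executive stack pointer arrives sealed with the expected otype.
bool model_rtld_resume(const ModelCap &sealed_ecsp,
                       std::uint64_t &restored_sp) {
  ModelCap ecsp{};
  if (!model_unseal(sealed_ecsp, kUnwindOtype, ecsp))
    return false;
  restored_sp = ecsp.address;
  return true;
}
```

In this model, passing an unsealed pointer across the interface is rejected, so a caller cannot bypass the sealing step.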

dstolfa · 2024-02-21T21:24:04Z

Updated the review:

Correctly unwind the trusted frames in rtld
Make the use of otypes in libunwind itself optional under _LIBUNWIND_SANDBOX_HARDENED
Clean the code up and separate out the Morello-specific bits better.
Support the benchmark ABI.

Presently, there are some questions about what the name of the defines should really be, but this seems okay for now. There are also failing libunwind tests on Morello, but perhaps that is a separate PR as they were failing to begin with. This is still not properly tested outside of Morello and the update is mainly to get some high-level discussion going.

davidchisnall · 2024-03-20T11:00:17Z

Is there a design doc for this? A few things are not clear to me:

It looks as though the unwinder runs on the stack of the faulting compartment (and then on the stacks that you unwind through). What is the expected information leakage here?
What is the threat model with respect to corrupting the unwinder's state? This is just a heap allocation, and doesn't appear to be sealed, and is reachable from the stack during unwind.
The personality functions in the libraries will provide cleanup blocks to run. These will have data-dependent control flow and can corrupt anything reachable from the stack and have their control-flow influenced by anything that an attacker can corrupt.
I don't see any libcxxrt changes. I would expect __cxa_throw and friends to be modified to seal everything except the thrown object so that the C++ runtime's state can be isolated from the code being unwound.
In which context does the destructor run?

Presumably the exception object and its type information are trusted. This causes some problems because the throwing compartment is the one that initialises the object and provides the pointer to the destructor. If the object type is provided by another library then it's possible for the throwing library to construct an arbitrary object that the destructor will run on. This is basically a COOP gadget that will run in another compartment's context with the program in a state that cannot be reached by normal control flow. The security implications of this are not obvious to me. It may be no worse than calling destructors on local objects, it may be a compartment escape.
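The concern is easier to see with a small model of the bookkeeping involved. In the Itanium C++ ABI, `__cxa_throw` takes the thrown object, its type info, and a destructor pointer, all supplied by the throwing code. The sketch below models only that shape: `ModelExceptionHeader`, `model_release`, and `Widget` are hypothetical names, and this is not libcxxrt's actual data structure.

```cpp
#include <cassert>
#include <typeinfo>

// Model of what a C++ runtime records for an in-flight exception.
// Every field is provided by the code that throws.
struct ModelExceptionHeader {
  const std::type_info *type; // chosen by the thrower
  void (*destructor)(void *); // chosen by the thrower
  void *object;               // initialised by the thrower
};

// What the runtime does when the exception is finally released: it calls
// whatever destructor it was given, on whatever object it was given.
void model_release(const ModelExceptionHeader &h) {
  if (h.destructor)
    h.destructor(h.object);
}

// Stand-in for a type and destructor defined by some other library.
struct Widget {
  int *destroyed_count;
};

void widget_destructor(void *p) {
  ++*static_cast<Widget *>(p)->destroyed_count;
}
```

Nothing in this model checks that the object was really constructed as the type whose destructor is invoked, which is the gap the comment above points at.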

dstolfa · 2024-03-20T12:26:30Z

Thanks for the feedback!

Is there a design doc for this? A few things are not clear to me:

Not yet. We are still going through the design and trying to figure out the right way to approach the problem. I took this design as a starting point, effectively opting to treat libunwind as a sort of TCB, but I am by no means convinced that it is "the right way" to do things at the moment. I'm waiting on @dpgao to land a change to make unwinding easier, at which point the whole hash table bit should go away since we will no longer need to maintain any state there. After that, I'll write the design document and go through the design with more scrutiny.

It looks as though the unwinder runs on the stack of the faulting compartment (and then on the stacks that you unwind through). What is the expected information leakage here?

My information leakage concerns with this design have to do with the register context, notably callee-saved registers, restricted stack pointers from other compartments and even the executive stack pointer could easily leak out of the libunwind boundary and be accessible on the caller stack. The "hardening" happens by sealing anything that is a pointer and is not sealed already (and thus is unable to seal sentries with an otype), but I am fairly certain there are other security concerns here that we'll have to address.

What is the threat model with respect to corrupting the unwinder's state? This is just a heap allocation, and doesn't appear to be sealed, and is reachable from the stack during unwind.

Which one in particular? If you mean the simple hash map allocated as a part of CompartmentInfo, I believe that one should be sealed here:

#ifdef _LIBUNWIND_SANDBOX_HARDENED
    capability_t sealer = addressSpace.getUnwindSealer();
    if (sealer != addressSpace.to_capability_t(-1))
      stackTable = __builtin_cheri_seal(stackTable, sealer);
#endif

However when #2061 lands, this entire hash table should no longer be necessary as we won't need to read and write to the bottom of the restricted stack anymore. Perhaps you are talking about a different heap allocation and I'm misunderstanding which one?

The personality functions in the libraries will provide cleanup blocks to run. These will have data-dependent control flow and can corrupt anything reachable from the stack and have their control-flow influenced by anything that an attacker can corrupt.

This is very likely to be a problem and needs to be thought about further than I have at this stage. The attacker should not be able to corrupt any of the sealed registers themselves, but I'm not quite sure how to protect the context itself seeing as it lives on the caller stack.

I don't see any libcxxrt changes. I would expect __cxa_throw and friends to be modified to seal everything except the thrown object so that the C++ runtime's state can be isolated from the code being unwound.

Agreed. For some background, we thought that libunwind support was necessary to get the desktop stack running properly for a demo that is due next week (and we still aren't sure that it isn't) so the goal was simply to get it working as fast as possible in an experimental state. I took the view of isolating the changes to libunwind as much as possible, but this design is probably a far cry from the thing we want to eventually have.

In which context does the destructor run?

I'm not sure which destructor you mean here, but if it's the exception destructor, I think (@dpgao can confirm this) that it runs in the same compartment as libcxxrt. I believe that Dapeng was looking at handling function pointers in the near-ish future, at which point it should jump through the c18n rtld and be called in a different compartment (probably with supporting code on the libcxxrt end...?).

Presumably the exception object and its type information are trusted. This causes some problems because the throwing compartment is the one that initialises the object and provides the pointer to the destructor. If the object type is provided by another library then it's possible for the throwing library to construct an arbitrary object that the destructor will run on. This is basically a COOP gadget that will run in another compartment's context with the program in a state that cannot be reached by normal control flow. The security implications of this are not obvious to me. It may be no worse than calling destructors on local objects, it may be a compartment escape.

Agreed, and I would love to have a more detailed discussion about pretty much all of the above if you are available some time (ideally with at least @dpgao also being present, as he knows the rtld bits in detail). Even though this "works", it's just a starting point and likely requires more thought before we can claim any kind of security. If this does land for the demo, it would have to be documented as experimental with a disclaimer on security. FWIW, one design point I'm interested in is handling exceptions by having each compartment unwind itself up to the boundary and then re-raising the exception in the next compartment. It's unclear to me at this point how it would integrate with libunwind's public APIs, which is part of the reason why I went for this sort of design as a starting point.
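The "unwind to the boundary, then re-raise" idea can be sketched in standard C++ using `std::exception_ptr`. This only illustrates the control flow: `call_across_boundary`, `reraise_if_any`, and `Guard` are hypothetical, there are no real compartments here, and a real design would also have to decide how the exception object itself crosses the boundary.

```cpp
#include <cassert>
#include <exception>
#include <functional>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical boundary trampoline: the callee's own frames unwind normally,
// and the in-flight exception is captured at the boundary instead of letting
// the unwinder continue into the caller's frames.
std::exception_ptr call_across_boundary(const std::function<void()> &callee) {
  try {
    callee();
  } catch (...) {
    return std::current_exception();
  }
  return nullptr;
}

// Caller side: re-raise the captured exception in the caller's own context.
void reraise_if_any(const std::exception_ptr &pending) {
  if (pending)
    std::rethrow_exception(pending);
}

// Records when its scope is cleaned up, to make the unwind order visible.
struct Guard {
  std::vector<std::string> &log;
  std::string name;
  ~Guard() { log.push_back("cleanup " + name); }
};

std::vector<std::string> demo_unwind() {
  std::vector<std::string> log;
  try {
    Guard outer{log, "caller"};
    std::exception_ptr pending = call_across_boundary([&] {
      Guard inner{log, "callee"};
      throw std::runtime_error("boom");
    });
    log.push_back("boundary reached");
    reraise_if_any(pending);
  } catch (const std::runtime_error &e) {
    log.push_back(std::string("handled ") + e.what());
  }
  return log;
}
```

Calling `demo_unwind()` records the callee's cleanup first, then the boundary, then the caller's cleanup, and finally the handler, so each side only ever unwinds its own frames.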

dstolfa · 2024-03-21T01:25:51Z

Update the code to remove the hash table which is no longer necessary.

This commit pulls out the functionality necessary to implement stack unwinding in rtld into macros and implements longjmp in terms of them. Using these macros, this commit implements the functionality necessary to support exception handling for libunwind. Additionally, it adds a new otype which is reserved for the unwinding library to use.

dstolfa requested review from jrtc27, dpgao and bsdjhb February 7, 2024 00:10

dstolfa force-pushed the cppexcept_otypes branch 2 times, most recently from f2f29dd to 5f265bf Compare February 7, 2024 02:26

dpgao reviewed Feb 7, 2024

dstolfa changed the base branch from main to dev February 21, 2024 18:33

dstolfa force-pushed the cppexcept_otypes branch from 5f265bf to 52d9d7f Compare February 21, 2024 20:28

dstolfa force-pushed the cppexcept_otypes branch from 52d9d7f to 2005c18 Compare March 4, 2024 23:26

This was referenced Mar 4, 2024

c18n, libgcc_s: Support a c18n-aware libunwind. #2032

Merged

[libunwind] Support rtld-c18n as the runtime linker. CTSRD-CHERI/llvm-project#731

Open

dstolfa force-pushed the cppexcept_otypes branch from 2005c18 to 353318d Compare March 18, 2024 15:09

dstolfa force-pushed the cppexcept_otypes branch from 353318d to cbd89dd Compare March 21, 2024 01:25

dstolfa force-pushed the cppexcept_otypes branch 2 times, most recently from 43a348e to 5e60df9 Compare March 26, 2024 22:49

dstolfa added 2 commits March 27, 2024 16:11

libgcc_s: Add new rtld-c18n symbols and flags to libunwind.

15ec6ea

dstolfa force-pushed the cppexcept_otypes branch from 5e60df9 to 0124316 Compare March 27, 2024 19:01

dpgao and others added 3 commits April 2, 2024 13:38

c18n: Add get_trusted_frame macro

ae89653

c18n: Remove assembly wrappers for _rtld_{setjmp,longjmp,unw_*}

2542735

Instead, use the get_trusted_frame macro to obtain the trusted frame in C.

c18n: Allow _rtld_unw_resume to resume purecap binaries.

b9424e6

dstolfa force-pushed the cppexcept_otypes branch from 0124316 to 61813fe Compare April 2, 2024 16:07

dstolfa added 2 commits April 3, 2024 11:56

c18n: Add a libunwind policy.

17e70d2

This commit also fixes the missing unw_getcontext_unsealed in trusted symbols and moves libunwind symbols closer to the setjmp/longjmp ones.

libunwind: Import from LLVM repository.

cba61f2

dstolfa force-pushed the cppexcept_otypes branch from 61813fe to cba61f2 Compare April 3, 2024 11:10
