consult dlerror() only if a dl*() call fails #74469

jclulow · 2020-07-18T01:40:31Z

The string returned from dlerror() is purely diagnostic and should not
itself be used to determine whether a previous call to dlopen() or
dlsym() has failed. Those functions are documented with specific return
values that signal failure; i.e., returning NULL.

If we assume a non-NULL return from dlerror() means the prior dlsym()
call failed, we are vulnerable to a race with another thread outside of
Rust control concurrently inducing dynamic linking operations. This
manifests on illumos systems with an intermittent spurious failure from
rustc:

error: ld.so.1: rustc: fatal: _ex_unwind: can't find symbol

The illumos libc checks for the existence of an "_ex_unwind" symbol via
dlsym() under some conditions when a thread exits, as part of an old
contract with a particular C++ standard library. If another thread
exits at the same time that rustc is attempting to load a plugin, we can
hit this race and report an error that does not belong to us.

The string returned from dlerror() is purely diagnostic and should not itself be used to determine whether a previous call to dlopen() or dlsym() has failed. Those functions are documented with specific return values that signal failure; i.e., returning NULL. If we assume a non-NULL return from dlerror() means the prior dlsym() call failed, we are vulnerable to a race with another thread outside of Rust control concurrently inducing dynamic linking operations. This manifests on illumos systems with an intermittent spurious failure from rustc: error: ld.so.1: rustc: fatal: _ex_unwind: can't find symbol The illumos libc checks for the existence of an "_ex_unwind" symbol via dlsym() under some conditions when a thread exits, as part of an old contract with a particular C++ standard library. If another thread exits at the same time that rustc is attempting to load a plugin, we can hit this race and report an error that does not belong to us.

rust-highfive · 2020-07-18T01:40:34Z

r? @ecstatic-morse

(rust_highfive has picked a reviewer for you, use r? to override)

tesuji · 2020-07-18T05:27:18Z

src/librustc_metadata/dynamic_lib.rs

+                    let s = CStr::from_ptr(last_error).to_bytes();
+                    Err(str::from_utf8(s).unwrap().to_owned())


Suggested change

let s = CStr::from_ptr(last_error).to_bytes();

Err(str::from_utf8(s).unwrap().to_owned())

let s = CStr::from_ptr(last_error).to_str().unwrap();

Err(s.to_owned())

ollie27 · 2020-07-18T15:16:01Z

src/librustc_metadata/dynamic_lib.rs

+            // dlerror reports the most recent failure that occured during a
+            // dynamic linking operation and then clears that error; we call
+            // once in advance of our operation in an attempt to discard any
+            // stale prior error report that may exist:
            let _old_error = libc::dlerror();


Is this call still needed? Surely any prior error will be replaced if there's a new error.

As @ollie27 said, there's no need to do this anymore if we don't use the return value of dlerror to determine whether an error occurred.

ecstatic-morse · 2020-07-19T19:05:15Z

src/librustc_metadata/dynamic_lib.rs

+            if ptr::null() != result {
                Ok(result)
            } else {


Since the else block now has a condition inside, could you switch to an early return for the happy path?

ecstatic-morse · 2020-07-19T19:05:20Z

src/librustc_metadata/dynamic_lib.rs

+            // We should only check dlerror() in the event that the operation
+            // fails, which we determine by checking for a NULL return.  This
+            // covers at least dlopen() and dlsym().


Can you document these semantics at the function level? Specifically, if f returns a null pointer, this function returns Err with the string in dlerror.

Also, just to be sure, do all the functions we pass to this helper return NULL and only NULL to indicate an error? There's no (void *) 1 weirdness or something?

For dlsym at least, the current approach is explicitly recommended on linux and seems to be necessary on illumos as well, since NULL can indicate either a "symbol not found" error or a found symbol with the value NULL. We should be checking the return value of dlopen, but we will need to find a different workaround here.

ecstatic-morse · 2020-07-19T19:07:57Z

~~r=me with nits addressed. I don't think you need to change the existing CStr conversion, although to_string_lossy would be more appropriate here.~~

The current approach is specifically mandated for dlsym, so we need to keep using it. See above.

Dylan-DPC-zz · 2020-08-07T00:54:58Z

@jclulow closing this due to inactivity. When you have the time, you can submit a new pr that works in a way that addresses the above concerns. Thanks for taking the time to contribute

This works around behavior observed on illumos in rust-lang#74469, in which foreign code (libc according to the OP) was racing with rustc to check `dlerror`.

Refactor dynamic library error checking on *nix The old code was checking `dlerror` more often than necessary, since (unlike `dlsym`) checking the return value of [`dlopen`](https://www.man7.org/linux/man-pages/man3/dlopen.3.html) is enough to indicate whether an error occurred. In the first commit, I've refactored the code to minimize the number of system calls needed. It should be strictly better than the old version. The second commit is an optional addendum which fixes the issue observed on illumos in rust-lang#74469, a PR I reviewed that was ultimately closed due to inactivity. I'm not sure how hard we try to work around platform-specific bugs like this, and I believe that, due to the way that `dlerror` is specified in the POSIX standard, libc implementations that want to run on conforming systems cannot call `dlsym` in multi-threaded programs.

rust-highfive assigned ecstatic-morse Jul 18, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jul 18, 2020

tesuji reviewed Jul 18, 2020

View reviewed changes

ollie27 reviewed Jul 18, 2020

View reviewed changes

ecstatic-morse reviewed Jul 19, 2020

View reviewed changes

Muirrum added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Aug 6, 2020

Dylan-DPC-zz closed this Aug 7, 2020

Dylan-DPC-zz added S-inactive Status: Inactive and waiting on the author. This is often applied to closed PRs. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Aug 7, 2020

ecstatic-morse mentioned this pull request Aug 22, 2020

Refactor dynamic library error checking on *nix #75811

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

consult dlerror() only if a dl*() call fails #74469

consult dlerror() only if a dl*() call fails #74469

jclulow commented Jul 18, 2020

rust-highfive commented Jul 18, 2020

tesuji Jul 18, 2020

ollie27 Jul 18, 2020

ecstatic-morse Jul 19, 2020

ecstatic-morse Jul 19, 2020

ecstatic-morse Jul 19, 2020

ecstatic-morse Jul 19, 2020 •

edited

ecstatic-morse commented Jul 19, 2020 •

edited

Dylan-DPC-zz commented Aug 7, 2020

		let s = CStr::from_ptr(last_error).to_bytes();
		Err(str::from_utf8(s).unwrap().to_owned())

consult dlerror() only if a dl*() call fails #74469

consult dlerror() only if a dl*() call fails #74469

Conversation

jclulow commented Jul 18, 2020

rust-highfive commented Jul 18, 2020

tesuji Jul 18, 2020

Choose a reason for hiding this comment

ollie27 Jul 18, 2020

Choose a reason for hiding this comment

ecstatic-morse Jul 19, 2020

Choose a reason for hiding this comment

ecstatic-morse Jul 19, 2020

Choose a reason for hiding this comment

ecstatic-morse Jul 19, 2020

Choose a reason for hiding this comment

ecstatic-morse Jul 19, 2020 • edited

Choose a reason for hiding this comment

ecstatic-morse commented Jul 19, 2020 • edited

Dylan-DPC-zz commented Aug 7, 2020

ecstatic-morse Jul 19, 2020 •

edited

ecstatic-morse commented Jul 19, 2020 •

edited