
rt: cleanup and simplify scheduler (scheduler v2.5) #2273

Merged 68 commits into master from scheduler-2-5 on Mar 5, 2020
Conversation

@carllerche (Member) commented Feb 25, 2020:

A refactor of the scheduler internals focusing on simplifying and
reducing unsafety. There are no fundamental logic changes. That said,
I recommend reading the new files themselves and not the diff.

  • The state transitions of the core task component are refined and
    reduced (see the illustrative sketch after this list).
  • basic_scheduler has (almost) all unsafety removed.
  • local_set has most unsafety removed.
  • threaded_scheduler limits unsafety to its queue implementation.
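
To make the first bullet concrete, the sketch below shows the general shape of an atomic-bitfield task state machine in which every transition is a single compare-exchange loop. The bit names and transition rules here are purely illustrative and are not tokio's actual definitions.

use std::sync::atomic::{AtomicUsize, Ordering::{AcqRel, Acquire}};

// Illustrative state bits; tokio's real task state uses its own layout.
const RUNNING: usize = 0b001;
const COMPLETE: usize = 0b010;
const NOTIFIED: usize = 0b100;

struct State(AtomicUsize);

impl State {
    /// Attempt the "notified, idle" -> "running" transition.
    /// Fails if the task is already being polled or has completed.
    fn transition_to_running(&self) -> bool {
        let mut curr = self.0.load(Acquire);
        loop {
            if curr & (RUNNING | COMPLETE) != 0 {
                return false;
            }
            let next = (curr | RUNNING) & !NOTIFIED;
            match self.0.compare_exchange(curr, next, AcqRel, Acquire) {
                Ok(_) => return true,
                Err(actual) => curr = actual,
            }
        }
    }
}

Fewer reachable states and single-CAS transitions of this kind are the sort of refinement that makes the remaining unsafe code easier to audit.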

@carllerche changed the title from "rt: cleanup and simplify scheduler" to "rt: cleanup and simplify scheduler (scheduler v2.5)" on Feb 25, 2020
@carllerche (Member, Author) commented:

cc @tmiasko, this is the cleanup I mentioned before. If you can help w/ CI configuration, I would be happy to set this up w/ TSAN as long as we can remove false positives. You mentioned you had a strategy to do so?

@carllerche (Member, Author) commented:

And I only ran with MAX_PREEMPTIONS=1 locally before the final push because it takes too long 😢
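
(For context: this presumably refers to loom's preemption bound, LOOM_MAX_PREEMPTIONS in current loom, which caps how many forced preemptions any single execution may contain; lowering it prunes the interleavings the model checker explores, trading exhaustiveness for runtime. Below is a minimal sketch assuming the loom crate's model::Builder API; the test body is illustrative, not an actual tokio test.)

// Run a loom model with a bounded number of preemptions.
#[cfg(loom)]
#[test]
fn bounded_exploration() {
    use loom::sync::atomic::{AtomicUsize, Ordering};
    use loom::sync::Arc;
    use loom::thread;

    let mut builder = loom::model::Builder::new();
    // Roughly what running with a max-preemptions setting of 1 does:
    // only schedules with at most one forced preemption are explored.
    builder.preemption_bound = Some(1);

    builder.check(|| {
        let v = Arc::new(AtomicUsize::new(0));
        let v2 = v.clone();

        let th = thread::spawn(move || {
            v2.fetch_add(1, Ordering::SeqCst);
        });
        v.fetch_add(1, Ordering::SeqCst);
        th.join().unwrap();

        assert_eq!(2, v.load(Ordering::SeqCst));
    });
}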

@tmiasko (Contributor) commented Feb 25, 2020:

Regarding the thread sanitizer, the first step would be to remove the uses of
atomic fences in Tokio. Now there is only one left so this shouldn't be an
issue. Then I use thread sanitizer as follows:

# Install the source code of the standard library.
$ rustup component add rust-src

# Replace fences with atomic loads in Arc implementation.
$ patch ~/.rustup/toolchains/nightly-x86_64-unknown-linux-gnu/lib/rustlib/src/rust/src/liballoc/sync.rs <<EOF
405c405
<         atomic::fence(Acquire);
---
>         this.inner().strong.load(Acquire);
742c742
<             atomic::fence(Acquire);
---
>             self.inner().weak.load(Acquire);
1246c1246
<         atomic::fence(Acquire);
---
>         self.inner().strong.load(Acquire);
1704c1704
<             atomic::fence(Acquire);
---
>             inner.weak.load(Acquire);
EOF

# Enable thread sanitizer.
$ export RUSTFLAGS=-Zsanitizer=thread RUSTDOCFLAGS=-Zsanitizer=thread

# Run tests with instrumented standard library.
$ cargo -Zbuild-std test --workspace --target x86_64-unknown-linux-gnu
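
For reference, a minimal sketch of the rewrite the patch above performs (not the actual liballoc code): ThreadSanitizer does not understand standalone fences, so the Acquire fence that pairs with the Release reference-count decrement is replaced by an Acquire load of the same atomic, which establishes the same happens-before edge for this pattern.

use std::sync::atomic::{AtomicUsize, Ordering::{Acquire, Release}};

// Illustrative refcount; the real code lives in Arc's Drop implementations.
struct RefCount {
    strong: AtomicUsize,
}

impl RefCount {
    /// Returns true when the last reference was just released and the
    /// payload may be dropped.
    fn release(&self) -> bool {
        if self.strong.fetch_sub(1, Release) != 1 {
            return false;
        }
        // Original: std::sync::atomic::fence(Acquire);
        // TSAN-friendly replacement, as in the patch above:
        let _ = self.strong.load(Acquire);
        true
    }
}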

BTW, it looks like there is a data race on the scheduler field: the task is
bound to the scheduler only after the future returns Pending, but by that time
it is already possible to use the associated waker, which reads the same field.

@carllerche (Member, Author) commented:

@tmiasko Yes re: race. I failed to run the full suite locally before pushing after making a tweak... working on a fix now.


/// Create a new local run-queue
pub(super) fn local<T: 'static>() -> (Steal<T>, Local<T>) {
debug_assert!(LOCAL_QUEUE_CAPACITY >= 2 && LOCAL_QUEUE_CAPACITY.is_power_of_two());
Sponsor Contributor:
Is there a way for us to make this a static assertion instead?

@carllerche (Member, Author):
do you have thoughts on how?

@carllerche (Member, Author):
Seems like hax 😆 you want me to lift that macro?
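
For reference, a hedged sketch of how the check could be expressed at compile time: assert! in const items was stabilized in Rust 1.57, well after this PR, so at the time a macro (e.g. from the static_assertions crate) would have been the usual route. The capacity value below is illustrative.

// Compile-time version of the debug_assert! above (requires Rust 1.57+).
const LOCAL_QUEUE_CAPACITY: usize = 256; // illustrative value

const _: () = assert!(
    LOCAL_QUEUE_CAPACITY >= 2 && LOCAL_QUEUE_CAPACITY.is_power_of_two(),
    "LOCAL_QUEUE_CAPACITY must be a power of two and at least 2",
);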

@hawkw (Member) left a comment:
i would still like to spend more time reading over this change, but i don't see anything that i want to block merging it over.

tokio/src/runtime/task/core.rs:
/// requires ensuring mutual exclusion between any concurrent thread that
/// might modify the future or output field.
///
/// The mutual exclusion is implemented by `Harness` and the `Lifecycle`
Member:
nit: do we still call this lifecycle, or is it called stage now?

@carllerche merged commit a78b1c6 into master on Mar 5, 2020
sthagen added a commit to sthagen/tokio-rs-tokio referencing this pull request on Mar 5, 2020: "rt: cleanup and simplify scheduler (scheduler v2.5) (tokio-rs#2273)"
@Darksonn deleted the scheduler-2-5 branch on April 12, 2020