Add a configuration option to skip the `lifo_slot` optimization #4051

guswynn · 2021-08-20T21:32:14Z

Motivation

In our RPC stack (mine and my coworkers'), server overload detection relies on being able to detect when request-processing throughput is stalled. Currently, we derive this metric interactively by injecting futures (via spawn() in our case) into the Runtime and measuring how long they take to begin execution. (This is a vague overview, and I can go into more detail if needed.)

However, this flow relies on 'mostly-FIFO' ordering of request handling.

While tokio does not explicitly provide a FIFO execution guarantee* for spawn()'d futures, the lifo_slot optimization path results in pathological behavior for this workload.

*I stand to be corrected, but there are two parts of the runtime that are not strictly FIFO, and they could be altered to be FIFO to varying degrees (though, if this PR is accepted, I would be interested in exploring both more concretely):

- When a local queue is filled, work is pulled off the front half of that queue and added to the global Inject queue. I can imagine a scheme where this is changed to the back half or something, but upon reading the queue module, I think this would be complex to implement.
- tokio's runtime is work-stealing, which I think regardless of implementation can't be made loosely FIFO. I was thinking rayon implemented this with spawn_fifo, but that appears to be per-thread FIFO? I admittedly don't know what I'm talking about here.
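To make the first point concrete, here is a minimal sketch of how overflowing a full local queue can reorder tasks. This is a toy model and not tokio's code: `Queues`, `LOCAL_CAPACITY`, and `execution_order` are hypothetical names, the capacity is tiny for illustration, and the worker is assumed to drain its local queue before looking at the global one.

```rust
use std::collections::VecDeque;

// Hypothetical capacity, kept tiny so the effect is easy to see.
const LOCAL_CAPACITY: usize = 4;

// Toy model of one worker's local queue plus the shared global queue.
struct Queues {
    local: VecDeque<u32>,
    global: VecDeque<u32>,
}

impl Queues {
    fn push(&mut self, task: u32) {
        if self.local.len() == LOCAL_CAPACITY {
            // Local queue is full: move its front (oldest) half to the
            // global queue, as described in the comment above.
            let half = LOCAL_CAPACITY / 2;
            self.global.extend(self.local.drain(..half));
        }
        self.local.push_back(task);
    }
}

// Spawns tasks 0..n in order, then runs them, preferring the local queue
// over the global one (an assumption of this sketch). Returns the order
// in which the tasks ran.
fn execution_order(n: u32) -> Vec<u32> {
    let mut q = Queues {
        local: VecDeque::new(),
        global: VecDeque::new(),
    };
    for task in 0..n {
        q.push(task);
    }
    let mut order = Vec::new();
    while let Some(task) = q.local.pop_front().or_else(|| q.global.pop_front()) {
        order.push(task);
    }
    order
}
```

In this model, `execution_order(6)` runs the two oldest tasks last, because they were moved to the global queue when the fifth task arrived. With four or fewer tasks nothing overflows and the order stays FIFO.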

We also believe that the LIFO optimization creates surprising behavior in systems that have similar expectations of 'mostly-FIFO-ness'. Given that the LIFO optimization is an optimization and not key to the functioning of the runtime, we believe it to be reasonable to empower users who find this behavior pathological for their workflows to disable it at runtime.
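As a rough illustration of the pathological case, the following toy model shows a probe task waiting behind a task that keeps rescheduling itself through a LIFO slot. It is a sketch under stated assumptions, not tokio's scheduler: `Worker`, `Task`, and `steps_until_probe` are hypothetical names, there is a single worker with no work-stealing, and any mitigations a real runtime may apply are deliberately left out.

```rust
use std::collections::VecDeque;

#[derive(Debug, Clone, Copy)]
enum Task {
    // A task that wakes itself (or a peer) every time it runs.
    PingPong,
    // The latency probe we would like to see run promptly.
    Probe,
}

// Toy single-threaded worker with an optional LIFO slot.
struct Worker {
    lifo_enabled: bool,
    lifo_slot: Option<Task>,
    run_queue: VecDeque<Task>,
}

impl Worker {
    fn new(lifo_enabled: bool) -> Self {
        Worker {
            lifo_enabled,
            lifo_slot: None,
            run_queue: VecDeque::new(),
        }
    }

    // Schedule a task woken by the task that is currently running.
    fn schedule(&mut self, task: Task) {
        if self.lifo_enabled {
            // The newest task takes the slot; a previous occupant, if any,
            // goes to the back of the FIFO run queue.
            if let Some(prev) = self.lifo_slot.replace(task) {
                self.run_queue.push_back(prev);
            }
        } else {
            self.run_queue.push_back(task);
        }
    }

    // The LIFO slot is always checked before the FIFO run queue.
    fn next_task(&mut self) -> Option<Task> {
        self.lifo_slot.take().or_else(|| self.run_queue.pop_front())
    }
}

// Returns how many tasks ran before the probe, or None if the probe did
// not get to run within `max_steps` steps.
fn steps_until_probe(lifo_enabled: bool, max_steps: usize) -> Option<usize> {
    let mut worker = Worker::new(lifo_enabled);
    worker.run_queue.push_back(Task::PingPong);
    worker.run_queue.push_back(Task::Probe);
    for step in 0..max_steps {
        match worker.next_task()? {
            Task::Probe => return Some(step),
            Task::PingPong => worker.schedule(Task::PingPong),
        }
    }
    None
}
```

With the slot disabled, the probe runs after a single other task. With it enabled, the self-waking task keeps reclaiming the slot and the probe never runs within the step limit, which is the kind of stall that breaks a 'mostly-FIFO' latency measurement.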

Solution

Change the runtime to allow `Core`s to skip the lifo_slot optimization, and allow this to be configured in the runtime Builder.

Darksonn · 2021-08-21T06:05:49Z

It sounds like adding an on_thread_park option to the builder as described in #3975 would be a better way of solving your problem?

guswynn · 2021-08-23T15:45:13Z

@Darksonn is the idea that the closure in on_thread_park would spawn a new task, and would be called each time a worker thread runs out of work? I am not sure I understand that issue.

Darksonn · 2021-08-23T16:11:33Z

The closure would be called every time the runtime is about to park. Whether or not you spawn a task in the closure is up to you. The reason I ask is because being able to know whether the runtime has recently parked could be one way to figure out how loaded the runtime is.

Matthias247 · 2021-08-28T04:05:40Z

> Currently, we derive this metric interactively by injecting futures (via spawn() in our case) into the Runtime and measuring how long they take to begin execution.

Won't the work on runtime metrics (#3845, #4043, #4074) be a reasonable way to do this?

And even without it: A yield() in a task is guaranteed not to use LIFO and can be [mis]used to measure executor latency.

Darksonn · 2021-09-15T09:44:55Z

I think we would rather avoid the complexity of making this configurable for now. It sounds like #4070 or use of tokio::task::yield_now() could achieve the same goal.

guswynn added 3 commits August 20, 2021 12:40

make lifo_slot configurable

15386b6

fmt

f6e1a0e

more fixes

3cb6cd2

Darksonn added the A-tokio (Area: The main tokio crate) and M-runtime (Module: tokio/runtime) labels Aug 21, 2021

Darksonn closed this Sep 15, 2021
