
Bias UDP send loop towards the shutdown signal #372

Merged
GeorgeHahn merged 3 commits into main from george/fix-udp-shutdown on Nov 2, 2022

Conversation

GeorgeHahn (Contributor)

What does this PR do?

We've observed an issue where the UDP generator will run forever, ignoring the shutdown signal. This PR biases the UDP generation task towards polling the shutdown signal to avoid this.
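
For context, a minimal sketch of the biased-select pattern, assuming a tokio watch channel stands in for lading's shutdown signal (the real shutdown type and loop body differ):

use tokio::{net::UdpSocket, sync::watch};

// Sketch only: `shutdown`, `sock`, and `block` are illustrative names.
// `biased;` makes select! poll branches in declaration order instead of at
// random, so the shutdown branch is checked on every iteration even though
// the send branch is effectively always ready.
async fn send_loop(
    sock: UdpSocket,
    block: &[u8],
    mut shutdown: watch::Receiver<bool>,
) -> std::io::Result<()> {
    loop {
        tokio::select! {
            biased;
            _ = shutdown.changed() => return Ok(()),
            res = sock.send(block) => {
                res?;
            }
        }
    }
}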

Motivation

Stop lading processes from spinning on UDP forever.

Related issues

N/A

Additional Notes

We may consider refactoring shutdown to abort async tasks rather than signaling and expecting cooperation.
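
As a sketch of what that could look like, assuming plain tokio task handles (the names below are hypothetical, not lading's current shutdown machinery):

// Hypothetical sketch: cancel the generator task via its JoinHandle
// instead of asking it to observe a signal cooperatively.
async fn abort_based_shutdown() {
    let generator = tokio::spawn(async {
        loop {
            // ... build and send a UDP block ...
            tokio::task::yield_now().await; // abort only lands at await points
        }
    });

    // On shutdown:
    generator.abort();
    // Awaiting the handle observes the cancellation.
    assert!(generator.await.unwrap_err().is_cancelled());
}

Worth noting: abort() only takes effect at an await point, so a loop that never yields (like the one this PR fixes) would defeat abort as well.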

@GeorgeHahn GeorgeHahn added the bug Something isn't working label Nov 1, 2022
@GeorgeHahn GeorgeHahn requested a review from a team November 1, 2022 04:10
GeorgeHahn (Contributor, Author)

This is pretty wild. Even rewritten as its own async task, the UDP sender causes other tasks on the runtime not to be polled. a699c70 works around the issue by adding an explicit yield point. This is on my radar to min-repro and file upstream after further investigation.


To repro with lading:

  • Set up a local lading target (I'm using an agent regression container)
  • Add a log output to the UDP send loop
  • Add an independent task that logs periodically (see the sketch after this list)
  • Test with & without the yield point
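
A hypothetical version of that periodic logging task (illustrative, not lading code); while the runtime is healthy it logs once a second, and it goes silent when the UDP loop starves the scheduler:

// Spawn alongside the generator under test.
tokio::spawn(async {
    let mut interval = tokio::time::interval(std::time::Duration::from_secs(1));
    loop {
        interval.tick().await;
        tracing::info!("heartbeat: runtime is still polling other tasks");
    }
});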

Here's the lading config I'm using:

generator:
  - udp:
      seed: [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53,
             59, 61, 67, 71, 73, 79, 83, 89, 97, 101, 103, 107, 109, 113, 127, 131]
      addr: "127.0.0.1:10000"
      variant: "json"
      bytes_per_second: "500 Mb"
      block_sizes: ["1Kb"]
      maximum_prebuild_cache_size_bytes: "256 Mb"

blackhole:
  - http:
      binding_addr: "127.0.0.1:9091"

blt (Collaborator) commented Nov 2, 2022

> Even rewritten as its own async task, the UDP sender causes other tasks on the runtime not to be polled. a699c70 works around the issue by adding an explicit yield point.

Dang. Getting upstream consideration of this would be excellent. I have wondered now and again whether we might profitably split lading's internals up so that each component runs on a single-threaded executor on top of an OS thread we manage from startup. We'd get better isolation, at least, for things like this. Non-trivial change, but it's crossed my mind more than once.

@@ -149,6 +149,7 @@ impl Udp {
                 "UDP packet too large (over 65507 B)"
             );
             if let Some(sock) = &connection {
+                tokio::task::yield_now().await;
blt (Collaborator)

I guess at-speed we always have bytes ready, considering the size of the max payload. Hrm.

GeorgeHahn (Contributor, Author)

What's wild to me is that we have cores - 1 tokio worker threads sitting idle while this is happening. This should be very interesting to min-repro and dig into.

My initial thought is that we only have one task active. Maybe tokio tries to squeeze in its scheduling at a low priority after workers go idle. That could lead to exactly this situation.
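
For what it's worth, a hypothetical shape for that minimal reproduction; whether it actually reproduces will depend on the tokio version and scheduler internals, so treat it as a starting point:

// Busy async loop whose await points are always immediately ready. Per the
// behavior observed in this PR, the heartbeat task can go unpolled until
// the send loop yields explicitly.
#[tokio::main]
async fn main() {
    tokio::spawn(async {
        loop {
            tokio::time::sleep(std::time::Duration::from_secs(1)).await;
            println!("heartbeat");
        }
    });

    let sock = tokio::net::UdpSocket::bind("127.0.0.1:0").await.unwrap();
    sock.connect("127.0.0.1:10000").await.unwrap();
    let buf = [0u8; 1024];
    loop {
        // tokio::task::yield_now().await; // the a699c70 workaround
        let _ = sock.send(&buf).await; // ignore ICMP-driven errors
    }
}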

@GeorgeHahn GeorgeHahn requested a review from blt November 2, 2022 16:10
GeorgeHahn (Contributor, Author)

@blt Sorry for the review re-request; I missed your comments earlier.

GeorgeHahn (Contributor, Author)

> Dang. Getting upstream consideration of this would be excellent. I have wondered now and again whether we might profitably split lading's internals up so that each component runs on a single-threaded executor on top of an OS thread we manage from startup. We'd get better isolation, at least, for things like this. Non-trivial change, but it's crossed my mind more than once.

This issue has me thinking about the same thing. Either a separate OS thread or another tokio runtime on its own pool of threads.

blt (Collaborator) commented Nov 2, 2022

> > Dang. Getting upstream consideration of this would be excellent. I have wondered now and again whether we might profitably split lading's internals up so that each component runs on a single-threaded executor on top of an OS thread we manage from startup. We'd get better isolation, at least, for things like this. Non-trivial change, but it's crossed my mind more than once.
>
> This issue has me thinking about the same thing. Either a separate OS thread or another tokio runtime on its own pool of threads.

There's a lot to be said for the performance of a shared-nothing program with its tasks broken down per thread, each running on a single OS thread, assuming we know the threads are dedicated to a busy bit of work.
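
One shape that idea could take, as a rough sketch (the spawn_component helper is hypothetical, not anything lading has today): each component's future runs on its own OS thread with a current-thread tokio runtime, so a non-yielding loop in one component cannot starve another component's tasks.

// Hypothetical helper: one OS thread plus one single-threaded runtime per
// component, trading work stealing for isolation.
fn spawn_component<F>(name: &str, fut: F) -> std::thread::JoinHandle<()>
where
    F: std::future::Future<Output = ()> + Send + 'static,
{
    std::thread::Builder::new()
        .name(name.to_owned())
        .spawn(move || {
            tokio::runtime::Builder::new_current_thread()
                .enable_all()
                .build()
                .expect("failed to build runtime")
                .block_on(fut);
        })
        .expect("failed to spawn thread")
}

// Usage: spawn_component("udp-generator", async move { /* send loop */ });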

@GeorgeHahn GeorgeHahn merged commit ac6c9e6 into main Nov 2, 2022
@GeorgeHahn GeorgeHahn deleted the george/fix-udp-shutdown branch November 2, 2022 16:29