Turn on queuing by default #7213

Closed
11 tasks done
gjoseph92 opened this issue Oct 28, 2022 · 6 comments · Fixed by #7279

@gjoseph92 (Collaborator) commented Oct 28, 2022

We currently intend to turn on scheduler-side queuing by default. This will give much better cluster memory use in most cases, and help users avoid the painful "running out of memory on simple workloads" experience (#2602).

It does slow things down, but the increased stability and better out-of-the-box user experience seem worthwhile.

Specifically, this means changing the default worker-saturation value from inf to something like 1.0 or 1.1. Advanced users who need to keep the old behavior can get it back by changing this config, which will be documented in #7203.
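For reference, a minimal sketch of what opting back into the old behavior might look like, assuming the setting is exposed as the `distributed.scheduler.worker-saturation` config key (the docs work tracked in #7203 would be authoritative):

```python
# Hedged sketch: override the queuing config before creating the cluster so the
# scheduler starts with the old, unbounded behavior. The key name and the exact
# spelling of "unbounded" (float("inf") vs the string "inf") are assumptions here.
import dask
from distributed import Client, LocalCluster

dask.config.set({"distributed.scheduler.worker-saturation": float("inf")})

cluster = LocalCluster()      # scheduler picks up the config at creation time
client = Client(cluster)
```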

TODO:

@gjoseph92 (Collaborator, Author)

A key question: what should the default worker-saturation value be?

Previously, we'd discussed using 1.1. Due to round-up, this means that small workers would usually have 1 or 2 extra root tasks in memory; big workers would have more. So we'd still have a little oversaturation, but with a bound on it. The motivation for this, instead of plain 1.0, was that for very fast tasks on slow networks, it would mitigate the scheduler<->worker delay.
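
To make the round-up concrete, here's an illustrative calculation, assuming the per-worker limit works out to `ceil(nthreads * worker_saturation)` (which is what the rounding behavior described above implies; the exact formula in the scheduler may differ):

```python
# Illustrative only: how many root tasks a worker could hold under a given
# worker-saturation value, assuming the limit is ceil(nthreads * saturation).
import math

def task_limit(nthreads: int, saturation: float) -> int:
    return max(1, math.ceil(nthreads * saturation))

for nthreads in (1, 2, 4, 64):
    for sat in (1.0, 1.1):
        limit = task_limit(nthreads, sat)
        print(f"{nthreads:>2} threads @ {sat}: limit {limit} ({limit - nthreads} extra)")

# Under 1.1, a 2-thread worker gets ceil(2.2) = 3 slots (1 extra task),
# while a 64-thread worker gets ceil(70.4) = 71 slots (7 extra).
```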

After re-running some benchmarks, I'm personally still leaning towards 1.0.

Comparing non-1.0* to 1.0, using a value >1.0 does not give a definitive decrease in runtime. In many tests, the difference is not statistically significant; it doesn't show up above the normal variation. In the cases where >1.0 is definitively faster, the gain is small, in the 2-8% range.

[chart: runtime comparison, worker-saturation >1.0 vs 1.0]

But >1.0 unquestionably hurts memory usage: we see up to 30% higher memory usage than with 1.0, and every benchmark shows an increase.

[chart: memory usage comparison, worker-saturation >1.0 vs 1.0]

Comparing those two charts, memory use shows a strong signal that >1.0 is worse, but runtime does not show much signal that >1.0 is better.

Of course, these benchmarks were only run with one cluster configuration (10 workers, each with 2 CPUs and 8 GiB of memory). We haven't tested how more or larger workers affect this.

The main motivation for >1.0 was: "what about super slow corporate networks, or workers in multiple regions, etc.?"

  1. Tiny, fast tasks are not good practice in Dask to begin with. If your task runtime is on the same order as network latency plus scheduler overhead, you should be making your tasks bigger. In these cases, Dask overhead will be a problem regardless of queuing for everything but the most basic client.map.

    Yes, client.map with instantaneous tasks is 15% slower at 1.0 vs 1.1. But client.map with realistic tasks (that take ~1s) is only 1% slower (see the sketch after this list).

  2. This doesn't seem like a particularly common workload, and it's a rather niche case to penalize novice users in typical situations for.
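
For context, a hedged sketch of the two kinds of client.map workloads being compared above (illustrative task sizes, not the actual benchmark code):

```python
# Illustrative sketch: instantaneous tasks, where per-task scheduling overhead
# and any extra scheduler<->worker round trip are visible, vs ~1 s tasks,
# where that overhead is negligible. Not the actual benchmark.
import time
from distributed import Client

client = Client()  # assumes an existing cluster

def instantaneous(i):
    return i  # runtime ~ network latency + scheduler overhead

def realistic(i):
    time.sleep(1)  # ~1 s of real work dwarfs per-task coordination costs
    return i

client.gather(client.map(instantaneous, range(10_000)))
client.gather(client.map(realistic, range(500)))
```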

All this said: from a theoretical standpoint, it does make sense that 1.1 would be somewhat faster than 1.0, so I'm surprised that we don't see a clearer runtime difference.

* I used 1.2 in this case, which due to worker size was equivalent to 1.1—either one meant 1 extra task per worker.

@crusaderky (Collaborator)

Do we have a measure of the scheduler<->worker RTT on Coiled?
CC @ntabris

@crusaderky (Collaborator)

> what about super slow corporate networks

This is wrongly posed. What about corporate networks that were set up ~5 years ago and do not receive the amount of constant attention that AWS does?

@ntabris (Contributor) commented Oct 28, 2022

> Do we have a measure of the scheduler<->worker RTT on Coiled?

How well does worker.latency give you what you want? Maybe you'd want to look at the minimum latency over time, since latency is (I suspect) pretty sensitive to how responsive the scheduler is when it handles the worker heartbeat.
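
A hedged sketch of what that sampling could look like from the client, assuming the heartbeat-derived value is exposed as `Worker.latency` on the worker object:

```python
# Hedged sketch: sample the application-level scheduler<->worker latency a few
# times and keep the per-worker minimum, which should sit closer to the network
# floor than any single heartbeat-influenced sample. Assumes Worker.latency.
import time
from distributed import Client

client = Client("tcp://scheduler-address:8786")  # placeholder address

samples = []
for _ in range(30):
    samples.append(client.run(lambda dask_worker: dask_worker.latency))
    time.sleep(2)

min_latency = {addr: min(s[addr] for s in samples) for addr in samples[0]}
print(min_latency)  # seconds, per worker address
```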

Worker latency is captured by Prometheus so we have lots of recent data there, e.g.,

[screenshot: worker latency over time, from Prometheus]

I've also done a couple tests with scheduler and worker in different zones, and latencies looked like this:

[screenshots: measured latencies with scheduler and worker in different zones]

(see https://github.com/coiled/platform/issues/165 for more details on that)

@gjoseph92 (Collaborator, Author)

> What about corporate networks that were set up ~5 years ago and do not receive the amount of constant attention that AWS does?

True, it may not be that dramatic. My point is just that, as far as I know, it's already not good Dask practice to have tasks that are so fast their runtime is on the same order of magnitude as network latency (whatever that latency may be). You should probably increase task size in this case regardless, and if you do, the worker-saturation effect becomes pretty much moot.

All else being equal, I certainly want to improve performance in these higher-latency cases. It's just that the benchmark data suggests that choice comes at a cost, and I wouldn't want to pick a default that helps a less common, bad-practice use case at the expense of a more common one.

@crusaderky (Collaborator)

> How well does worker.latency give you what you want? Maybe you'd want to look at the minimum latency over time, since latency is (I suspect) pretty sensitive to how responsive the scheduler is when it handles the worker heartbeat.

It actually would be pretty interesting to see the difference between kernel-level ping (dictated almost exclusively by the network) and application-level latency, where the GIL plays a major role. If we saw that the latter eclipses the former, it would tell us that even if the network became much worse, we probably wouldn't notice as long as ping remains much smaller than the GIL-dominated RTT.

~250ms for the 4th quintile in your first plot above is pretty horrid. I'd be surprised if AWS were to blame for it.
