We experience some excess memory use because different workers are processing tasks of different priorities.
When we create task graphs we run `dask.order` on them, which provides an ordering that minimizes memory use. When this graph goes out to the workers it gets cut up, and tasks that are very close to each other in the ordering may end up on different workers. Those workers may then be occupied with other work, so while some tasks early in the ordering are complete, their co-dependents may not be, and instead sit idle on another worker despite their high priority.
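For a concrete sense of what this ordering looks like, here is a minimal example using `dask.order.order` on a classic dict-style graph (the exact priority numbers may vary between Dask versions):

```python
from dask.order import order

# A tiny graph: two independent inputs, "a" and "b", joined by "c".
dsk = {
    "a": (sum, [1, 2]),
    "b": (sum, [3, 4]),
    "c": (sum, ["a", "b"]),
}

# order() assigns each key an integer priority; lower runs earlier.
print(order(dsk))  # e.g. {'a': 0, 'b': 1, 'c': 2}
```

When this graph is split across workers, "a" and "b" may land on different machines even though the ordering wants them finished back to back, so that "c" can run and their results can be released.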
We might resolve this in a few ways:
- Preferentially steal tasks by their priority. This is possibly expensive (sorting is costly) but might be worth it for tasks without dependencies, or in cases where the number of tasks is not high (see the first sketch after this list)
- Revert to scheduling tasks only as needed. Currently we schedule all runnable tasks immediately, which helps ensure saturation of hardware. We could rethink this move.
- Don't do anything, and rely on mechanisms that slow down workers when they hold too much data in memory, allowing their peers to catch up (see the second sketch after this list)
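A rough sketch of priority-aware stealing under the first option. The function name `pick_tasks_to_steal` and the `stealable` structure are hypothetical, not existing scheduler API; the point is that a heap selection avoids a full sort:

```python
import heapq

def pick_tasks_to_steal(stealable, max_tasks=5):
    """Choose which runnable tasks to steal from a saturated worker.

    ``stealable`` is assumed to be an iterable of (priority, task_key)
    pairs, where a lower priority number means "run earlier", matching
    the numbers produced by dask.order. heapq.nsmallest takes only the
    k most urgent tasks in O(n log k) rather than sorting everything.
    """
    return heapq.nsmallest(max_tasks, stealable)

# Hypothetical usage: grab the two most urgent stealable tasks.
stealable = [(12, "x-3"), (2, "x-0"), (7, "x-1"), (30, "y-9")]
print(pick_tasks_to_steal(stealable, max_tasks=2))
# [(2, 'x-0'), (7, 'x-1')]
```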
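And a minimal sketch of the kind of memory check the last option leans on; the threshold and function name here are illustrative, not the actual distributed implementation:

```python
import psutil

PAUSE_FRACTION = 0.8  # illustrative threshold, not distributed's default

def should_pause(process=None):
    """Return True when this worker's resident memory exceeds the
    threshold, signaling it to stop starting new tasks until its
    peers catch up and its memory drains."""
    process = process or psutil.Process()
    used = process.memory_info().rss
    total = psutil.virtual_memory().total
    return used / total > PAUSE_FRACTION
```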