Cross-posting, since this now seems to be mostly a dask.distributed problem.
Maybe related:
#3898
dask/dask#3530
dask/dask#6762
See for code/repro: dmlc/xgboost#6388 (comment)
In very short order, both the workers and the scheduler hit OOM killers because they keep accumulating memory, even across cleanly completed Python client code.
This issue effectively makes it impossible to use dask with, e.g., NVIDIA RAPIDS/xgboost as a multi-GPU or multi-node solution.