[Dask-on-Ray] Add support for Dask annotations. #22057
Conversation
# NOTE: We disable graph optimization since it can break annotations,
# see this issue: https://github.com/dask/dask/issues/7036.
result = sum_.compute(
    resources={"another_custom_resource": 0.01},
Is resources something from Dask? It's a bit confusing to understand how this interacts with ray remote args, such as num_cpus.
Dask Distributed has a similar resources concept, which we try to accommodate by mapping e.g. resources={"CPU": 1, "GPU": 1} to num_cpus=1, num_gpus=1.

It should be noted that the dask.compute() API allows arbitrary, scheduler-specific kwargs, so we can ask the user to specify whatever Ray-specific things they want here and they will be transparently passed on to the Dask-on-Ray scheduler. The same goes for dask.annotate(): it allows arbitrary annotations, which the scheduler can interpret however it wants.
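To illustrate the pass-through behavior described above, here is a toy sketch in plain Python. The function names (toy_compute, toy_ray_dask_get) are illustrative stand-ins, not actual Dask or Dask-on-Ray internals.

```python
# Toy model of dask.compute()'s kwarg pass-through; these functions are
# illustrative stand-ins, not actual Dask or Dask-on-Ray code.

def toy_ray_dask_get(dsk, keys, ray_remote_args=None, **other_annotations):
    # A scheduler receives any extra compute() kwargs and is free to
    # interpret them however it wants, e.g. as Ray task options.
    return {"keys": keys, "ray_remote_args": ray_remote_args}

def toy_compute(dsk, keys, scheduler, **kwargs):
    # Mimics dask.compute(..., scheduler=...): scheduler-specific kwargs
    # are forwarded to the scheduler untouched.
    return scheduler(dsk, keys, **kwargs)

result = toy_compute({}, ["x"], toy_ray_dask_get,
                     ray_remote_args={"num_cpus": 1})
```

The point is only that nothing in the compute path has to understand Ray-specific kwargs; they reach the scheduler intact.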
python/ray/util/dask/scheduler.py
("CPU", "num_cpus"),
("GPU", "num_gpus"),
("memory", "memory"),
("object_store_memory", "object_store_memory"),
Dask knows about object store?
Dask does not know about the object store; this map can be thought of as "resources to pluck out of resources and give as top-level Ray task options under a new name", i.e. these are resources that we do not allow in the ray.remote(resources={...}) argument and that have to be given as top-level args. Dask Distributed users are going to be used to giving their CPU and GPU resource requests as part of the resources={...} dict, either to .compute() or as an .annotate() annotation, so this mapping makes the transition a bit easier.

The other option is to let Ray throw an error when these resources are given in the ray.remote(resources={...}) argument and make the user give num_cpus, num_gpus, etc., as top-level .compute() and .annotate() arguments. I'd like to give the user the option to keep all resource requests in the resources={...} dict since I think that's a nice, clean option, but I'm happy to keep this more in line with our ray.remote() API.
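The "pluck out and rename" mapping described here can be sketched in plain Python. This is an illustrative sketch, not the actual scheduler.py code, and split_resources is a hypothetical helper name.

```python
# Illustrative sketch of splitting a Dask-style resources dict into Ray
# task options; not the actual scheduler.py implementation.

# Dask-style resource names that must become top-level Ray task options.
TOP_LEVEL_RESOURCES = {
    "CPU": "num_cpus",
    "GPU": "num_gpus",
    "memory": "memory",
    "object_store_memory": "object_store_memory",
}

def split_resources(resources):
    """Split a Dask-style resources dict into Ray task options.

    Known names are promoted to top-level options (CPU -> num_cpus, etc.);
    everything else is left as a custom Ray resource.
    """
    remote_args = {}
    custom = {}
    for name, amount in resources.items():
        if name in TOP_LEVEL_RESOURCES:
            remote_args[TOP_LEVEL_RESOURCES[name]] = amount
        else:
            custom[name] = amount
    if custom:
        remote_args["resources"] = custom
    return remote_args

print(split_resources({"CPU": 1, "GPU": 1, "foo": 0.5}))
# -> {'num_cpus': 1, 'num_gpus': 1, 'resources': {'foo': 0.5}}
```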
The other option is to let Ray throw an error when these resources are given in the ray.remote(resources={...}) argument.

Upon further thought, I like this option. Can we do this, and also update the docs to make it clear that resources is for Dask compatibility only? Users can just use ray_remote_args if they don't need compat.

Also, object_store_memory isn't actually a requestable resource...
I actually agree with your initial point. I'm thinking that it'd be better to either completely accommodate the top-level resources={...} API (as currently exists in the PR), or to only accept ray_remote_args. Only supporting the resources={...} API for custom (non-system) resources seems like bad split-API UX.
# (1) Current PR
with dask.annotate(resources={"CPU": 1, "foo": 1}):
col = ...
# (2) Custom resources + remote args
with dask.annotate(resources={"foo": 1}, ray_remote_args={"num_cpus": 1}):
col = ...
# (3) Remote args only
with dask.annotate(ray_remote_args={"num_cpus": 1, "resources": {"foo": 1}}):
col = ...
Hmm, then maybe we should go for (3), which optimizes for clarity over perfect compatibility. In that case, we can raise a warning/error if "resources" is found in the kwargs, telling the user to use ray_remote_args.
Agreed! I'll make that change.
The change is made. Compared to allowing a top-level resources dict along with num_cpus, num_gpus, etc., I feel as if this is slightly worse UX; see this example: e2a8a17#diff-cc2090d8130a6fee7ad03b26c646dc1640107533f4e05f6cca02eb7d8933553e

If this UX hit seems tenable to you, then I'm cool with it.
This PR adds support for Dask annotations, allowing users to specify Ray-level resource requests (or any other Ray task options) both for individual Dask operations, using the dask.annotate API, and for the Dask workload as a whole, by passing .compute(resources={...}, ray_remote_args={...}) at compute time.

See #21536 for more details on this feature.
Related issue number
Closes #21536
Checks
I've run scripts/format.sh to lint the changes in this PR.