ENH: Add subclass of Adaptive that works with unknown tasks by elaineejiang · Pull Request #4973 · dask/distributed

elaineejiang · 2021-06-24T20:13:54Z

Related to #4816

I've found that Adaptive doesn't work as well when the functions I'm submitting are labeled as "unknown tasks". At work, we use a lot of internal functions that have highly variable durations. I added a subclass of Adaptive to scale based on the number of unblocked tasks instead of estimated durations. I recently gave a short talk on this extension at Dask Summit: https://zoom.us/rec/play/3jXhd0X69egba6uXWXsYhrBlNTKef2-J3dTX0Hr0j15NOU-RteQcple[…]g.1623702769220.3b25454921a97baf96ba551741201890&_x_zm_rhtaid=511 (skip to 00:10:42).

Curious to hear what you all think!

Also, not set on the subclass name ElasticAdaptive (a bit redundant) -- happy to take suggestions.

fjetter · 2021-06-29T15:31:06Z

Interesting. This somehow raises the question if it is worth measuring variance of task duration as well. At least theoretically we could then define the target using a confidence interval (smth like target = mean + 3 sigma)

@jrbourbeau I am not aware of other cases of subclasses to existing functionality in our code base. Any experience how we would want to proceed here? The functionality seems general enough that it may be useful for others (given enough docs)

Also, not set on the subclass name ElasticAdaptive (a bit redundant) -- happy to take suggestions.

Your subclass proposes to use Tasks instead of Occupancy. Maybe this is something we can work with.

we could do

class _Adaptive:
    pass


class OccupancyAdaptive(_Adaptive):
    pass

class TaskAdaptive(_Adaptive):
    pass

# Default / backwards compat
Adaptive = OccupancyAdaptive

fjetter · 2021-06-29T15:34:48Z

+    async def recommendations(self, target: int) -> dict:
+        """
+        Make scale up/down recommendations based on current state and target
+        """


Why do we need another recommendation method? Can this be somehow merged (haven't investigated the diff, yet)

The diff is that this recommendation method doesn't close workers that haven't arrived yet.

It removes this logic (https://github.com/dask/distributed/blob/main/distributed/deploy/adaptive_core.py#L171):

not_yet_arrived = requested - observed to_close = set() if not_yet_arrived: to_close.update(toolz.take(len(plan) - target, not_yet_arrived))

This might be specific to my use case since I'm using off-prem workers that take a while to start up.

TomAugspurger · 2021-06-29T15:52:31Z

This somehow raises the question if it is worth measuring variance of task duration as well

#4023 / #4028 proposes to capture the variance at a task-prefix level.

elaineejiang · 2021-07-06T18:59:57Z

@fjetter I like the Occupany vs. Task naming convention -- makes the distinction very clear. Will update the PR.

And thanks @TomAugspurger! Taking into account task variance would definitely be useful.

…Adaptive

…ic-adaptive

GPUtester · 2022-01-04T21:30:12Z

Can one of the admins verify this patch?

elaineejiang · 2022-01-12T16:56:23Z

@fjetter or @TomAugspurger - would either of you mind giving this another review? The failed tests don't appear related. Thanks!

fjetter reviewed Jun 29, 2021

View reviewed changes

elaineejiang added 2 commits July 7, 2021 16:33

ENH: Add subclass of Adaptive that works with unknown tasks

b5a90dd

Rename ElasticAdaptive to TaskAdaptive; default Adaptive to Occupancy…

016f49e

…Adaptive

elaineejiang force-pushed the elastic-adaptive branch from 3fec481 to 016f49e Compare July 15, 2021 14:51

Fix merge conflict in test_adaptive.py

b5f01a0

elaineejiang requested a review from fjetter July 20, 2021 13:31

elaineejiang added 2 commits January 4, 2022 15:40

merge upstream changes

35fc67b

Merge branch 'main' of https://github.com/dask/distributed into elast…

bdce5cd

…ic-adaptive

elaineejiang force-pushed the elastic-adaptive branch from 988b157 to bdce5cd Compare January 4, 2022 21:30

elaineejiang requested a review from jacobtomlinson as a code owner January 23, 2024 10:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ENH: Add subclass of Adaptive that works with unknown tasks#4973

ENH: Add subclass of Adaptive that works with unknown tasks#4973
elaineejiang wants to merge 5 commits intodask:mainfrom
elaineejiang:elastic-adaptive

elaineejiang commented Jun 24, 2021

Uh oh!

fjetter commented Jun 29, 2021

Uh oh!

fjetter Jun 29, 2021

Uh oh!

elaineejiang Jul 6, 2021

Uh oh!

TomAugspurger commented Jun 29, 2021

Uh oh!

elaineejiang commented Jul 6, 2021

Uh oh!

GPUtester commented Jan 4, 2022

Uh oh!

elaineejiang commented Jan 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

elaineejiang commented Jun 24, 2021

Uh oh!

fjetter commented Jun 29, 2021

Uh oh!

fjetter Jun 29, 2021

Choose a reason for hiding this comment

Uh oh!

elaineejiang Jul 6, 2021

Choose a reason for hiding this comment

Uh oh!

TomAugspurger commented Jun 29, 2021

Uh oh!

elaineejiang commented Jul 6, 2021

Uh oh!

GPUtester commented Jan 4, 2022

Uh oh!

elaineejiang commented Jan 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants