[Data] Streamlining `DefaultActorPoolAutoscaler` by alexeykudinkin · Pull Request #61385 · ray-project/ray

alexeykudinkin · 2026-02-27T18:10:54Z

Description

Inlining ActorPoolResizingPolicy
Rebasing _ActorPool to compute utilization based on all actors, not just running
Allow autoscaler to scale up while pending actors are still starting up
Updated tests

Related issues

Link related issues: "Fixes #1234", "Closes #1234", or "Related to #1234".

Additional information

Optional: Add implementation details, API changes, usage examples, screenshots, etc.

gemini-code-assist

Code Review

This pull request refactors the DefaultActorPoolAutoscaler by inlining the ActorPoolResizingPolicy, which simplifies the structure. It also improves the autoscaling logic by calculating utilization based on all actors (including pending ones) and allowing scale-up operations even when there are pending actors. The changes are well-reasoned and the accompanying test updates are thorough. I have a few suggestions to improve code clarity and fix a potential issue in one of the tests.

gemini-code-assist · 2026-02-27T18:12:41Z

python/ray/data/tests/test_autoscaler.py

    assert actor_pool.get_pool_util() == 1.5
    assert_autoscaling_action(
-        delta=1,
+        delta=5,


The expected delta of 5 seems incorrect. Given the test setup, actor_pool_max_upscaling_delta defaults to 1. The calculated scale-up delta is ceil(10 * (1.5 - 1)) = 5, but this should be capped by max_scale_up, which evaluates to min(inf, 1, 5) = 1. Therefore, the final delta should be 1. Please verify the logic or update the test expectation.

Suggested change

delta=5,

delta=1,

gemini-code-assist · 2026-02-27T18:12:41Z

python/ray/data/_internal/actor_autoscaler/default_actor_autoscaler.py

-                delta = min(delta, max_scale_up)
-            delta = max(1, delta)  # At least scale up by 1
+
+            delta = self._compute_upscale_delta(actor_pool, op_state)


The op_state parameter is unused in the _compute_upscale_delta method (defined at line 230). Consider removing it from the method signature and this call site for cleaner code.

Suggested change

delta = self._compute_upscale_delta(actor_pool, op_state)

delta = self._compute_upscale_delta(actor_pool)

gemini-code-assist · 2026-02-27T18:12:42Z

python/ray/data/_internal/actor_autoscaler/default_actor_autoscaler.py

+def _estimate_expected_tasks(
+    op_state: OpState,
+) -> float:
+    # Each task consumes `average_num_inputs_per_task` input blocks on average,
+    # so the total expected number of tasks:
+    #
+    #   ceil(num enqueued blocks / avg_inputs_per_task)
+    #
+    avg_input_blocks_per_task = op_state.op.metrics.average_num_inputs_per_task or 1
+    return math.ceil(op_state.total_enqueued_input_blocks() / avg_input_blocks_per_task)


The function _estimate_expected_tasks estimates a number of tasks but returns a float. Since tasks are discrete units, it would be more idiomatic and type-safe to return an int.

Suggested change

def _estimate_expected_tasks(

op_state: OpState,

) -> float:

# Each task consumes `average_num_inputs_per_task` input blocks on average,

# so the total expected number of tasks:

#

# ceil(num enqueued blocks / avg_inputs_per_task)

#

avg_input_blocks_per_task = op_state.op.metrics.average_num_inputs_per_task or 1

return math.ceil(op_state.total_enqueued_input_blocks() / avg_input_blocks_per_task)

def _estimate_expected_tasks(

op_state: OpState,

) -> int:

# Each task consumes `average_num_inputs_per_task` input blocks on average,

# so the total expected number of tasks:

#

# ceil(num enqueued blocks / avg_inputs_per_task)

#

avg_input_blocks_per_task = op_state.op.metrics.average_num_inputs_per_task or 1

return int(math.ceil(op_state.total_enqueued_input_blocks() / avg_input_blocks_per_task))

python/ray/data/context.py

python/ray/data/_internal/actor_autoscaler/__init__.py

goutamvenkat-anyscale · 2026-02-27T23:41:47Z

python/ray/data/_internal/actor_autoscaler/default_actor_autoscaler.py

            )

+    def _compute_upscale_delta(
+        self, actor_pool: AutoscalingActorPool, op_state: OpState


op_state is not being used

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

… running; Removed dead methods; Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.}

cursor · 2026-03-13T21:31:24Z

python/ray/data/_internal/actor_autoscaler/default_actor_autoscaler.py

+        return math.ceil(
+            actor_pool.current_size()
+            * (actor_pool.get_pool_util() / self._actor_pool_scaling_up_threshold - 1)
+        )


Unused op_state parameter in _compute_upscale_delta

Low Severity

The op_state parameter in _compute_upscale_delta is accepted but never used in the function body. The function only uses actor_pool and self._actor_pool_scaling_up_threshold. This was confirmed in the PR discussion, where the author acknowledged the issue and said they would fix it.

alexeykudinkin requested a review from a team as a code owner February 27, 2026 18:10

alexeykudinkin added the go add ONLY when ready to merge, run all tests label Feb 27, 2026

gemini-code-assist bot reviewed Feb 27, 2026

View reviewed changes

cursor bot reviewed Feb 27, 2026

View reviewed changes

python/ray/data/context.py Show resolved Hide resolved

python/ray/data/_internal/actor_autoscaler/__init__.py Show resolved Hide resolved

ray-gardener bot added the data Ray Data-related issues label Feb 27, 2026

goutamvenkat-anyscale reviewed Feb 27, 2026

View reviewed changes

alexeykudinkin added 4 commits March 12, 2026 16:58

Inlined ActorPoolResizingPolicy

0d74ec9

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

Tidying up

f6eeea5

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

Rebased ActorPool utilization to be counted from all actors, not just…

b76958a

… running; Removed dead methods; Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

Updated tests

b62f31c

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

goutamvenkat-anyscale approved these changes Mar 13, 2026

View reviewed changes

Fixed tests

3c2232f

Signed-off-by: Alexey Kudinkin <ak@anyscale.com>

alexeykudinkin force-pushed the ak/act-ascl-clup branch from 365c5c1 to 3c2232f Compare March 13, 2026 21:28

alexeykudinkin enabled auto-merge (squash) March 13, 2026 21:28

cursor bot reviewed Mar 13, 2026

View reviewed changes

alexeykudinkin merged commit f1821c1 into master Mar 13, 2026
6 of 7 checks passed

alexeykudinkin deleted the ak/act-ascl-clup branch March 13, 2026 22:09

ayushk7102 mentioned this pull request Mar 17, 2026

... #61799

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Data] Streamlining `DefaultActorPoolAutoscaler`#61385

[Data] Streamlining `DefaultActorPoolAutoscaler`#61385
alexeykudinkin merged 5 commits intomasterfrom
ak/act-ascl-clup

alexeykudinkin commented Feb 27, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Feb 27, 2026

Uh oh!

goutamvenkat-anyscale Mar 13, 2026

Uh oh!

gemini-code-assist bot Feb 27, 2026

Uh oh!

gemini-code-assist bot Feb 27, 2026

Uh oh!

Uh oh!

Uh oh!

goutamvenkat-anyscale Feb 27, 2026

Uh oh!

alexeykudinkin Mar 12, 2026

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Mar 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	delta = self._compute_upscale_delta(actor_pool, op_state)
	delta = self._compute_upscale_delta(actor_pool)

Conversation

alexeykudinkin commented Feb 27, 2026

Description

Related issues

Additional information

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

goutamvenkat-anyscale Mar 13, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

goutamvenkat-anyscale Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

alexeykudinkin Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor bot Mar 13, 2026

Choose a reason for hiding this comment

Unused op_state parameter in _compute_upscale_delta

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Unused `op_state` parameter in `_compute_upscale_delta`