
[Data] Use runtime object memory for scheduling #41383

Merged
merged 13 commits into ray-project:master on Nov 29, 2023

Conversation

bveeramani (Member) commented Nov 26, 2023

Why are these changes needed?

When selecting an operator to run, the scheduler doesn't consider how much object store memory an operator consumes. If an operator produces large blocks, the scheduler might select the operator too frequently, and your cluster can run out of memory.

To prevent this from happening, this PR updates the scheduler so that it considers the incremental object store memory usage when selecting an operator.
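
To make the idea concrete, here is a minimal, self-contained sketch of the kind of check this implies. The names Usage and under_memory_limit are hypothetical illustrations, not the actual Ray Data implementation; per the linked issue, the real per-task estimate comes from incremental_resource_usage().

from dataclasses import dataclass

@dataclass
class Usage:
    object_store_memory: int  # bytes

# Hypothetical sketch: only select an operator if launching one more task
# keeps the projected object store memory usage under the limit.
def under_memory_limit(current: Usage, incremental: Usage, limit: Usage) -> bool:
    projected = current.object_store_memory + incremental.object_store_memory
    return projected <= limit.object_store_memory

# Example: 800 MB used, a task expected to add 300 MB, 1 GB limit -> skip the operator.
print(under_memory_limit(Usage(800_000_000), Usage(300_000_000), Usage(1_000_000_000)))  # False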

Related issue number

Fixes #41190

Checks

  • I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: Balaji Veeramani <balaji@anyscale.com>
@@ -102,6 +102,11 @@
int(os.environ.get("RAY_DATA_USE_STREAMING_EXECUTOR", "1"))
)

# Whether to use the runtime object store memory metrics for scheduling.
DEFAULT_USE_RUNTIME_METRICS_SCHEDULING = bool(
int(os.environ.get("DEFAULT_USE_RUNTIME_METRICS_SCHEDULING", "1"))
bveeramani (Member, Author) commented:
Will set this to 0 before merging. Currently enabled to ensure tests pass.
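
For anyone trying this out before it's disabled by default, the flag can presumably be toggled either through the environment variable above or on the current DataContext. The attribute name below comes from the scheduling check later in this PR; the exact setup is an assumption, not documented usage.

import os

# Assumption: the environment variable must be set before Ray Data first
# reads it (i.e., before the DataContext is created).
os.environ["DEFAULT_USE_RUNTIME_METRICS_SCHEDULING"] = "1"

# Assumption: the flag can also be flipped on the current DataContext at runtime.
from ray.data.context import DataContext

DataContext.get_current().use_runtime_metrics_scheduling = True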

@bveeramani bveeramani marked this pull request as ready for review November 28, 2023 02:21
Signed-off-by: Balaji Veeramani <balaji@anyscale.com>
Comment on lines +653 to 661
if (
    DataContext.get_current().use_runtime_metrics_scheduling
    and global_ok_sans_memory
    and op.metrics.average_bytes_change_per_task is not None
    and op.metrics.average_bytes_change_per_task <= 0
):
    return True

return global_ok_sans_memory and downstream_memory_ok
Contributor commented:
Suggested change

-if (
-    DataContext.get_current().use_runtime_metrics_scheduling
-    and global_ok_sans_memory
-    and op.metrics.average_bytes_change_per_task is not None
-    and op.metrics.average_bytes_change_per_task <= 0
-):
-    return True
-
-return global_ok_sans_memory and downstream_memory_ok
+if DataContext.get_current().use_runtime_metrics_scheduling:
+    return (
+        global_ok_sans_memory
+        and op.metrics.average_bytes_change_per_task is not None
+        and op.metrics.average_bytes_change_per_task <= 0
+    )
+else:
+    return global_ok_sans_memory and downstream_memory_ok

bveeramani (Member, Author) replied:

I tested this out with test_large_e2e_backpressure, and doing an if/else might not work. We're over the memory limit and average_bytes_change_per_task is initially None, so consume tasks aren't launched; as a result, we keep pulling data from the produce tasks until all of the produce tasks are complete.

Left the code as the original change for now (see the toy comparison below); that should minimize the risk while still providing some benefit. In the long term, I think we need to devise a strategy for when metrics are None.
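
To make the failure mode concrete, here is a self-contained toy comparison of the two predicates. Function names are hypothetical; the logic mirrors the two snippets above, with if_else_variant assuming the feature flag is enabled.

from typing import Optional

def merged_variant(global_ok_sans_memory: bool,
                   downstream_memory_ok: bool,
                   avg_bytes_change: Optional[float]) -> bool:
    # Escape hatch: an operator that shrinks memory usage can run even when
    # the downstream-memory check fails.
    if (global_ok_sans_memory
            and avg_bytes_change is not None
            and avg_bytes_change <= 0):
        return True
    # Otherwise fall back to the pre-existing check.
    return global_ok_sans_memory and downstream_memory_ok

def if_else_variant(global_ok_sans_memory: bool,
                    downstream_memory_ok: bool,
                    avg_bytes_change: Optional[float]) -> bool:
    # Assumes use_runtime_metrics_scheduling is enabled, so the old
    # downstream_memory_ok fallback is never consulted.
    return (global_ok_sans_memory
            and avg_bytes_change is not None
            and avg_bytes_change <= 0)

# Before any task of the operator finishes, the metric is still None:
print(merged_variant(True, True, None))   # True  -- falls back to the old check
print(if_else_variant(True, True, None))  # False -- the consumer never launches,
                                          # so the metric never gets a value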

bveeramani and others added 4 commits November 29, 2023 01:12
Co-authored-by: Hao Chen <chenh1024@gmail.com>
Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>
@bveeramani bveeramani merged commit 64e5373 into ray-project:master Nov 29, 2023
16 checks passed
Development

Successfully merging this pull request may close these issues.

[data] Add object_store_memory to incremental_resource_usage()