[core] Remove `is_from_worker` flag and `consumed_bytes` plasma counter #55260

edoakes · 2025-08-05T19:54:56Z

We have a deep abstraction leak in plasma, where we set is_from_worker when requesting objects from plasma and thread it through many layers. It is used for one thing: to decide whether or not to record a metric on the number of "consumed" bytes. Note that the flag is not always true from the worker (in GetIfLocal we set it to false). Original PR: #13853

The metric is only used for the "memory summary", which is output by the ray memory CLI command. For end users, this metric is very very unlikely to be useful. It is also used for the object spilling tests, which parse the string and make an assertion on the embedded values. This is very very brittle. If/when we revisit spilling we will test it in a more robust way.

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

gemini-code-assist

Summary of Changes

Hello @edoakes, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

I've made changes to the core Ray system to remove an unnecessary is_from_worker flag and the consumed_bytes metric from the Plasma object store. This refactoring addresses an abstraction leak and eliminates a metric that provided little value to users while complicating testing and maintenance. The changes simplify the Plasma client API, clean up related C++ and Python code, and update the protobuf schema, resulting in a more streamlined and robust system.

Highlights

Abstraction Leak Removal: I've removed the is_from_worker flag from the Plasma object store's Get operations. This flag was identified as an abstraction leak, unnecessarily complicating the Plasma API by exposing internal worker-related concerns.
Metric Deprecation: The consumed_bytes plasma counter and its associated logic have been entirely removed. This metric was primarily used for the ray memory CLI command and brittle object spilling tests, offering limited utility to end-users and proving difficult to maintain.
Test Code Simplification: The assert_no_thrashing test utility and all its invocations within test_object_spilling.py have been deleted. This cleanup was possible because the utility relied on the consumed_bytes metric, which is no longer relevant.
Protobuf Schema Update: The ObjectStoreStats protobuf definition has been updated to remove the consumed_bytes field, ensuring that the system's data structures accurately reflect the current state of metrics tracking.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in issue comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments or fill out our survey to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request removes the is_from_worker flag and the consumed_bytes metric from the plasma store, which simplifies the code by removing an abstraction leak. The changes are consistent across the C++ and Python codebase, including updates to the plasma protocol, client, and store implementations. The corresponding tests that relied on these features have also been updated. My review found one issue in the test files where an assertion related to the removed metric was not deleted, which would cause a test to fail. Otherwise, the changes look good and align with the stated goals of the PR.

python/ray/tests/test_object_spilling.py

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

israbbani

🚢

…es/rm-is-from-worker

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

…es/rm-is-from-worker

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

…es/rm-is-from-worker

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

…es/rm-is-from-worker

…er (#55260) We have a deep abstraction leak in plasma, where we set `is_from_worker` when requesting objects from plasma and thread it through many layers. It is used for one thing: to decide whether or not to record a metric on the number of "consumed" bytes. Note that the flag is not always `true` from the worker (in `GetIfLocal` we set it to `false`). Original PR: #13853 The metric is only used for the "memory summary", which is output by the `ray memory` CLI command. For end users, this metric is very very unlikely to be useful. It is also used for the object spilling tests, which parse the string and make an assertion on the embedded values. This is very very brittle. If/when we revisit spilling we will test it in a more robust way. --------- Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com> Signed-off-by: sampan <sampan@anyscale.com>

…er (ray-project#55260) We have a deep abstraction leak in plasma, where we set `is_from_worker` when requesting objects from plasma and thread it through many layers. It is used for one thing: to decide whether or not to record a metric on the number of "consumed" bytes. Note that the flag is not always `true` from the worker (in `GetIfLocal` we set it to `false`). Original PR: ray-project#13853 The metric is only used for the "memory summary", which is output by the `ray memory` CLI command. For end users, this metric is very very unlikely to be useful. It is also used for the object spilling tests, which parse the string and make an assertion on the embedded values. This is very very brittle. If/when we revisit spilling we will test it in a more robust way. --------- Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com> Signed-off-by: Andrew Grosser <dioptre@gmail.com>

…er (ray-project#55260) We have a deep abstraction leak in plasma, where we set `is_from_worker` when requesting objects from plasma and thread it through many layers. It is used for one thing: to decide whether or not to record a metric on the number of "consumed" bytes. Note that the flag is not always `true` from the worker (in `GetIfLocal` we set it to `false`). Original PR: ray-project#13853 The metric is only used for the "memory summary", which is output by the `ray memory` CLI command. For end users, this metric is very very unlikely to be useful. It is also used for the object spilling tests, which parse the string and make an assertion on the embedded values. This is very very brittle. If/when we revisit spilling we will test it in a more robust way. --------- Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com> Signed-off-by: jugalshah291 <shah.jugal291@gmail.com>

edoakes added 4 commits August 5, 2025 14:50

fix

3a501f9

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

fix

03d77eb

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

fix

4fe8668

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

fix

a71a8d4

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

edoakes requested a review from a team as a code owner August 5, 2025 19:54

edoakes added the go add ONLY when ready to merge, run all tests label Aug 5, 2025

gemini-code-assist bot reviewed Aug 5, 2025

View reviewed changes

python/ray/tests/test_object_spilling.py Show resolved Hide resolved

fix

2d56f0f

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

israbbani approved these changes Aug 5, 2025

View reviewed changes

edoakes added 4 commits August 5, 2025 17:35

Merge branch 'master' of https://github.com/ray-project/ray into eoak…

f9fdec4

…es/rm-is-from-worker

fix

d01898c

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

fix

4e35d4b

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

fix

9860fed

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

edoakes enabled auto-merge (squash) August 5, 2025 22:51

edoakes added 2 commits August 6, 2025 07:00

Merge branch 'master' of https://github.com/ray-project/ray into eoak…

6ba0358

…es/rm-is-from-worker

fix

3ad4fa4

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

github-actions bot disabled auto-merge August 6, 2025 12:01

edoakes added 2 commits August 6, 2025 09:08

Merge branch 'master' of https://github.com/ray-project/ray into eoak…

9857ca2

…es/rm-is-from-worker

fix

ace8881

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>

edoakes enabled auto-merge (squash) August 6, 2025 14:55

edoakes added 2 commits August 6, 2025 17:32

Merge branch 'master' of https://github.com/ray-project/ray into eoak…

137aeaf

…es/rm-is-from-worker

Merge branch 'master' of https://github.com/ray-project/ray into eoak…

ec556bf

…es/rm-is-from-worker

github-actions bot disabled auto-merge August 6, 2025 22:55

edoakes added 3 commits August 7, 2025 07:54

Merge branch 'master' of https://github.com/ray-project/ray into eoak…

e19c658

…es/rm-is-from-worker

Merge branch 'master' into eoakes/rm-is-from-worker

5aa0bfc

Merge branch 'master' into eoakes/rm-is-from-worker

7b4e1a3

edoakes enabled auto-merge (squash) August 8, 2025 03:02

github-actions bot disabled auto-merge August 8, 2025 03:02

Merge branch 'master' into eoakes/rm-is-from-worker

11ece1a

edoakes merged commit 795a874 into ray-project:master Aug 8, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[core] Remove `is_from_worker` flag and `consumed_bytes` plasma counter #55260

[core] Remove `is_from_worker` flag and `consumed_bytes` plasma counter #55260

Uh oh!

edoakes commented Aug 5, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

israbbani left a comment

Uh oh!

Uh oh!

Uh oh!

[core] Remove is_from_worker flag and consumed_bytes plasma counter #55260

[core] Remove is_from_worker flag and consumed_bytes plasma counter #55260

Uh oh!

Conversation

edoakes commented Aug 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

israbbani left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

[core] Remove `is_from_worker` flag and `consumed_bytes` plasma counter #55260

[core] Remove `is_from_worker` flag and `consumed_bytes` plasma counter #55260

edoakes commented Aug 5, 2025 •

edited

Loading