[Data] Fix unclear metadata warning and incorrect operator name logging#61380
[Data] Fix unclear metadata warning and incorrect operator name logging#61380bveeramani merged 5 commits intoray-project:masterfrom
Conversation
Signed-off-by: “Alex <alexchien130@gmail.com>
Signed-off-by: “Alex <alexchien130@gmail.com>
There was a problem hiding this comment.
Code Review
This pull request effectively addresses the issue of unclear operator names in metadata timeout warnings. By adding an operator_name parameter to DataOpTask and updating the warning message, the logging becomes much more informative and user-friendly. The changes are well-implemented across physical_operator.py, hash_shuffle.py, and map_operator.py. The inclusion of a unit test is also good. I have one suggestion to further improve the test coverage to ensure the new warning message is verified.
Signed-off-by: “Alex <alexchien130@gmail.com>
Signed-off-by: “Alex <alexchien130@gmail.com>
|
@bveeramani PTAL~ Thank you |
bveeramani
left a comment
There was a problem hiding this comment.
Left comments.
ty @machichima for doing the first pass
slfan1989
left a comment
There was a problem hiding this comment.
Thanks to @machichima and @bveeramani for the detailed comments! I don’t have additional suggestions, but I do have one point: The current log warning says 'waiting for operator,' but it’s actually waiting for metadata from the operator. It might be clearer to change it to 'waiting for metadata from operator …' to avoid any confusion."
Signed-off-by: “Alex <alexchien130@gmail.com>
|
bbd79a1 - Updated based on the review comments and Slack discussion |
| def test_operator_name_parameter(self, ray_start_regular_shared): | ||
| streaming_gen = create_stub_streaming_gen(block_nbytes=[1]) | ||
| task = DataOpTask(0, streaming_gen, operator_name="MapBatches(fn)") | ||
| assert task._operator_name == "MapBatches(fn)" |
There was a problem hiding this comment.
In general, I think we should avoid testing against internal attributes, but won't block since I know this has been in-flight for a while
There was a problem hiding this comment.
Thanks for noting. Will keep this in mind for future contributions.
…ng (ray-project#61380) ## Why are these changes needed? The metadata timeout warning always shows `operator=DataOpTask` instead of the actual operator name, making it impossible to identify which operator is stuck. ## What's included? **Add operator_name parameter to DataOpTask** - New optional parameter in `DataOpTask.__init__` stores the actual operator name **Rewrite warning message in plain English** **Add unit test** - `test_operator_name_parameter` verifies operator_name parameter is stored correctly ## Screenshots <img width="710" height="437" alt="Screenshot 2026-02-27 at 00 05 56" src="https://github.com/user-attachments/assets/5f168fe3-bf4a-4268-b471-1126ddcc4cee" /> <img width="710" height="334" alt="Screenshot 2026-02-27 at 01 57 34" src="https://github.com/user-attachments/assets/6e9648b0-0427-4667-b6e4-2beed9f9edab" /> ## Benefits - Users can identify which specific operator is stuck in their pipeline - Warning message uses plain English ## Note Closes ray-project#61334 --------- Signed-off-by: “Alex <alexchien130@gmail.com> Signed-off-by: Ayush Kumar <ayushk7102@gmail.com>
…ng (ray-project#61380) ## Why are these changes needed? The metadata timeout warning always shows `operator=DataOpTask` instead of the actual operator name, making it impossible to identify which operator is stuck. ## What's included? **Add operator_name parameter to DataOpTask** - New optional parameter in `DataOpTask.__init__` stores the actual operator name **Rewrite warning message in plain English** **Add unit test** - `test_operator_name_parameter` verifies operator_name parameter is stored correctly ## Screenshots <img width="710" height="437" alt="Screenshot 2026-02-27 at 00 05 56" src="https://github.com/user-attachments/assets/5f168fe3-bf4a-4268-b471-1126ddcc4cee" /> <img width="710" height="334" alt="Screenshot 2026-02-27 at 01 57 34" src="https://github.com/user-attachments/assets/6e9648b0-0427-4667-b6e4-2beed9f9edab" /> ## Benefits - Users can identify which specific operator is stuck in their pipeline - Warning message uses plain English ## Note Closes ray-project#61334 --------- Signed-off-by: “Alex <alexchien130@gmail.com>
Why are these changes needed?
The metadata timeout warning always shows
operator=DataOpTaskinstead of the actual operator name, making it impossible to identify which operator is stuck.What's included?
Add operator_name parameter to DataOpTask
DataOpTask.__init__stores the actual operator nameRewrite warning message in plain English
Add unit test
test_operator_name_parameterverifies operator_name parameter is stored correctlyScreenshots
Benefits
Note
Closes #61334