[core][enable_task_events] Options to disable task tracing on task/actor #42431
Please update the related issue! Also, why is this a draft? (Is this ready to be reviewed?)
Yeah, I would love to get a quick round of review for the high-level approach first.
Will the "_report_task_events" set for the actor task override the "_report_task_events" set during the creation of the actor?
Does the actor task g report task events or not?
Yes, an actor task level config would override the actor level config.
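The precedence described in this reply can be sketched in plain Python, without Ray. This is only an illustration: the function name `resolve_enable_task_events`, the use of `None` for "not set", and the default of `True` are assumptions, not code from the PR.

```python
from typing import Optional


def resolve_enable_task_events(
    actor_level: Optional[bool],
    task_level: Optional[bool],
    default: bool = True,  # assumed default: reporting stays on unless disabled
) -> bool:
    """Return the effective setting for a single actor task.

    None means the option was not set at that level.
    """
    # A value set on the actor task itself takes priority.
    if task_level is not None:
        return task_level
    # Otherwise fall back to the value set when the actor was created.
    if actor_level is not None:
        return actor_level
    # Nothing was set at either level.
    return default
```

Under these assumptions, an actor created with reporting disabled can still have one method opt back in, since the task-level value is consulted first.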
Approach lgtm
Is this PR ready?
Not yet, still iterating on this.
python/ray/actor.py
@@ -1449,6 +1494,8 @@ def remote(self, *args, **kwargs):
False, # retry_exceptions
False, # is_generator
self._ray_method_generator_backpressure_num_objects.get(item, -1),
# TODO(rickyx): how to determine the defaults here?
Not too sure about the default here.
Most of the changes are just plumbing to add the option.
The major changes are in /core_worker, for the task event buffer refactoring and related changes.
In general LGTM. A few nit questions.
Approved, given that we choose the arguments discussed in the API review.
Just some nits.
@rkooo567 can you take a quick look at the delta in the commits? I made a few changes to make the tests pass.
Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com> Signed-off-by: Ricky Xu <xuchen727@hotmail.com>
You mean enable_task_events, right? This makes sense.
Hmm, I don't understand why we need this?
It's just a better name than the
Hmm, I feel like we should just make it orthogonal. It gets confusing if we start coupling user-level config and system-level config.
Btw, it is not a hard blocker.
This is actually exactly how it behaves in the PR, with the exception of not using the existing flag (i.e. RAY_task_events_report_interval):
I am also open to keeping the existing flags.
Updated to use the existing flags.
…tor (ray-project#42431) --------- Signed-off-by: rickyyx <rickyx@anyscale.com> Signed-off-by: Ricky Xu <xuchen727@hotmail.com> Co-authored-by: angelinalg <122562471+angelinalg@users.noreply.github.com>
Why are these changes needed?
There are requests to disable task events reporting at the actor/task level to avoid excessive data/communication:
This PR will allow users to disable task events reporting (both task status events and profile events) for a task/actor through an enable_task_events=False flag on the remote for actor and task:
Related issue number