-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Data] Streaming executor logging verbosity not configurable? #42191
Comments
Currently we do not have a way to configure file/module specific logging (without modifying the underlying classes/files). We could expose this as a parameter from DataContext specifically for streaming executor logs (e.g. can generalize the |
I think it would go a long way to solving this by just printing this information once, rather than on every batch. I'm using this with I've tried these to no avail
and my current workaround is to delete the offending lines from the modules aha here is couple seconds worth of output
|
My team uses Ray with Jupyter notebooks and this issue is a big problem in this use case. The official documentation says that you can configure the logging level of different ray components, but that doesn't seem to work. We use devcontainers so workarounds like monkey-patching or deleting ray logging lines aren't great. |
Also running into this issue! |
#43360 and #43735 cleans up a lot of the Ray Data logging to stdout. Full logs will still be written to the Ray Data log file. You can try out the latest nightly, or wait for the upcoming Ray 2.10 release which will include this change. Please feel free to re-open this issue for further followups questions or requests! |
What happened + What you expected to happen
Any time I perform a set of operations on a
ray.data.Dataset
, I see at least three info-level logging messages fromstreaming_executor.py
:I would like fewer logging messages from the streaming executor. I've tried several different approaches without success:
ray.init(runtime_env={"worker_process_setup_hook": shut_up_streaming_executor_func})
I expect that this logging should be configurable, I just don't know how to do it. Yes, I've read through this docs page.
Versions / Dependencies
macOS 14.1
PY 3.10
ray 2.9.0
Reproduction script
outputs
Issue Severity
Low: It annoys or frustrates me.
The text was updated successfully, but these errors were encountered: