-
Notifications
You must be signed in to change notification settings - Fork 577
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
perf: performance issue when there is only barrier passing the cluster #17646
Comments
Will it be caused by the problem described in #17612? |
Perhaps fixed by #17612. |
Recurs today in another case |
Recurs today in another case |
The overhead (CPU usage) comes from the passing the barriers. We have to admit that passing a barrier will cost some overhead, although the overhead is neglectable in most cases. It reminds me of #6943, where we set the As written in #6943, the motivation was because of OOM issues. Shall we reduce number from 10000 to some lower number such as 1000 or 100, in order to reach a balance point between these two issues? |
In some real-world scenarios, we have encountered this situation.
In this situation, the only flow in the graph is the barrier. In our assumption, when there is no data between barriers, the barriers should flow through quickly. Otherwise, if the time for barriers to pass through the entire graph exceeds one second, then barriers will continue to accumulate, and the cluster will always be in a state of backpressure. more infos here https://www.notion.so/risingwave-labs/CVTE-2024-07-10-barrier-only-ecc23aa5b9ee4664a97a17c97a25d709?pvs=4
Below are some suspected causes and potential optimizations we are currently considering for this situation:
Even with some conjectures, I believe we first need to find a way to reliably reproduce this situation before attempting to make improvements. This way, we can verify the effectiveness of the optimizations.
The text was updated successfully, but these errors were encountered: