Throttle clog slow requests sent to monitors #39199
Closed
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
monclinet: Throttle clog slow requests sent to monitors
A recent change https://tracker.ceph.com/issues/43975 logs details for each slow request and sends to monitors
But on large cluster, it could overwhelm monitors with spurious logs when performance issue happens
and cause further instability in the cluster
In our case, ceph.log growed to more than 14GB quickly, and we need to restart all monitors to recover
This patch throttles clog slow requests instead of sending every slow request details to monitors
and also sends out a summary of how many slow requests an OSD has and the oldest slow request info
Fixes: https://tracker.ceph.com/issues/48909
Signed-off-by: Gerald Yang gerald.yang@canonical.com