scrub/osd: add clearer reminders that a scrub is blocked #46643
Note: the 'since' seconds counters in the 'dump pgs' listing are not updated fast enough: currently, an unrelated |
jenkins retest this please |
jenkins test make check |
A few minor clarification comments.
bcb1a15 to acee06d
Looks good to me. Might still be good to have a +1 from another reviewer.
Whenever a scrub session is waiting for an excessive length of time for a locked object to be unlocked, the total number of concurrent scrubs in the system is reduced. The existing cluster warning issued on such occurrences is easily overlooked. Here we add a constant reminder each time the OSD tries to schedule scrubs.
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
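A minimal sketch of the "reminder on every scheduling attempt" idea described in this commit message. The type `ScrubSchedulerSketch`, its members, and the message text are hypothetical illustrations, not the actual Ceph OSD code:

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for the OSD's scrub bookkeeping; not Ceph's real class.
struct ScrubSchedulerSketch {
  int blocked_scrubs = 0;  // scrub sessions currently waiting on a locked object

  void on_scrub_blocked() { ++blocked_scrubs; }

  void on_scrub_unblocked() {
    assert(blocked_scrubs > 0);  // must pair with an earlier on_scrub_blocked()
    --blocked_scrubs;
  }

  // Called on every scheduling attempt. Returns the reminder text to log,
  // or an empty string when no scrub is blocked.
  std::string reminder_on_schedule() const {
    if (blocked_scrubs == 0) {
      return std::string();
    }
    return "scrub scheduling: " + std::to_string(blocked_scrubs) +
           " scrub session(s) blocked on locked objects";
  }
};
```

Because the reminder is produced on each scheduling attempt rather than once when the block starts, it keeps reappearing in the log for as long as the blocked count is non-zero.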
@ljflores, @Matan-B, @neha-ojha : added a 2nd commit, to disable the warning message in some |
See |
jenkins test make check |
As some Teuthology tests seem to block objects for many minutes, we must not issue the "scrub is blocked for too long" warning in those tests (that warning causes the tests to fail). A new configuration parameter now controls the grace period before the warning is issued. Some tests were modified to set this configuration parameter to a large value.
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
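The grace-period check described in this commit could look roughly like the sketch below. The struct name, the `grace` field, and the values 120 s and 3600 s are assumptions made for illustration; the commit message does not name the actual configuration parameter or its default:

```cpp
#include <cassert>
#include <chrono>

// Hypothetical model of the "blocked for too long" decision; not Ceph code.
struct BlockedScrubWarningSketch {
  // Grace period before the warning is issued (stands in for the new
  // configuration parameter, whose real name is not given in the source).
  std::chrono::seconds grace;

  // True once the scrub has been blocked for longer than the grace period.
  bool should_warn(std::chrono::seconds blocked_for) const {
    assert(blocked_for.count() >= 0);  // durations are never negative
    return blocked_for > grace;
  }
};
```

A test suite that legitimately blocks objects for minutes would set `grace` to a large value (say `std::chrono::seconds(3600)`), so that `should_warn` stays false for the duration of the test.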
http://pulpito.front.sepia.ceph.com/?branch=wip-rf-blocked |