nautilus: osd/OSD: Log slow ops/types to cluster logs #33503
In addition to logging slow ops in mon and osd specific log files, re-introduce logging the same information along with slow op type details to cluster logs as well. The objective is to make debugging slow ops easier.

Modify the log whitelisting string to "slow request" within qa suites in order to make the search for the new warning log message within the cluster log successful. This should not cause any issue as it's a substring of the earlier string.

Fixes: https://tracker.ceph.com/issues/43975
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
(cherry picked from commit d20f570)
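
The substring argument in the commit message can be illustrated with a minimal sketch. It assumes whitelist entries are applied as regular-expression searches against cluster log lines. The earlier whitelist entry is not quoted in this PR, so `OLD_ENTRY` below is a hypothetical placeholder, and the log lines are made up for illustration rather than real Ceph output.

```python
import re

def is_whitelisted(line, patterns):
    """Return True if any whitelist pattern matches somewhere in the line."""
    return any(re.search(p, line) for p in patterns)

# NEW_ENTRY is the string named in the commit message.
NEW_ENTRY = "slow request"
# Hypothetical placeholder: the PR does not quote the earlier entry,
# only that the new entry is a substring of it.
OLD_ENTRY = "slow request took too long"

# Made-up log lines, for illustration only (not real Ceph output).
old_style_line = "cluster [WRN] slow request took too long on osd.0"
new_style_line = "cluster [WRN] 3 slow requests (illustrative text)"
unrelated_line = "cluster [WRN] Health check failed: 1 osds down"

# Because NEW_ENTRY is a substring of OLD_ENTRY, every line the old
# entry matched is still matched by the new, shorter entry.
assert NEW_ENTRY in OLD_ENTRY
print(is_whitelisted(old_style_line, [NEW_ENTRY]))
# The shorter entry additionally matches the new-style message,
# which the longer placeholder entry would miss.
print(is_whitelisted(new_style_line, [NEW_ENTRY]))
print(is_whitelisted(new_style_line, [OLD_ENTRY]))
```

The trade-off of a shorter pattern is that it matches more lines, so it whitelists strictly more than the earlier entry did.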
NOTE: #33497 would need to be backported as well, which I guess should be merged separately with its own backport tracker.
@sseshasa Actually, I much prefer to cherry-pick any follow-on fixes directly into the main backport PR, as we have done so to great advantage in the past. So please feel free to cherry-pick it right into this PR.
(Following the thought a little further) If the follow-on fix gets its own tracker, there is a danger that:
@smithfarm #33497 is already merged and I don't see any tracker for it. I will therefore cherry-pick the commits from that PR.
The make check failure is most likely unrelated to this PR, since we see it happening in other nautilus PRs as well.
agreed to cherry-pick commits from #33497
The mons may have slow ops.
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 07badf0)

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit f4156ae)

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
(cherry picked from commit e527067)
Conflicts:
    qa/suites/rados/singleton-nomsgr/all/osd_stale_reads.yaml
    This file doesn't exist in nautilus.
All related commits are now cherry-picked from master.
@yuriw @neha-ojha Maybe including this in 14.2.8 QE will make the testing smoother?
@smithfarm I don't think this PR has any dependency on 14.2.8, and can wait for the next release.
Jenkins retest this please.
I updated the PR body/description to show where the commits came from.
Signed-off-by: Sage Weil <sage@redhat.com> (cherry picked from commit a4a3a3c)
@smithfarm Good catch! I am not sure how this got missed, but I have now cherry-picked the missed commit. Thanks!
@smithfarm I took the liberty of modifying the PR description after I included the missed commit. I hope that's okay?
Any change that makes the PR description more accurate is okay with me!
backport tracker: https://tracker.ceph.com/issues/44259
backport of:
parent tracker: https://tracker.ceph.com/issues/43975
this backport was staged using ceph-backport.sh version 15.1.0.437
find the latest version at https://github.com/ceph/ceph/blob/master/src/script/ceph-backport.sh