New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hammer: Testing: SIGABRT in TrackedOp::dump() via dump_ops_in_flight() #1081
Conversation
* add a wrapper to log uncaught exception to self.logger, greenlet also prints the backtrace and exception to stderr, but teuthology.log does not capture stderr. so we need to catch them by ourselves to reveal more info to root-cause this issue. * log uncaught exception thrown by Thrasher.do_thrash() to self.log. See: #10630 Signed-off-by: Kefu Chai <kchai@redhat.com> (cherry picked from commit 64de3cd)
Signed-off-by: David Zafman <dzafman@redhat.com> (cherry picked from commit 4ad3b86)
Add dump_ops_enable which continuously dumps ops using 3 commands Add optrack_toggle_delay to alternate op tracking enablement Signed-off-by: David Zafman <dzafman@redhat.com> (cherry picked from commit 7a52876) Conflicts: tasks/ceph_manager.py (trivial) tasks/thrashosds.py (trivial)
Signed-off-by: David Zafman <dzafman@redhat.com> (cherry picked from commit 447bf87)
Signed-off-by: David Zafman <dzafman@redhat.com> (cherry picked from commit a595651)
Only merge with ceph/ceph#10255 |
For some reason I got 11 crashes of python during a rados:thrash run. Using strings on the cores revealed a common strings: /usr/bin/python /usr/bin/ceph tell osd.* injectargs --osd_enable_op_tracker=... |
Signed-off-by: David Zafman <dzafman@redhat.com>
Needed to add stdout option to pass down to run() Signed-off-by: David Zafman <dzafman@redhat.com> (cherry picked from commit d762fa5)
Signed-off-by: David Zafman <dzafman@redhat.com> (cherry picked from commit 9d52d10) Conflicts: tasks/ceph_manager.py (trivial)
The last 2 commits come from #1122 which hasn't merged yet |
Passed with latest version: |
@dzafman Can we remove DNM now? I have prepared a |
Superseded by #1278 |
Backport 3 pull requests and add fix specifically for Hammer not to use osd.* since it crashes the command.
#344
#911
#1122