octopus: rocksdb: do not use non-zero recycle_log_file_num setting #45040
A non-zero recycle_log_file_num forces RocksDB to use the less reliable kTolerateCorruptedTailRecords mode for WAL recovery.

Fixes: https://tracker.ceph.com/issues/54288
Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
@ifed01 should we try to get this into 15.2.16? Though it might be a bit late.
I think that's not required. There is a workaround - one can adjust the setting manually if needed.
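For illustration, the manual workaround mentioned above could look roughly like the sketch below. It assumes the setting lives in a comma-separated `key=value` RocksDB options string (as in Ceph's `bluestore_rocksdb_options`); the helper name `override_rocksdb_option` and the sample string are hypothetical and not taken from this PR.

```python
def override_rocksdb_option(options: str, key: str, value: str) -> str:
    """Return a copy of a comma-separated key=value options string
    with `key` set to `value` (appended if it was not present)."""
    pairs = [p for p in options.split(",") if p]
    result = []
    found = False
    for pair in pairs:
        name, _, _old = pair.partition("=")
        if name.strip() == key:
            # Replace the existing value for this key.
            result.append(f"{key}={value}")
            found = True
        else:
            result.append(pair)
    if not found:
        result.append(f"{key}={value}")
    return ",".join(result)


# Hypothetical sample string; check the real value on your cluster first.
sample = "compression=kNoCompression,recycle_log_file_num=4,max_background_compactions=2"
new_opts = override_rocksdb_option(sample, "recycle_log_file_num", "0")
print(new_opts)
```

The resulting string would then presumably be applied through the cluster configuration (for example with `ceph config set osd bluestore_rocksdb_options "<string>"`) and the OSDs restarted. Treat the exact procedure as an assumption and verify it against the documentation for your release.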
makes sense to me
@ifed01 does this failure look familiar to you? It came up twice in teuthology runs, and I don't see it tracked anywhere. http://pulpito.front.sepia.ceph.com/yuriw-2022-05-09_21:49:19-rados-wip-yuri6-testing-2022-05-09-0734-octopus-distro-default-smithi/6829109/
Current analysis of the test run. @ifed01 I opened https://tracker.ceph.com/issues/49287 to track the BlueFS failure. Let me know what you think of it. http://pulpito.front.sepia.ceph.com/?branch=wip-yuri6-testing-2022-05-09-0734-octopus A few jobs failed due to problems in infrastructure, but passed in a rerun. Failures: Details:
@ljflores - sorry for the late response.
Rados suite results: https://pulpito.ceph.com/?branch=wip-yuri5-testing-2022-06-22-0914-octopus One unrelated dead cephadm job, which passed in the rerun.
Hi @ifed01, please see https://tracker.ceph.com/issues/55636#note-2. I suspect that this commit actually did cause the bug from https://tracker.ceph.com/issues/55636. The reason it was a tricky catch is that it appears to fail only on certain operating systems. Let me know what you think.