New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
jewel: osd: do not send ENXIO on misdirected op by default #13255
Conversation
In practice this tends to get bubbled up the stack as an error on the caller, and they usually do not handle it properly. For example, with librbd, this turns into EIO and break the VM. Instead, this will manifest as a hung op on the client. That is also not ideal, but given that the root cause here is generally a bug, it's not clear what else would be better. We already log an error in the cluster log, so teuthology runs will continue to fail. Signed-off-by: Sage Weil <sage@redhat.com> (cherry picked from commit 923e7f5) # Conflicts: # PendingReleaseNotes # src/common/config_opts.h
@jdurgin @liewegas This passed an
I re-ran the "4 other failures" with the following results:
Log of the reproducibly failed test: http://qa-proxy.ceph.com/teuthology/smithfarm-2017-02-22_15:06:17-rados-pr-13255-distro-basic-smithi/847000/teuthology.log Do you think the above test results are sufficient to merge this PR? |
the rados_api_test failure is a known issue - http://tracker.ceph.com/issues/15368 The rest are unrelated to this change, so I think this is ready to merge |
We now only send the ENXIO reply if the osd_enxio_on_misdirected_op option | ||
is enabled (it's off by default). This means that a VM using librbd that | ||
previously would have gotten an EIO and gone read-only will now see a | ||
blocked/hung IO instead. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@liewegas This didn't make it into the 10.2.6 release notes, so it can be deleted now?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not sure what you mean... if the patch is in 10.2.6 then we should add this to the 10.2.6 release notes. (It's a bit confusing because these pending notes live in the jewel etc branches but the notes are all in master :/ )
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can you send it in the release notes, I just merged it in master before announcing the release
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, it's in 10.2.6 - @theanalyst Can we squeeze this in still?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah lets do it, I'll update the blog, can send an addition in mail too if we want
http://tracker.ceph.com/issues/18812