New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
luminous: msg/async: do not bump connect_seq for fault during ACCEPTING_SESSION #29788
Conversation
Is there a tracker issue for this backport? |
No, I met the issue in my ceph cluster, it needs fix for ACCEPTING phase fault, this part of code has been rewritten in which Sage fixed the problem in msg/async/ProtocolV2 with commit b2b1234 There is a similar track issue for CONNECTING phase fault, Sage fixed it in msg/async/ProtocolV1 |
@chenerqi The corresponding master PR for this backport is #24546 It was merged for nautilus. That means: if it is going to be backported, there needs to be a tracker issue. The tracker issue should be created with the following fields:
(In the description just briefly explain the bug that #24546 is fixing) Once that tracker is created, we can proceed. Adding DNM until then. |
Tracker issue 42316 created. |
@chenerqi Thanks. There's one more thing left to do: fix the commit message. The correct commit message should be:
(in other words, please run |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
commit message is wrong
@rjfd Does this look right to you for luminous? |
ACCEPTING_SESSION If we have a connection race, and we lose, we may end up with outgoing messages *and* be in ACCEPTING_SESSION. If we then fault, we want to leave connect_seq at 0 to avoid triggering a reset. Signed-off-by: Sage Weil <sage@redhat.com> (cherry picked from commit b2b1234) Conflicts: src/msg/async/ProtocolV2.cc - file does not exist in luminous. Made change manually in the file src/msg/async/AsyncConnection.cc
Updated. |
backport tracker issue: https://tracker.ceph.com/issues/42318
partial backport (1 commit) of: #24546
parent tracker: https://tracker.ceph.com/issues/42316