Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

luminous: msg: ceph_abort() when there are enough accepter errors in msg server #24419

Merged
merged 1 commit into from
Oct 5, 2018

Commits on Oct 4, 2018

  1. msg: ceph_abort() when there are enough accepter errors in msg server

    In some extrem cases(we have met one in our production cluster), when Accepter thread break out , new client can not connect to the osd. Because the former heartbeat connections are already connected, other osd can not detect failure then notify monitor to mark the failed osd down.
    In the patch, we there are abnormal communication errors ,we just ceph_abort  so that osd can go down fastly and other osds can notify monitor to mark the failed osd down.
    Signed-off-by: penglaiyxy@gmail.com <penglaiyxy@gmail.com>
    
    (cherry picked from commit 00e0ab4)
    
    Conflicts:
    	src/common/legacy_config_opts.h : Resolved for ms_max_accept_failures
    	src/common/options.cc : Resolved for ms_max_accept_failures
    	src/msg/async/AsyncMessenger.cc : Resolved in accept
    	src/msg/simple/Accepter.cc : Resolved in entry
    root authored and Prashant D committed Oct 4, 2018
    Configuration menu
    Copy the full SHA
    999980a View commit details
    Browse the repository at this point in the history