Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mon/Monitor: fix statfs handling before luminous switchover happens #19243

Merged
merged 1 commit into from Jan 3, 2018

Conversation

liewegas
Copy link
Member

After the mons are luminous but before we switch over to using the
MgrStatMonitor's new info, the version on mgrstat will generally be <<
than that of pgmon, and the client will send that version with the
request. This means that the statfs message will perpetually appear to be
in the future and fail the is_readable() check.

Fix this with any ugly hack that resets the version to 1 if we haven't
completed the luminous upgrade yet.

Signed-off-by: Sage Weil sage@redhat.com

@gregsfortytwo
Copy link
Member

Is there a ticket where I can read more about this? I think I get what's happening, but as you say it's an ugly hack...
Also you say you're setting it to version 1 in the commit/comment but appear to set it to version 0?

@yuriw
Copy link
Contributor

yuriw commented Nov 29, 2017

should fix http://tracker.ceph.com/issues/21930

@liewegas
Copy link
Member Author

the ticket doesn't have much info. you can log into a test machine in this state, though

see smithi185, smithi110 for the mons, and smithi145 for the hung client

2017-11-29 21:09:29.694437 7f54aabd8700  1 -- 172.21.15.30:0/283705467 --> 172.21.15.185:6789/0 -- statfs(23901 v26) v1 -- ?+0 0x55b54d886c00 con 0x55b54d528300
...

and the mons stuck with

2017-11-29 22:02:17.518638 7ff719fee700 10 mon.a@1(peon).paxosservice(mgrstat 25..25) dispatch 0x56439dbc0900 statfs(23900 pool -1 v26) v1 from client.14098 172.21.15.30:0/283705467 con 0x56439d490800
2017-11-29 22:02:17.518646 7ff719fee700 10 mon.a@1(peon).paxosservice(mgrstat 25..25)  waiting for paxos -> readable (v26)
2017-11-29 22:02:17.518651 7ff719fee700 10 mon.a@1(peon).paxosservice(mgrstat 25..25) dispatch 0x56439ce5fa80 statfs(23901 pool -1 v26) v1 from client.14098 172.21.15.30:0/283705467 con 0x56439d490800
2017-11-29 22:02:17.518656 7ff719fee700 10 mon.a@1(peon).paxosservice(mgrstat 25..25)  waiting for paxos -> readable (v26)

After the mons are luminous but before we switch over to using the
MgrStatMonitor's new info, the version on mgrstat will generally be <<
than that of pgmon, and the client will send that version with the
request.  This means that the statfs message will perpetually appear to be
in the future and fail the is_readable() check.

Fix this with any ugly hack that resets the version to 1 if we haven't
completed the luminous upgrade yet.

Signed-off-by: Sage Weil <sage@redhat.com>
Copy link
Member

@gregsfortytwo gregsfortytwo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay, seems like a simple enough fix for that.

Reviewed-by: Greg Farnum gfarnum@redhat.com =

@theanalyst
Copy link
Member

Do we still want this in 12.2.2?

@liewegas liewegas merged commit bccb483 into ceph:luminous Jan 3, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
5 participants