mds: fix cond that prevents sending MLock messages to starting MDS #21577

batrick · 2018-04-21T00:30:39Z

Fixes: http://tracker.ceph.com/issues/23812

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

batrick · 2018-04-21T00:56:02Z

This is not quite right. The actual fix is to change MDSMap::is_degraded to return true if a rank is starting. I'll fix that shortly. (Time for dinner!)

ukernel · 2018-04-21T03:28:09Z

src/mds/Locker.cc

@@ -890,7 +890,7 @@ void Locker::eval_gather(SimpleLock *lock, bool first, bool *pneed_issue, list<M
 	return;
      }

-      if (!mds->is_cluster_degraded() ||
+      if (!mds->is_cluster_degraded() &&
 	  mds->mdsmap->get_state(auth) >= MDSMap::STATE_REJOIN) {


It's not wrong. The !mds->is_cluster_degraded() check is an optimization: it avoids checking the target mds' state for each message.
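To make the short-circuit argument concrete, here is a minimal sketch of the two forms of the condition from the diff. The enum `RankState`, its numeric values, and the helpers `should_send_or` / `should_send_and` are hypothetical stand-ins for illustration only; they are not Ceph's types or API, and the only property assumed is that a starting rank compares below a rejoining one.

```cpp
#include <cassert>

// Hypothetical rank states. Only the ordering matters for this sketch:
// Starting sorts below Rejoin, which sorts below Active.
enum class RankState { Starting = 0, Replay = 1, Rejoin = 2, Active = 3 };

// Original form (||): if the cluster is not degraded, the right-hand side is
// never evaluated, so the per-message lookup of the target's state is skipped.
bool should_send_or(bool cluster_degraded, RankState target) {
  return !cluster_degraded || target >= RankState::Rejoin;
}

// Proposed form (&&): the target's state is checked only when the cluster is
// healthy, and nothing is ever sent while the cluster is degraded.
bool should_send_and(bool cluster_degraded, RankState target) {
  return !cluster_degraded && target >= RankState::Rejoin;
}

// A few cases that separate the two forms.
void check_examples() {
  // Healthy cluster, starting target: || sends, && does not.
  assert(should_send_or(false, RankState::Starting));
  assert(!should_send_and(false, RankState::Starting));
  // Degraded cluster, active target: || still sends, && suppresses it.
  assert(should_send_or(true, RankState::Active));
  assert(!should_send_and(true, RankState::Active));
}
```

Under these assumptions, the `||` form only reaches a starting rank when the degraded flag does not account for that rank, which is why the thread turns to what "degraded" should cover rather than to the operator itself.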


batrick · 2018-04-21T04:57:30Z

@ukernel PTAL. It looks to fix the problem in my testing. Please also look at #18278 (comment)

ukernel · 2018-04-21T07:04:13Z

The root cause is that the mds discovers the root inode while it's still in the starting state. It should handle lock messages after it gets a replica of the root inode. I think it would be better to discover the root inode when the mds is active (just like the code for creating a new mds rank). The subtle part is that, when the mds is in the starting state, it needs to load the dirfrag of its mdsdir and then create a new log segment.

batrick · 2018-04-23T03:11:10Z

@ukernel, that may be the cause, but does it need to be fixed if we do this patch? It seems to me that having one rank in up:starting is actually a degraded state, since the rank is unavailable, which could prevent lock processing.

batrick · 2018-04-23T03:41:01Z

Actually, what I said makes no sense, since an up:starting MDS wouldn't be holding any locks or be authoritative for any subtrees.

Still, I feel "degraded" might not be the right term here, and perhaps we just need a different term to represent that the MDS cluster has ranks which are creating/starting/recovering.

ukernel · 2018-04-23T04:01:08Z

I think degraded is OK for a cluster with a failed or recovering mds. If we make the mds not discover inodes when it's in the creating/starting state, I think we don't need to create a new state for a cluster with a creating/starting mds.
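The distinction being debated can be sketched as two predicates over the per-rank states. Everything below is a hypothetical model and not Ceph's MDSMap: `ClusterView`, `RankState`, `failed_ranks`, and `has_unready_rank` are names invented for illustration. "Degraded" covers failed or recovering ranks, while a separate, broader predicate also covers creating/starting ranks.

```cpp
#include <cassert>
#include <map>

// Hypothetical rank states, declared in an order that allows range checks.
// These are not Ceph's real constants.
enum class RankState {
  Creating, Starting, Replay, Resolve, Reconnect, Rejoin, ClientReplay, Active
};

struct ClusterView {
  std::map<int, RankState> ranks;  // rank id -> current state
  int failed_ranks = 0;            // ranks that currently have no daemon

  // "Recovering" here means any state from Replay through ClientReplay.
  static bool is_recovering(RankState s) {
    return s >= RankState::Replay && s <= RankState::ClientReplay;
  }

  // Degraded: at least one failed or recovering rank. Creating/starting
  // ranks do not count, matching the narrower reading in the thread.
  bool is_degraded() const {
    if (failed_ranks > 0) return true;
    for (const auto& entry : ranks) {
      if (is_recovering(entry.second)) return true;
    }
    return false;
  }

  // Broader, hypothetical predicate: any rank that is not yet active,
  // including creating/starting ranks.
  bool has_unready_rank() const {
    if (failed_ranks > 0) return true;
    for (const auto& entry : ranks) {
      if (entry.second != RankState::Active) return true;
    }
    return false;
  }
};
```

In this model a cluster with one active rank and one starting rank is not degraded but does have an unready rank, which is the case the two reviewers are deciding how to name and handle.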

ukernel · 2018-04-23T09:19:00Z

@batrick do you want me to fix #23812?

batrick · 2018-04-23T13:43:21Z

Sure!

batrick added the bug-fix, cephfs, and needs-review labels Apr 21, 2018

batrick requested a review from ukernel April 21, 2018 00:30

ukernel reviewed Apr 21, 2018

mds: fix cond that prevents sending MLock messages to starting MDS

5aa5c66

Fixes: http://tracker.ceph.com/issues/23812 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

batrick force-pushed the i23812 branch from 0046598 to 5aa5c66 Compare April 21, 2018 04:56

batrick mentioned this pull request Apr 21, 2018

mds: delay MMDSCacheRejoin::OP_WEAK until rejoin #18278

Closed

batrick closed this Apr 23, 2018

batrick deleted the i23812 branch April 23, 2018 17:51

batrick restored the i23812 branch May 23, 2018 18:45

batrick deleted the i23812 branch September 7, 2018 01:41
