mon: clear duplicated logic in MDSMonitor #11209

david-z · 2016-09-23T07:29:28Z

Remove some duplicated logic in MDSMonitor when checking for a replacement for a failed MDS. This makes that part of the logic clearer and more readable.

Signed-off-by: Zhi Zhang <zhangz.david@outlook.com>

gregsfortytwo · 2016-09-23T21:08:52Z

src/mon/MDSMonitor.cc

+  } else if (info.state == MDSMap::STATE_STANDBY_REPLAY || 
+             info.state == MDSMap::STATE_STANDBY) {
+    dout(10) << " failing and removing " << gid << " " << info.addr << " mds." << info.rank 
+      << "." << info.inc << " " << ceph_mds_state_name(info.state)


These cases were almost the same, but you've dropped the last_beacon.erase(gid); line. Did you check git logs to see how we got the duplicated cases?

@gregsfortytwo Yes, I dropped last_beacon.erase(gid) because fail_mds_gid(gid) already does this work here.

I checked the git log, and this duplicated case was introduced back in 2011. At that time this part made sense because the logic was a little different from what it is now.
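To illustrate the point in the reply above, here is a minimal sketch of why an explicit erase becomes redundant once the failure path already clears the beacon entry. `ToyFailModel` and its members are hypothetical stand-ins, not the real MDSMonitor code; the only behavior taken from the discussion is that fail_mds_gid(gid) also erases the last_beacon entry.

```cpp
#include <cstdint>
#include <map>
#include <set>

// Hypothetical toy model, NOT the real MDSMonitor.
struct ToyFailModel {
  std::set<std::uint64_t> gids_in_map;       // GIDs currently in the toy MDS map
  std::map<std::uint64_t, int> last_beacon;  // gid -> time of last beacon (toy clock)

  // Assumption from the discussion: failing a GID removes it from the map
  // and also drops its last_beacon entry.
  void fail_mds_gid(std::uint64_t gid) {
    gids_in_map.erase(gid);
    last_beacon.erase(gid);
  }
};
```

Under this assumption, calling last_beacon.erase(gid) again after fail_mds_gid(gid) finds nothing to remove, which is why the extra line could be dropped in the merged branch.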

gregsfortytwo · 2016-09-23T21:09:17Z

src/mon/MDSMonitor.cc

-      fail_mds_gid(gid);
-      *mds_propose = true;
-    } else if (!info.laggy()) {
+    if (!info.laggy()) {


Also this might as well be an "else if" block instead of nested else-if, right?

@gregsfortytwo This was an "else if" back in 2011 and was later changed to a nested one. I think the current logic looks good and is easy to understand. If execution reaches this point, it means no replacement was found for the MDS, and we cannot ignore this MDS no matter what its current state is. We should record laggy_since once and report the warning.

Nearly all of the checking is based on MDS state. If we add a branch that checks whether the MDS is already laggy, I don't think the logic would be better or clearer than the current one.

What do you think? Thanks.

The last_beacon.erase part only made sense in the cases where we were actually removing an MDS. When we do last_beacon.erase for a GID that is laggy but staying in the map, it just gets replaced at the start of tick() next time (the "make sure last beacon is fully populated" section).

So I think you can delete that last_beacon.erase line, and then collapse the nested if into an else if as greg suggests.
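The behavior described above can be sketched with a small model. This is a simplified illustration under stated assumptions, not the actual tick() implementation: `ToyTickModel`, its integer clock, and its member names are hypothetical, and the only behavior taken from the comment is that tick() starts by repopulating last_beacon for every GID still in the map.

```cpp
#include <cstdint>
#include <map>
#include <set>

// Hypothetical toy model, NOT the real MDSMonitor.
struct ToyTickModel {
  std::set<std::uint64_t> gids_in_map;       // GIDs currently in the toy MDS map
  std::map<std::uint64_t, int> last_beacon;  // gid -> time of last beacon (toy clock)
  int now = 0;                               // toy clock, advanced once per tick

  // Models only the "make sure last beacon is fully populated" step:
  // any GID in the map without a beacon entry gets one stamped with `now`.
  void tick() {
    ++now;
    for (std::uint64_t gid : gids_in_map) {
      if (last_beacon.count(gid) == 0) {
        last_beacon[gid] = now;
      }
    }
  }
};
```

In this model, erasing the entry for a GID that remains in gids_in_map has no lasting effect: the next tick() recreates it with a fresh timestamp, which matches the argument that the erase is pointless for an MDS that is laggy but not removed.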

@jcsp yes, I looked through the code again and we can delay last_beacon.erase to the next tick().

jcsp · 2016-09-29T16:02:11Z

Tested here: http://pulpito.ceph.com/jspray-2016-09-27_00:45:11-fs-wip-jcsp-testing-20160926-distro-basic-mira

gregsfortytwo · 2016-10-05T20:25:24Z

Ping @david-z, looks like we're waiting on a few code changes from you. :)

david-z · 2016-10-09T02:41:04Z

@gregsfortytwo sorry for the late reply, I was on vacation last week.

Signed-off-by: Zhi Zhang <zhangz.david@outlook.com>

david-z · 2016-10-09T03:41:32Z

@gregsfortytwo @jcsp I have made the changes you suggested. Please help review. Thanks.

liewegas added cephfs Ceph File System cleanup core needs-qa labels Sep 23, 2016

gregsfortytwo requested changes Sep 23, 2016


jcsp assigned david-z Sep 29, 2016

gregsfortytwo removed the needs-qa label Oct 5, 2016

mon: clear duplicated logic in MDSMonitor

85c3ca1

Signed-off-by: Zhi Zhang <zhangz.david@outlook.com>

david-z force-pushed the wip-clear-dup-logic-mdsmonitor branch from c84719e to 85c3ca1 Compare October 9, 2016 03:39

gregsfortytwo added the needs-review label Oct 10, 2016

gregsfortytwo unassigned david-z Oct 10, 2016

jcsp merged commit 9c2dac1 into ceph:master Oct 11, 2016


gregsfortytwo Sep 23, 2016

david-z Sep 26, 2016

gregsfortytwo Sep 23, 2016

david-z Sep 26, 2016

david-z Sep 26, 2016

jcsp Sep 29, 2016

david-z Oct 9, 2016
