
mon: don't blow away bootstrap-mgr on upgrades #18399

Merged: 1 commit into ceph:master on Nov 2, 2017
Conversation

@jcsp (Contributor, Author) commented Oct 19, 2017

@vasukulkarni please could you try the upgrade suite on this?

// ceph-create-keys)
EntityName bootstrap_mgr_name;
bootstrap_mgr_name.from_str("client.bootstrap-mgr");
if (!mon->key_server.contains(bootstrap_mgr_name)) {
KeyServerData::Incremental auth_inc;
bool r = auth_inc.name.from_str("client.bootstrap-mgr");
Review comment (Member):

Can't we reuse bootstrap_mgr_name here for auth_inc.name?

@jcsp (Contributor, Author) replied:
Sure! done.

@vasukulkarni (Contributor) commented:

@jcsp sure, I will remove the workaround I added here (https://github.com/ceph/ceph/blob/master/qa/tasks/ceph_deploy.py#L767-L779), retest with that branch, and update the results here.

@vasukulkarni (Contributor) commented:

@jcsp this will need a shaman build; can you please push this branch to cephci?

Fixes: http://tracker.ceph.com/issues/20950
Signed-off-by: John Spray <john.spray@redhat.com>
vasukulkarni added a commit that referenced this pull request Oct 19, 2017
fixed by #18399

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
@vasukulkarni (Contributor) commented:

Haven't seen this before: after the upgrade it tries to restart the service, but the restart failed. The mgr node is installed after the service restart, so I can try to shuffle the restart order, but this looks new since it worked before.

http://qa-proxy.ceph.com/teuthology/vasu-2017-10-19_20:39:33-upgrade:jewel-x:ceph-deploy:-wip-20950-distro-basic-vps/1752183/teuthology.log

2017-10-19T21:12:26.248 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ]   python-cephfs.x86_64 2:13.0.0-2169.g57229ea.el7
2017-10-19T21:12:26.248 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ]   python-rados.x86_64 2:13.0.0-2169.g57229ea.el7
2017-10-19T21:12:26.248 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ]   python-rbd.x86_64 2:13.0.0-2169.g57229ea.el7
2017-10-19T21:12:26.248 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ]
2017-10-19T21:12:26.248 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ] Replaced:
2017-10-19T21:12:26.248 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ]   libcephfs1.x86_64 1:10.2.10-0.el7
2017-10-19T21:12:26.248 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ]
2017-10-19T21:12:26.248 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ] Complete!
2017-10-19T21:12:26.367 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][INFO  ] Running command: sudo ceph --version
2017-10-19T21:12:26.492 INFO:teuthology.orchestra.run.vpm011.stderr:[vpm181][DEBUG ] ceph version 13.0.0-2169-g57229ea (57229ea2a4369518c7a16b7a09b045b7896f5a70) mimic (dev)
2017-10-19T21:12:26.498 INFO:teuthology.orchestra.run.vpm181:Running: 'sudo systemctl restart ceph.target'
2017-10-19T21:12:26.594 INFO:teuthology.orchestra.run.vpm011:Running: 'sudo ceph -s'
2017-10-19T21:17:26.753 INFO:teuthology.orchestra.run.vpm011.stderr:2017-10-19 21:17:26.754 7ff3696b4700  0 monclient(hunting): authenticate timed out after 300
2017-10-19T21:17:26.790 INFO:teuthology.orchestra.run.vpm011.stderr:2017-10-19 21:17:26.754 7ff3696b4700  0 librados: client.admin authentication error (110) Connection timed out
2017-10-19T21:17:26.791 INFO:teuthology.orchestra.run.vpm011.stderr:[errno 110] error connecting to the cluster
2017-10-19T21:17:26.791 ERROR:teuthology.run_tasks:Saw exception from tasks.

@vasukulkarni (Contributor) commented:

Tried one more run with some minor modifications but can't get it working; maybe this is a 13.x issue, since previously the upgrade was tested from jewel -> 12.x. I don't see any monitor crash in any of the logs.

http://pulpito.ceph.com/vasu-2017-10-26_21:09:34-upgrade:jewel-x:ceph-deploy:-wip-20950-distro-basic-vps/

@jcsp (Contributor, Author) commented Oct 31, 2017

@vasukulkarni can you try testing this cherry-picked onto luminous please?

@vasukulkarni (Contributor) commented:

I will pick this up on luminous and test and update here.

@vasukulkarni (Contributor) commented:

Tested this after cherry-picking the commit on top of luminous and without the mgr workaround and it is working fine, full logs here: http://pulpito.ceph.com/vasu-2017-11-02_00:30:23-upgrade:jewel-x:ceph-deploy:-wip-qa-mgr-testing-distro-basic-vps/1800269/

@jcsp jcsp merged commit 737877f into ceph:master Nov 2, 2017
@jcsp jcsp deleted the wip-20950 branch November 2, 2017 10:37