[3.7] etcd migrate: instead of scaleup playbook etcd server should be started back #7313

vrutkovs · 2018-02-27T16:50:27Z

Backport of #7226
Replaces #7297

Verified to be working on 3.7 cluster

michaelgugino · 2018-02-27T17:26:48Z

playbooks/common/openshift-etcd/migrate.yml

+  roles:
+  - openshift_etcd_facts
+
+- name: Re-configure etcd and bring the cluster up


This looks okay to me. Please update with test results.

Had to add a few more vars, now migrates 3.7 cluster correctly

…ed back master doesn't need to be restarted and etcd URLs updated as new etcd nodes are being added. There is no need to run scaleup playbook, as etcd nodes are already added to the cluster. The migrate procedure now does the following: * checks if the etcd data needs to be migrated * makes etcd backup * stops etcd services on all nodes except the first one * migrates etcd data on first etcd node * clears data on other etcd nodes * updates etcd cluster configuration, updating ETCD_INITIAL_CLUSTER and ETCD_INITIAL_CLUSTER_STATE * starts the etcd service again one by one on etcd nodes Now the only copy of migrated data is on the first etcd cluster, which would replicate it to other nodes. After migration is done master configs are updated and services restarted if needed

michaelgugino

/lgtm

vrutkovs · 2018-02-28T15:42:58Z

/retest

vrutkovs · 2018-03-01T11:16:24Z

/retest

vrutkovs · 2018-03-08T09:43:24Z

/test install

openshift-ci-robot · 2018-03-08T10:27:56Z

@vrutkovs: The following tests failed, say /retest to rerun them all:

Test name	Commit	Details	Rerun command
ci/openshift-jenkins/logging	`7e30cda`	link	`/test logging`
ci/openshift-jenkins/extended_conformance_install_crio	`7e30cda`	link	`/test crio`
ci/openshift-jenkins/install	`7e30cda`	link	`/test install`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

openshift-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Feb 27, 2018

vrutkovs requested review from sdodson, michaelgugino and mtnbikenc February 27, 2018 16:50

michaelgugino approved these changes Feb 27, 2018

View reviewed changes

vrutkovs force-pushed the 3.7-migrate-start-etcd-and-check-health branch from b53a096 to 7e30cda Compare February 27, 2018 23:18

michaelgugino approved these changes Feb 28, 2018

View reviewed changes

openshift-ci-robot assigned michaelgugino Feb 28, 2018

openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Feb 28, 2018

sdodson merged commit 5a60969 into openshift:release-3.7 Mar 8, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[3.7] etcd migrate: instead of scaleup playbook etcd server should be started back #7313

[3.7] etcd migrate: instead of scaleup playbook etcd server should be started back #7313

vrutkovs commented Feb 27, 2018 •

edited

michaelgugino Feb 27, 2018

vrutkovs Feb 27, 2018

michaelgugino left a comment

vrutkovs commented Feb 28, 2018

vrutkovs commented Mar 1, 2018

vrutkovs commented Mar 8, 2018

openshift-ci-robot commented Mar 8, 2018

[3.7] etcd migrate: instead of scaleup playbook etcd server should be started back #7313

[3.7] etcd migrate: instead of scaleup playbook etcd server should be started back #7313

Conversation

vrutkovs commented Feb 27, 2018 • edited

michaelgugino Feb 27, 2018

Choose a reason for hiding this comment

vrutkovs Feb 27, 2018

Choose a reason for hiding this comment

michaelgugino left a comment

Choose a reason for hiding this comment

vrutkovs commented Feb 28, 2018

vrutkovs commented Mar 1, 2018

vrutkovs commented Mar 8, 2018

openshift-ci-robot commented Mar 8, 2018

vrutkovs commented Feb 27, 2018 •

edited