[TASK] Restore to a brand new cluster that does not contain anything #3367

shuo-wu · 2021-12-03T11:59:58Z

What's the task? Please describe

Provide a doc that explains how to restore Longhorn to an empty cluster via Velero.

Additional context

Refer to the discussion in https://cloud-native.slack.com/archives/CNVPEL9U3/p1638530150134100?thread_ts=1638467093.129100&cid=CNVPEL9U3

tom-mayer · 2021-12-03T14:04:58Z

We wanted a very generic solution for backing up and restoring, from scratch, a cluster that uses Longhorn as its storage provider. For the etcd backup we use Velero.

We back up the cluster state and exclude the following resources (note that we are also excluding PVs and PVCs):
- persistentvolumes
- persistentvolumeclaims
- volumes.longhorn.io
- backups.longhorn.io
- backupvolumes.longhorn.io
- nodes.longhorn.io
- replicas.longhorn.io
- engines.longhorn.io
- backingimagedatasources.longhorn.io
- backingimagemanagers.longhorn.io
- backingimages.longhorn.io
- sharemanagers.longhorn.io
- instancemanagers.longhorn.io
- engineimages.longhorn.io
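
As a rough illustration of the exclusion list above, the following Python sketch assembles a `velero backup create` command line that passes the listed resource types to Velero's `--exclude-resources` flag. The backup name `cluster-state` and the helper `build_backup_command` are hypothetical, and the thread does not show the exact command that was used, so treat this as one possible way to express the exclusions rather than the author's actual invocation. The script only prints the command; it does not run it.

```python
import shlex

# Resource types excluded from the backup, copied from the list above.
EXCLUDED_RESOURCES = [
    "persistentvolumes",
    "persistentvolumeclaims",
    "volumes.longhorn.io",
    "backups.longhorn.io",
    "backupvolumes.longhorn.io",
    "nodes.longhorn.io",
    "replicas.longhorn.io",
    "engines.longhorn.io",
    "backingimagedatasources.longhorn.io",
    "backingimagemanagers.longhorn.io",
    "backingimages.longhorn.io",
    "sharemanagers.longhorn.io",
    "instancemanagers.longhorn.io",
    "engineimages.longhorn.io",
]


def build_backup_command(backup_name):
    """Return the argv list for a Velero backup that skips the resources above."""
    return [
        "velero", "backup", "create", backup_name,
        "--exclude-resources", ",".join(EXCLUDED_RESOURCES),
    ]


if __name__ == "__main__":
    # "cluster-state" is a hypothetical backup name used for illustration.
    print(shlex.join(build_backup_command("cluster-state")))
```

Keeping the exclusions in one list makes it easier to review them when a new Longhorn release adds or renames custom resources.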

1. On the new, blank cluster, we install Velero via its CLI.
2. We restore the backup we took earlier with Velero, which makes Longhorn spin up with the old settings. The Longhorn deployment and pods were in the backup, which is why it is not necessary to reinstall Longhorn; that was one of our goals. The settings and backupTarget are restored when Velero restores the CRDs. All pods that require Longhorn volumes are now waiting because their PVCs do not exist (we left them out of the backup for exactly that reason).
3. When the Longhorn UI is ready, we click the "Backup" tab, which loads all the backups from the restored backupTarget. We can then batch-restore all the volumes from there.
4. On the Volumes tab, once the volumes are ready, we recreate the PVs/PVCs from the UI (Actions -> Create PVC). It is crucial to tick the "Use previous PVC" checkbox in the dialog.
5. The claims get auto-attached to the waiting pods.

This works because we restored etcd together with Longhorn: all the original namespaces are there, and Longhorn recreates the PVCs in their original namespaces, so the pods heal once the claims exist.
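
To see which claims the waiting pods still need before recreating PVCs from the Longhorn UI, a small helper along these lines could be used. It is a sketch, not part of the procedure described above: it assumes the pod list has the shape of `kubectl get pods -A -o json` output, the function name `claims_of_pending_pods` and the sample data are hypothetical, and a pod may be Pending for reasons other than a missing PVC.

```python
def claims_of_pending_pods(pod_list):
    """Return sorted (namespace, claimName) pairs referenced by Pending pods.

    pod_list is a dict shaped like the JSON printed by
    `kubectl get pods -A -o json` (a List object with an "items" array).
    """
    claims = set()
    for pod in pod_list.get("items", []):
        # Only pods that have not been scheduled/started are of interest.
        if pod.get("status", {}).get("phase") != "Pending":
            continue
        namespace = pod["metadata"]["namespace"]
        for volume in pod.get("spec", {}).get("volumes", []):
            pvc = volume.get("persistentVolumeClaim")
            if pvc:
                claims.add((namespace, pvc["claimName"]))
    return sorted(claims)


if __name__ == "__main__":
    # Hypothetical sample data; real input would come from kubectl.
    sample = {
        "items": [
            {
                "metadata": {"namespace": "apps", "name": "db-0"},
                "spec": {"volumes": [
                    {"name": "data",
                     "persistentVolumeClaim": {"claimName": "db-data"}},
                ]},
                "status": {"phase": "Pending"},
            },
            {
                "metadata": {"namespace": "apps", "name": "web-0"},
                "spec": {"volumes": [{"name": "tmp", "emptyDir": {}}]},
                "status": {"phase": "Running"},
            },
        ]
    }
    for namespace, claim in claims_of_pending_pods(sample):
        print(f"{namespace}/{claim}")
```

Comparing this list with the volumes restored in the Longhorn UI gives a quick check that every waiting pod will find its claim in the original namespace.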

longhorn-io-github-bot · 2021-12-20T11:17:06Z

Pre Ready-For-Testing Checklist

* [ ] Where is the reproduce steps/test steps documented?
The reproduce steps/test steps are at:

* [ ] Is there a workaround for the issue? If so, where is it documented?
The workaround is at:

Does the PR include the explanation for the fix or the feature?

* [ ] Does the PR include deployment change (YAML/Chart)? If so, where are the PRs for both YAML file and Chart?
The PR for the YAML change is at:
The PR for the chart change is at:

* [ ] Have the backend code been merged (Manager, Engine, Instance Manager, BackupStore etc) (including backport-needed/*)?
The PR is at

* [ ] Which areas/issues this PR might have potential impacts on?
Area
Issues

* [ ] If labeled: require/LEP Has the Longhorn Enhancement Proposal PR submitted?
The LEP PR is at

* [ ] If labeled: area/ui Has the UI issue filed or ready to be merged (including backport-needed/*)?
The UI issue/PR is at

If labeled: require/doc Has the necessary document PR submitted or merged (including backport-needed/*)?
The documentation issue/PR is at
Add a doc that instructs how users can restore Longhorn to a new cluster website#440
[Backport][v1.2.3]Cluster restore doc website#442

* [ ] If labeled: require/automation-e2e Has the end-to-end test plan been merged? Have QAs agreed on the automation test case? If only test case skeleton w/o implementation, have you created an implementation issue (including backport-needed/*)
The automation skeleton PR is at
The automation test case PR is at
The issue of automation test case implementation is at (please create by the template)

* [ ] If labeled: require/automation-engine Has the engine integration test been merged (including backport-needed/*)?
The engine automation PR is at

If labeled: require/manual-test-plan Has the manual test plan been documented?
The updated manual test plan is at Add a test for the new cluster restore instruction longhorn-tests#836

* [ ] If the fix introduces the code for backward compatibility Has a separate issue been filed with the label release/obsolete-compatibility?
The compatibility issue is filed at

khushboo-rancher · 2022-02-17T01:20:11Z

Verified with master-head 02/16/2022

Validation - pass

Followed the instructions from longhorn/website#440 to restore to a new cluster. It worked as expected.

shuo-wu added the area/install-uninstall-upgrade, require/doc, and kind/task labels Dec 3, 2021

innobead added this to the Backlog milestone Dec 3, 2021

shuo-wu modified the milestones: Backlog, v1.3.0 Dec 7, 2021

shuo-wu self-assigned this Dec 7, 2021

shuo-wu mentioned this issue Dec 20, 2021

Add a doc that instructs how users can restore Longhorn to a new cluster longhorn/website#440

Merged

This was referenced Dec 21, 2021

[BUG] The default access mode of a restored RWX volume is RWO #3444

Closed

Add a test for the new cluster restore instruction longhorn/longhorn-tests#836

Merged

EugenMayer mentioned this issue Jan 1, 2022

[Question] - Is it possible to export the longhorn configuration using Velero/Restic ? #2134

Closed

shuo-wu mentioned this issue Jan 4, 2022

[Backport][v1.2.3]Cluster restore doc longhorn/website#442

Merged

khushboo-rancher self-assigned this Jan 11, 2022

khushboo-rancher closed this as completed Feb 17, 2022
