
[FEATURE] Application consistent snapshot/backup, Volume Group #2128

Open
yasker opened this issue Dec 21, 2020 · 14 comments

@yasker
Member

yasker commented Dec 21, 2020

Is your feature request related to a problem? Please describe.
Currently, Longhorn can take crash-consistent snapshots/backups. But for applications like databases, quiescing is needed to create an application-consistent snapshot, so that the application flushes all in-memory data to disk before the snapshot is taken.

Describe the solution you'd like
Provide a way for users to quiesce the workload before taking the snapshot/backup.

Describe alternatives you've considered
Users can also script the backup using the CSI snapshotter and run quiescing/unquiescing around the snapshot (see the sketch below). But that won't work with recurring snapshots/backups scheduled by Longhorn.
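
For reference, a minimal sketch of that manual flow, assuming a Longhorn-backed PVC named `mysql-pvc` and a VolumeSnapshotClass named `longhorn-snapshot-vsc` (both names are illustrative): quiesce the application, create the CSI snapshot, wait for it to become ready, then unquiesce.

```yaml
# Created by the backup script after quiescing the application.
apiVersion: snapshot.storage.k8s.io/v1
kind: VolumeSnapshot
metadata:
  name: mysql-pvc-snap
  namespace: default
spec:
  volumeSnapshotClassName: longhorn-snapshot-vsc  # illustrative class name
  source:
    persistentVolumeClaimName: mysql-pvc
```

The script would poll the snapshot's `status.readyToUse` field and unquiesce the application once it is true.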

Additional context
An application is normally composed of multiple workloads, so we need to consider the volume group scenario as well.

@yasker yasker added kind/feature, component/longhorn-manager, priority/0, require/auto-e2e-test, require/doc, require/lep, require/API-design, area/volume-backup-restore labels Dec 21, 2020
@yasker yasker added this to the v1.2.0 milestone Dec 21, 2020
@innobead
Member

@jenting Please help with this. Thanks.

@joshimoo
Contributor

linking the backup refactor issue #1761

@janeczku
Contributor

janeczku commented Feb 4, 2021

Application state may be distributed across multiple persistent volumes (e.g. a sharded DB, distributed indexes). An application-consistent backup must then ensure that snapshots are performed in a point-in-time-consistent manner across all volumes.

To offer such a feature, Longhorn could allow users to create "volume groups" and define on-demand or scheduled backup plans for the whole group instead of for individual volumes.

@innobead
Member

innobead commented Feb 8, 2021

> Application state may be distributed across multiple persistent volumes (e.g. a sharded DB, distributed indexes). An application-consistent backup must then ensure that snapshots are performed in a point-in-time-consistent manner across all volumes.
>
> To offer such a feature, Longhorn could allow users to create "volume groups" and define on-demand or scheduled backup plans for the whole group instead of for individual volumes.

Thanks for the comment. We discussed a similar idea before. The goal is to introduce a mechanism that supports atomic operations across all the volumes of an application and, as you said, not just on individual volumes.

Please follow up on the discussion and proposals in the near future.

@innobead innobead modified the milestones: v1.2.0, v1.3.0 Apr 29, 2021
@innobead innobead changed the title [FEATURE]Application consistent snapshot/backup [FEATURE] Application consistent snapshot/backup Sep 21, 2021
@innobead innobead added the highlight label Oct 21, 2021
@innobead
Member

Hey team! Please add your planning poker estimate with ZenHub @jenting @PhanLe1010 @shuo-wu @joshimoo

@joshimoo
Contributor

There is a lot to consider here and high uncertainty: volume groups, pre/post hooks (executed inside the workload pods), error handling, etc. Now that we have backup CRDs, we can also consider exposing (syncing) these CRDs as CSI snapshots via creation of the appropriate CSI VolumeSnapshot and VolumeSnapshotContent resources.
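
For illustration, a rough sketch of what that syncing could look like, using the pre-provisioned (static) CSI snapshot flow. `driver.longhorn.io` is Longhorn's CSI driver name; the `bak://<volume>/<backup>` handle format and all object names here are assumptions for illustration:

```yaml
# Pre-provisioned VolumeSnapshotContent pointing at an existing Longhorn backup,
# plus the namespaced VolumeSnapshot that binds to it. Sketch only.
apiVersion: snapshot.storage.k8s.io/v1
kind: VolumeSnapshotContent
metadata:
  name: existing-backup-content
spec:
  deletionPolicy: Retain                 # keep the backup if this object is deleted
  driver: driver.longhorn.io
  source:
    snapshotHandle: bak://pvc-volume/backup-abc123   # assumed handle format
  volumeSnapshotRef:
    name: existing-backup
    namespace: default
---
apiVersion: snapshot.storage.k8s.io/v1
kind: VolumeSnapshot
metadata:
  name: existing-backup
  namespace: default
spec:
  source:
    volumeSnapshotContentName: existing-backup-content
```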

@innobead innobead assigned c3y1huang and unassigned derekbit Dec 23, 2021
@innobead innobead added the reprioritization-needed label Jan 10, 2022
@shuo-wu
Contributor

shuo-wu commented Feb 16, 2022

Leaving a note here:
Velero relies on annotation hooks to execute commands before/after a backup. We can do something similar to achieve application-consistent snapshots/backups (run sync and freeze the filesystem before the snapshot/backup).
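
For context, this is roughly what Velero's annotation hooks look like on a workload pod; the image, paths, and names are illustrative, and `fsfreeze` needs sufficient privileges inside the container:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: mysql
  annotations:
    # Velero runs these commands inside the named container before/after backing
    # up the pod's volumes (assuming fsfreeze is available in the image).
    pre.hook.backup.velero.io/container: mysql
    pre.hook.backup.velero.io/command: '["/sbin/fsfreeze", "--freeze", "/var/lib/mysql"]'
    post.hook.backup.velero.io/container: mysql
    post.hook.backup.velero.io/command: '["/sbin/fsfreeze", "--unfreeze", "/var/lib/mysql"]'
spec:
  containers:
    - name: mysql
      image: mysql:8
      securityContext:
        privileged: true               # fsfreeze requires privileged access
      volumeMounts:
        - name: data
          mountPath: /var/lib/mysql
  volumes:
    - name: data
      persistentVolumeClaim:
        claimName: mysql-pvc
```

A Longhorn equivalent could follow the same pattern: a pre hook that syncs/freezes the filesystem and a post hook that unfreezes it.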

Next: investigate Kanister.

@innobead innobead modified the milestones: v1.3.0, v1.4.0 Mar 18, 2022
@innobead innobead changed the title [FEATURE] Application consistent snapshot/backup [FEATURE] Application consistent snapshot/backup, Volume Group May 20, 2022
@R-Studio

R-Studio commented Jun 9, 2022

This feature would be awesome! 👍🏽😃
@innobead can you estimate when this feature will be implemented? (I ask because it has been an open feature request since Dec 21, 2020.)

@innobead
Member

ref: kubernetes/enhancements#3476

@innobead innobead added area/kubernetes, area/csi labels Jan 14, 2023
@innobead
Member

innobead commented Feb 7, 2023

ref: kubernetes/enhancements#3476

container-storage-interface/spec#519
https://github.com/container-storage-interface/spec/releases/tag/v1.8.0 (introduces VolumeGroupSnapshot-related RPCs under the new GroupController service as alpha)

@innobead innobead removed the reprioritization-needed label Feb 8, 2023
@innobead innobead modified the milestones: v1.5.0, v1.6.0 Feb 8, 2023
@innobead
Member

An interesting project that can be referenced as well: https://github.com/kanisterio/kanister.

@innobead innobead assigned c3y1huang and unassigned PhanLe1010 Aug 16, 2023
@c3y1huang
Contributor

c3y1huang commented Aug 24, 2023

> container-storage-interface/spec#519 https://github.com/container-storage-interface/spec/releases/tag/v1.8.0 (introduces VolumeGroupSnapshot-related RPCs under the new GroupController service as alpha)

Note:

  • The Kubernetes VolumeGroupSnapshot supports PVCs only and is triggered by the PVC label group=<group_name>. This means a PVC can only be associated with a single group, which is probably not ideal for RWX PVCs that might be used by multiple workloads (see the sketch after this list).
  • The most recent release of kubernetes-csi/external-snapshotter, v6.2.2, doesn't include an implementation of the volume group snapshot APIs; the feature currently sits in the master branch.
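
A sketch of the label-selector trigger described above, based on the alpha API in the external-snapshotter master branch (field and group names may change before release; the class name is illustrative):

```yaml
# Snapshots all PVCs labeled group=my-app in the namespace as one group.
apiVersion: groupsnapshot.storage.k8s.io/v1alpha1
kind: VolumeGroupSnapshot
metadata:
  name: my-app-group-snapshot
  namespace: default
spec:
  volumeGroupSnapshotClassName: longhorn-group-snapclass   # illustrative
  source:
    selector:
      matchLabels:
        group: my-app
```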

@innobead
Member

innobead commented Aug 24, 2023

> container-storage-interface/spec#519 https://github.com/container-storage-interface/spec/releases/tag/v1.8.0 (introduces VolumeGroupSnapshot-related RPCs under the new GroupController service as alpha)
>
> Note:
>
> • The Kubernetes VolumeGroupSnapshot supports PVCs only and is triggered by the PVC label group=<group_name>. This means a PVC can only be associated with a single group, which is probably not ideal for RWX PVCs that might be used by multiple workloads.

This new feature seems similar to the current VolumeSnapshot, which means users can already back up RWX volumes used by multiple workloads, so that concern isn't specific to this new interface. It is a general interface, so basically we don't need to worry about whether it is ideal for different volume access modes.

I see, then we have two phases here. In the first phase (1.6), let's introduce an internal mechanism like AppBackup and AppRestore, or VolumeGroupBackup and VolumeGroupRestore if we want to make it more general. In the second phase (after 1.6), when VolumeGroupSnapshot is ready upstream, we can do the same backup & restore, but with the trigger point on the CSI path instead of users or recurring jobs creating the AppBackup/AppRestore directly. (A hypothetical sketch of such a CR follows the summary below.)

To sum up:

  • The creator/driver of an AppBackup will be a user, a recurring job, or CSI. (DR should naturally be the same.)
  • The creator of an AppRestore will be a user or CSI.
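
To make the first phase concrete, a purely hypothetical sketch of what a VolumeGroupBackup CR could look like; none of these fields exist in Longhorn today, and the final design would come from the LEP:

```yaml
# Hypothetical sketch only -- illustrating the proposed internal mechanism.
apiVersion: longhorn.io/v1beta2
kind: VolumeGroupBackup
metadata:
  name: my-app-backup
  namespace: longhorn-system
spec:
  # Volumes to snapshot together at a single point in time, as one atomic group.
  volumes:
    - pvc-1ab2c3d4
    - pvc-5ef6a7b8
  # Quiesce/freeze the workload filesystems before taking the group snapshot.
  quiesce: true
```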

@c3y1huang
Contributor

Backlogging this first to work on higher-priority items.
