[FEATURE] New setting to limit the concurrent volume restoring from backup #4558

c3y1huang · 2022-09-14T04:07:41Z

Is your feature request related to a problem? Please describe

We like to support to rolling out the volume with data from backup/backing-image. However, Longhorn has no limitation on the concurrent volume restoring from backup.

Longhorn should have another setting similar to concurrent-replica-rebuild-per-node-limit.

Describe the solution you'd like

Introduce a new setting to limit the concurrent volume creating from the backup.

Describe alternatives you've considered

Handle in rollout-load controller.

Additional context

#4388 (comment)

The text was updated successfully, but these errors were encountered:

longhorn-io-github-bot · 2022-12-05T07:37:15Z

Pre Ready-For-Testing Checklist

chriscchien · 2022-12-09T09:11:13Z

Hi @c3y1huang

I have a 3 nodes cluster, 8 backups from different volumes and Longhorn set Concurrent Volume Backup Restore Per Node Limit to 1
After I select all backups in backup page then click Restore latest Backup
The concurrent restore number somehow will greater then 3 as below picture shown.

In addition, I had a simple test script to calculate rebuilding numbers from backend API, when restoring, somehow it will give me number greater than 3 which reflected the situation as UI displayed.

def test_count_restore_limit(client):

    for i in range(1000):
        try:
            restoring = 0
            volumes = client.list_volume()
            for v in volumes:
                if v.restoreStatus and v.restoreStatus[0].progress != 0 and v.restoreStatus[0].progress != 100:
                    restoring = restoring + 1
                else:
                    pass
        except:
            print("x")
            pass

        print(restoring)
        time.sleep(1)

chriscchien · 2022-12-12T00:52:12Z

Did several times of restoring volume with 1 replica and Concurrent Volume Backup Restore Per Node Limit set to 1

At begging the restore number = 3 matched the setting. But there still have chance that the restoring volume number surpassed Concurrent Volume Backup Restore Per Node Limit value, not every time, but not very hard to reproduce ( Can reproduce every time when restoring volume with 3 replicas)

In addition, last success build of longhorn-engine was 6 days ago, not sure if this impacted

chriscchien · 2022-12-15T07:35:15Z

Verified in longhorn master aa3998 with steps
Result Pass

Environment:
- 3 nodes cluster
- 8 backups from different volume, each size were 2Gi

The setting won't effect DR volumes, no matter the setting is 0 or not, all DR volumes were restoring at the same time.
Set concurrent-volume-backup-restore-per-node-limit to 0, then restore all volumes, can see volume attach to nodes in maintenance mode and restoring never started.
Set concurrent-volume-backup-restore-per-node-limit to 1, can observe volume start restoring and each node's restoring number not exceed the setting.
Also tested concurrent-volume-backup-restore-per-node-limit = 2 , restore volume with 1 and 3 replicas, all worked well, the restoring number not exceed setting per node.

c3y1huang added the kind/feature Feature request, new feature label Sep 14, 2022

c3y1huang mentioned this issue Sep 14, 2022

feat(system-backup/restore): add LEP #4388

Merged

innobead added this to the v1.4.0 milestone Sep 14, 2022

innobead added priority/0 Must be fixed in this release (managed by PO) area/volume-backup-restore Volume backup restore area/recurring-job Longhorn recurring job related component/longhorn-manager Longhorn manager (control plane) labels Sep 14, 2022

innobead assigned c3y1huang Sep 14, 2022

innobead added the require/lep Require adding/updating enhancement proposal label Dec 2, 2022

c3y1huang added the require/auto-e2e-test Require adding/updating auto e2e test cases if they can be automated label Dec 5, 2022

github-actions bot mentioned this issue Dec 5, 2022

[TEST][FEATURE] New setting to limit the concurrent volume restoring from backup. #4993

Closed

innobead changed the title ~~[FEATURE] New setting to limit the concurrent volume restoring from backup.~~ [FEATURE] New setting to limit the concurrent volume restoring from backup Dec 7, 2022

innobead added area/system-backup-restore Longhorn system backup restore require/doc Require updating the longhorn.io documentation labels Dec 7, 2022

innobead assigned chriscchien Dec 9, 2022

c3y1huang mentioned this issue Dec 13, 2022

fix(concurrent-backup-restore): decreased counter before restored longhorn/longhorn-manager#1611

Merged

longhorn deleted a comment from longhorn-io-github-bot Dec 13, 2022

c3y1huang mentioned this issue Dec 15, 2022

[BACKPORT][v1.4.0] fix(concurrent-backup-restore): decreased counter before restored longhorn/longhorn-manager#1620

Merged

chriscchien closed this as completed Dec 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] New setting to limit the concurrent volume restoring from backup #4558

[FEATURE] New setting to limit the concurrent volume restoring from backup #4558

c3y1huang commented Sep 14, 2022

longhorn-io-github-bot commented Dec 5, 2022 •

edited by c3y1huang

chriscchien commented Dec 9, 2022

chriscchien commented Dec 12, 2022 •

edited

chriscchien commented Dec 15, 2022

[FEATURE] New setting to limit the concurrent volume restoring from backup #4558

[FEATURE] New setting to limit the concurrent volume restoring from backup #4558

Comments

c3y1huang commented Sep 14, 2022

Is your feature request related to a problem? Please describe

Describe the solution you'd like

Describe alternatives you've considered

Additional context

longhorn-io-github-bot commented Dec 5, 2022 • edited by c3y1huang

Pre Ready-For-Testing Checklist

chriscchien commented Dec 9, 2022

chriscchien commented Dec 12, 2022 • edited

chriscchien commented Dec 15, 2022

longhorn-io-github-bot commented Dec 5, 2022 •

edited by c3y1huang

chriscchien commented Dec 12, 2022 •

edited