
copy --refresh for vm from another server fails with: Snapshot cannot be restored due to subsequent snapshot(s) #457

Closed
hi-ko opened this issue Jan 31, 2024 · 2 comments · Fixed by #517


hi-ko commented Jan 31, 2024

  • Distribution: ubuntu
  • Distribution version: 22.04.3 LTS
  • Kernel version: 5.15.0-91-generic
  • Incus version: 0.5.1
  • Storage backend in use: zfs

copy --refresh for vm to another server fails on subsequent copy

When copying a VM using copy --refresh from another server, the first attempt succeeds, but once changes and snapshots are added, the copy fails with:

incus copy incus2:test-copy1 test-copy1 --refresh
Error: Failed instance creation: Error transferring instance data: Failed migration on target: Failed creating instance on target: Snapshot "snap0" cannot be restored due to subsequent snapshot(s). Set zfs.remove_snapshots to override

incus storage volume list zpool1|grep test-copy1
| virtual-machine            | test-copy1              |             | block        | 1       |
| virtual-machine (snapshot) | test-copy1/snap0        |             | block        | 0       |
root@lxd02:~# zfs list -t snapshot |grep test-copy1
zpool1/virtual-machines/test-copy1@snapshot-snap0                               60K      -     7.11M  -
zpool1/virtual-machines/test-copy1@snapshot-snap1                               60K      -     7.11M  -
zpool1/virtual-machines/test-copy1.block@snapshot-snap0                          0B      -      640M  -

I can remove snapshot-snap1 via zfs destroy, but the snapshot comes back on the next copy attempt and produces the error again. It looks like the zfs send/receive is done too early for VMs?
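The ZFS rule behind the error can be modeled in a few lines: a rollback may only restore the most recent snapshot unless the later snapshots are destroyed first, which is what the zfs.remove_snapshots option opts into. A minimal Go sketch of that constraint (the `dataset` type and its `rollback` method are hypothetical illustrations, not Incus code):

```go
package main

import (
	"fmt"
	"slices"
)

// dataset models a ZFS dataset's snapshot list, oldest first.
type dataset struct {
	name  string
	snaps []string
}

// rollback mimics `zfs rollback`: only the most recent snapshot can be
// restored; with removeLater (the zfs.remove_snapshots behaviour), the
// subsequent snapshots are dropped first.
func (d *dataset) rollback(snap string, removeLater bool) error {
	i := slices.Index(d.snaps, snap)
	if i < 0 {
		return fmt.Errorf("snapshot %q not found", snap)
	}
	if i != len(d.snaps)-1 {
		if !removeLater {
			return fmt.Errorf("Snapshot %q cannot be restored due to subsequent snapshot(s)", snap)
		}
		d.snaps = d.snaps[:i+1]
	}
	return nil
}

func main() {
	fs := &dataset{
		name:  "zpool1/virtual-machines/test-copy1",
		snaps: []string{"snapshot-snap0", "snapshot-snap1"},
	}
	// Rolling back to snap0 while snap1 exists reproduces the reported error.
	fmt.Println(fs.rollback("snapshot-snap0", false))
	// With remove_snapshots semantics, the rollback succeeds.
	fmt.Println(fs.rollback("snapshot-snap0", true), fs.snaps)
}
```

This mirrors why manually running zfs destroy only helps until the next refresh re-creates the subsequent snapshot before the rollback is attempted.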

Steps to reproduce

  1. Create VM on server1
    incus launch images:ubuntu/22.04/cloud test-copy1 --vm
  2. Create snapshot
    incus snapshot create test-copy1
  3. On server2 do initial copy
    incus copy server1:test-copy1 test-copy1 --refresh
  4. On server1 do changes and create new snapshot
    incus shell test-copy1
    dd if=/dev/urandom of=testfile count=10 bs=1M;exit
    incus snapshot create test-copy1
    
  5. Copy again from server2
    incus copy server1:test-copy1 test-copy1 --refresh
    Error: Failed instance creation: Error transferring instance data: Failed migration on target: Failed creating instance on target: Snapshot "snap0" cannot be restored due to subsequent snapshot(s). Set zfs.remove_snapshots to override

Information to attach

incus config show test-copy1
architecture: x86_64
config:
  image.architecture: amd64
  image.description: Ubuntu jammy amd64 (20240131_07:42)
  image.os: Ubuntu
  image.release: jammy
  image.serial: "20240131_07:42"
  image.type: disk-kvm.img
  image.variant: cloud
  volatile.base_image: b984c16267ca9d6ed22606193f6a30f394d4e7e0f76839357a67ca12027a93ec
  volatile.cloud-init.instance-id: 8e6bdd84-3dc4-4318-93e2-1b088ec50577
  volatile.eth0.host_name: tap0c081c0a
  volatile.eth0.hwaddr: 00:16:3e:4e:be:e2
  volatile.last_state.ready: "false"
  volatile.uuid: 96c7d6b8-9756-40c2-83f9-20cc65cdab0a
  volatile.uuid.generation: 96c7d6b8-9756-40c2-83f9-20cc65cdab0a
  volatile.vsock_id: "326816208"
devices: {}
ephemeral: false
profiles:
- default
stateful: false
description: ""
@stgraber stgraber self-assigned this Feb 11, 2024
@stgraber stgraber added the Bug Confirmed to be a bug label Feb 11, 2024
@stgraber stgraber added this to the incus-0.6 milestone Feb 11, 2024
@stgraber (Member) commented:
Got it reproduced. Interestingly, this only seems to affect virtual machines; containers don't suffer from the same issue.

@stgraber (Member) commented:
Oh, I see the problem... working on a fix now.

stgraber added a commit to stgraber/incus that referenced this issue Feb 22, 2024
The ZFS logic has most of its functions call themselves with the
filesystem dataset after processing the block volume for virtual
machines.

This is usually correct, but in the case of receiving a refresh stream,
we specifically want to handle the snapshot rollback separately for both
volumes.

Closes lxc#457

Signed-off-by: Stéphane Graber <stgraber@stgraber.org>
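The ordering issue described in the commit can be sketched abstractly: during a refresh, each of a VM's two datasets (block and filesystem) must be rolled back to the last common snapshot before the newer snapshots are received; receiving first leaves "subsequent snapshots" that block the rollback. A simplified Go model, assuming hypothetical `vol` and `refresh` helpers that are not Incus's actual code:

```go
package main

import (
	"fmt"
	"slices"
)

// vol models one ZFS dataset's snapshot list, oldest first.
type vol struct{ snaps []string }

// rollback mimics `zfs rollback`: only the latest snapshot can be restored.
func (v *vol) rollback(snap string) error {
	i := slices.Index(v.snaps, snap)
	if i < 0 || i != len(v.snaps)-1 {
		return fmt.Errorf("Snapshot %q cannot be restored due to subsequent snapshot(s)", snap)
	}
	return nil
}

// receive mimics `zfs receive` of an incremental snapshot stream.
func (v *vol) receive(snap string) { v.snaps = append(v.snaps, snap) }

// refresh sketches the corrected ordering: roll every dataset back to the
// last common snapshot *before* receiving the newer ones, handling the
// block and filesystem datasets independently rather than via recursion.
func refresh(vols []*vol, common string, newer []string) error {
	for _, v := range vols {
		if err := v.rollback(common); err != nil {
			return err
		}
	}
	for _, v := range vols {
		for _, s := range newer {
			v.receive(s)
		}
	}
	return nil
}

func main() {
	// Wrong ordering: receiving snap1 before rolling back fails.
	fs := &vol{snaps: []string{"snap0"}}
	fs.receive("snap1")
	fmt.Println(fs.rollback("snap0"))

	// Corrected ordering: rollback each dataset first, then receive.
	block := &vol{snaps: []string{"snap0"}}
	fs2 := &vol{snaps: []string{"snap0"}}
	fmt.Println(refresh([]*vol{block, fs2}, "snap0", []string{"snap1"}))
}
```

The key design point matches the commit message: instead of letting the block-volume code path recurse into the filesystem dataset after the fact, the refresh path handles the rollback for both volumes explicitly and up front.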