Test case: zpool_expand_* #2437

Closed
FransUrbo opened this issue Jun 28, 2014 · 27 comments
Labels
Component: Test Suite (indicates an issue with the test framework or a test case) · Component: ZED (ZFS Event Daemon)

Comments

@FransUrbo
Contributor

Way to reproduce:

#!/bin/bash

rm -f /var/tmp/zfs_test-1
truncate -s 25000m /var/tmp/zfs_test-1

zpool create test /var/tmp/zfs_test-1
for i in {1..3}; do
        zfs create -V 1g test/vol$i
done
sleep 1

zpool create -o autoexpand=on test2 /dev/zvol/test/vol1  /dev/zvol/test/vol2 /dev/zvol/test/vol3

autoexp=$(zpool get autoexpand test2 2> /dev/null | tail -1 | awk '{print $3}')
echo "Autoexpand on test2: '$autoexp'"

prev_size=$(zpool get size test2 2> /dev/null | tail -1 | awk '{print $3}')
zfs_prev_size=$(zfs get -pH -o value avail test2)

for i in {1..3}; do
        zfs set volsize=2g test/vol$i
done

sync
sleep 10
sync

expand_size=$(zpool get size test2 2> /dev/null | tail -1 | awk '{print $3}')
zfs_expand_size=$(zfs get -pH -o value avail test2)

echo "test has previous size: '$prev_size' ($zfs_prev_size) and expanded size: '$expand_size' ($zfs_expand_size)"

Results in:

# /tmp/testme.sh
Autoexpand on test2: 'on'
test has previous size: '2.98G' (3145056256) and expanded size: '2.98G' (3145031680)
@behlendorf behlendorf added this to the 0.6.4 milestone Jun 29, 2014
@behlendorf
Contributor

This is expected, although it is a bug / missing feature. It's my understanding that the way this works on Illumos is that an autoexpand event is handled by their FMA infrastructure. In turn FMA calls zpool online -e pool device to expand the device. We'll want to do something equivalent with the ZED on Linux.
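
A minimal sketch of what such a zedlet could look like, assuming the event delivered to the ZED carries ZEVENT_POOL and ZEVENT_VDEV_PATH (names are assumptions here for a resize-style event; the real implementation may differ):

#!/bin/sh
# Hypothetical zedlet sketch, not the shipped ZED logic: when a device
# change event arrives for a pool with autoexpand=on, re-online the vdev
# with -e so the pool can claim the new capacity.
[ -n "${ZEVENT_POOL}" ] || exit 1          # assumed to be provided by the ZED
[ -n "${ZEVENT_VDEV_PATH}" ] || exit 2     # assumed to be provided by the ZED

# Only act when the pool has autoexpand enabled.
autoexpand=$(zpool get autoexpand "${ZEVENT_POOL}" 2>/dev/null | tail -1 | awk '{print $3}')
[ "${autoexpand}" = "on" ] || exit 0

# Equivalent of what FMA does on Illumos.
exec zpool online -e "${ZEVENT_POOL}" "${ZEVENT_VDEV_PATH}"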

@behlendorf behlendorf added the zed label Jun 29, 2014
@FransUrbo
Contributor Author

@behlendorf why is this tagged 'Test Suite'? It was FOUND by the test suite, but the problem isn't with the test suite itself; it's with ZFS (ZED).

@behlendorf behlendorf removed this from the 0.6.4 milestone Nov 6, 2014
@andyeff

andyeff commented Oct 20, 2015

Apologies if this isn't quite on topic, but I'm having a problem with manually expanding a pool after a LUN expansion, not with the automatic case.

Environment: RHEL 6.5, ZFS 0.6.5.3-1, VMWare Workstation 9.0.4.

I have added a 1GB disk, /dev/sdb, and created a pool: zpool create tank sdb
I power off the VM and expand the disk to 2GB in VMware.
I power on, and then I attempt to grow the zpool to the available space.

I have tried enabling autoexpand on tank, rebooting, zpool online -e tank sdb, and exporting/importing the pool, but it will not grow.

Partition view:

[root@rheltest ~]# parted /dev/sdb unit s print
Model: VMware, VMware Virtual S (scsi)
Disk /dev/sdb: 4194304s
Sector size (logical/physical): 512B/512B
Partition Table: gpt

Number  Start     End       Size      File system  Name  Flags
 1      2048s     2078719s  2076672s               zfs
 9      2078720s  2095103s  16384s

I am unsure whether the two partitions that were created are standard. Is the small second partition (9) blocking the first from being able to expand? The extra disk in VMware was turned into a zpool without doing anything else to it first - I did not create any partitions beforehand, I just gave it the whole disk.

@andyeff

andyeff commented Oct 27, 2015

I did some further testing: this time I created an LVM physical volume, volume group and logical volume. If I use that logical volume for ZFS, I can extend both the LVM logical volume and the ZFS pool / filesystem quite happily.

I suspect I'm running into an unsupported case by trying to extend ZFS when a raw disk device is used during pool creation.

@odoucet

odoucet commented Jan 13, 2016

We experience the same issue as @andyeff (with Xen instead of VMware).
zpool online -e does not do the trick, nor does a reboot.
We previously got this working, but cannot remember which version we used at that time (0.6.2 or 0.6.3). It is not working on 0.6.5.4-1.

@ilovezfs
Contributor

@odoucet did you try offlining and then re-onlining -e?

@odoucet

odoucet commented Jan 13, 2016

Cannot be done on this specific zpool because it is made of only one device, and zfs refused to offline the device in this case ("no valid replica").

@ThomasRasmussen

I've hit the same issue as @andyeff, but on Ubuntu 16.04 LTS with its standard ZFS implementation.
I've created a blank disk in VMware (4GiB), run zpool create -o autoexpand=on -f tank /dev/sdb, and now parted lists two partitions.
I've then extended the VMware disk to 5GiB and made sure that the kernel actually recognizes this change (fdisk et al. show the disk as 5GiB).
I suspect that the small partition (8MiB in size) is in fact blocking autoexpand from working. I can delete this partition and adjust partition 1 to the new size, and then zpool online -e tank /dev/sdb actually adjusts the pool to the new size.

But deleting this 8MiB partition doesn't sound like a good solution; there must be a reason why it is there? I haven't experienced any problems yet, but still...

Any thoughts, comments or recommendations as to the correct procedure to expand a disk?

@FransUrbo
Contributor Author

In ZoL, this isn't done automatically (yet). You need to online this new disk (actually, the new space) for the additional space to be seen.

@ThomasRasmussen

ThomasRasmussen commented Jun 13, 2016

If I understand everything correctly, then this should be a proper guide:

  • /dev/sdb attached, 5GB
  • zpool create -o autoexpand=on -f tank /dev/sdb
  • zpool list confirms that it is 5GB
  • expand disk in vmware to 10GB
      • reboot server, to ensure the kernel sees 10GB
      • confirmed using parted -l /dev/sdb
  • zpool online -e tank /dev/sdb

But this is not true... My tests show that I need to do this (consolidated into a script sketch after the list):

  • /dev/sdb attached, 5GB
  • zpool create -o autoexpand=on -f tank /dev/sdb
  • zpool list confirms that it is 5GB
  • expand disk in vmware to 10GB
      • reboot server (or rescan the SCSI bus), to ensure the kernel sees 10GB
      • confirmed using parted -l /dev/sdb
  • parted: delete partition 9 on /dev/sdb
  • parted: resize partition 1 on /dev/sdb to 100%
  • zpool online -e tank /dev/sdb
  • confirm correct 10GB on tank
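
A consolidated sketch of that manual workaround, assuming a whole-disk pool named tank on /dev/sdb whose virtual disk has already been grown (pool and device names are examples from this thread; it rewrites the partition table, so use with care):

#!/bin/bash
# Sketch of the manual expansion workaround described above (ZoL 0.6.5.x era).
set -e

# Make the kernel notice the new disk size without a reboot.
for dev in /sys/class/scsi_device/*/device/rescan; do
        echo 1 > "$dev"
done

# parted may first ask to "Fix" the GPT so it covers the grown disk; accept that.
# Then drop the small reserve partition (9) and grow the ZFS partition (1).
parted /dev/sdb print
parted /dev/sdb rm 9
parted /dev/sdb resizepart 1 100%

# Let ZFS claim the additional space.
zpool online -e tank /dev/sdb
zpool list tank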

@ilovezfs
Contributor

zpool online -e is supposed to do the re-partitioning – not you – if you're using whole_disk=1.

@ThomasRasmussen

@ilovezfs in which version of ZoL did whole_disk appear? On my host I have

ii zfsutils-linux 0.6.5.6-0ubuntu9 amd64 Native OpenZFS management utilities for Linux

@andyeff

andyeff commented Jun 13, 2016

@ilovezfs Hmm this might have some relation to #2296 whereby /dev/mapper drives / multipath drives are being left as whole_disk=0 (which I believe is an issue I have)

At a guess, I can't switch whole_disk from 0 to 1 on an existing pool?

@ilovezfs
Contributor

whole_disk=0/1 is an upstream thing that's older than the hills.

Yeah, anything that's a partition OR not "real" is supposed to be whole_disk=0, so that is correct behavior for dev mapper.

@ThomasRasmussen

@ilovezfs hmmm... How is whole_disk enabled?

sudo zpool create -o whole_disk=1 pool02 /dev/sdc
property 'whole_disk' is not a valid pool property
sudo zpool create -O whole_disk=1 pool02 /dev/sdc
cannot create 'pool02': invalid property 'whole_disk'

@ilovezfs
Contributor

ilovezfs commented Jun 13, 2016

It's handled under the hood automatically. If you have a physical disk, and you use "sdc" not "sdc#{number}", then it will be using whole_disk=1 (i.e., it will autopartition, putting the pool on partition 1 and reserving some space in a partition 9). zpool online -e handles the task of rewriting the partition table in this case.
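
One way to see how ZFS recorded a leaf vdev is to dump the cached pool configuration with zdb, which includes a whole_disk field per leaf vdev (the output below is trimmed and illustrative, not captured from any system in this thread):

# zdb -C tank
MOS Configuration:
        ...
        vdev_tree:
            ...
            children[0]:
                type: 'disk'
                path: '/dev/sdb1'
                whole_disk: 1
                ...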

@andyeff

andyeff commented Jun 13, 2016

@ilovezfs Is there a standard method for doing this manually, e.g. if using SAN-provided disks as /dev/mapper devices, where whole_disk won't be 1? What's involved in the partition rewriting?

@ilovezfs
Contributor

ilovezfs commented Jun 13, 2016

For whole_disk=0 devices, you resize the faux-device or partition yourself and then run zpool online -e #{pool} #{faux device or partition}. autoexpand=on seems to have no impact in that case, even if you export and import.
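
A short sketch of that whole_disk=0 path for the LVM case mentioned earlier (volume group, logical volume, and pool names here are placeholders):

# Grow the underlying device yourself...
lvextend -L +5G /dev/vg0/zfsvol

# ...then tell ZFS to claim the new space on that vdev.
zpool online -e tank /dev/vg0/zfsvol
zpool list tank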

@FransUrbo
Contributor Author

@andyeff I'm not sure what you mean by SAN-provided disks, but I can access my iSCSI targets (when logged in to them) as /dev/sd? devices.

@andyeff

andyeff commented Jun 13, 2016

@FransUrbo ah, I don't believe I can get them to be represented that way in my environment. We're using a pair of fibre-channel paths to access the disks, so multipathd presents the disks as /dev/mapper/mpath? items

@FransUrbo
Contributor Author

Ah, ok. Fair enough. I haven't used FC in 10+ years, so I have no idea how it works now :).

@ThomasRasmussen

ThomasRasmussen commented Jun 14, 2016

@ilovezfs If I understand correctly, this should have worked? I would have expected that partition 9 was moved and partition 1 was extended... but what am I doing wrong?

# parted /dev/sdc print
Error: /dev/sdc: unrecognised disk label
Model: VMware Virtual disk (scsi)
Disk /dev/sdc: 8590MB
Sector size (logical/physical): 512B/512B
Partition Table: unknown
Disk Flags:

# zpool create -f -o autoexpand=on pool02 /dev/sdc

# zpool list
NAME     SIZE  ALLOC   FREE  EXPANDSZ   FRAG    CAP  DEDUP  HEALTH  ALTROOT
pool02  7.94G  62.5K  7.94G         -     0%     0%  1.00x  ONLINE  -

### Expand vmdisk from 8GiB to 9GiB

# for device in `ls /sys/class/scsi_device/`; do echo 1 > /sys/class/scsi_device/$device/device/rescan; done

# dmesg | tail -4
[66593.367036]  sdc: sdc1 sdc9
[66593.375151]  sdc: sdc1 sdc9
[66694.262278] sd 2:0:2:0: [sdc] 18874368 512-byte logical blocks: (9.66 GB/9.00 GiB)
[66694.262321] sdc: detected capacity change from 8589934592 to 9663676416

# parted /dev/sdc print
Warning: Not all of the space available to /dev/sdc appears to be used, you can fix the GPT to use all of the space (an extra 2097152
blocks) or continue with the current setting?
Fix/Ignore? f
Model: VMware Virtual disk (scsi)
Disk /dev/sdc: 9664MB
Sector size (logical/physical): 512B/512B
Partition Table: gpt
Disk Flags:

Number  Start   End     Size    File system  Name  Flags
 1      1049kB  8580MB  8579MB               zfs
 9      8580MB  8589MB  8389kB

# zpool online -e pool02 /dev/sdc

# zpool list
NAME     SIZE  ALLOC   FREE  EXPANDSZ   FRAG    CAP  DEDUP  HEALTH  ALTROOT
pool02  7.94G   192K  7.94G         -     0%     0%  1.00x  ONLINE  -

# parted /dev/sdc print
Model: VMware Virtual disk (scsi)
Disk /dev/sdc: 9664MB
Sector size (logical/physical): 512B/512B
Partition Table: gpt
Disk Flags:

Number  Start   End     Size    File system  Name  Flags
 1      1049kB  8580MB  8579MB               zfs
 9      8580MB  8589MB  8389kB

@ThomasRasmussen

I have then tried to remove partition 9 before running zpool online, and then it is actually expanded:

# parted /dev/sdc rm 9

# zpool online -e pool02 /dev/sdc

Then the partition is in fact resized, but what is partition 9 used for? Is it critical for ZFS to function, or is it something that is "just" added?

@FransUrbo
Contributor Author

what is partition 9 used for? Is it critical for ZFS to function, or is it something that is "just" added?

It's not actually used for anything; it's a "buffer" partition. Meaning, you don't have to add a new disk with exactly the same number of blocks etc.

@ilovezfs
Contributor

If I understand correctly, this should have worked?

@ThomasRasmussen No, that procedure is not expected to work, because you're changing the virtual disk size but the original partition table would still be there. zpool online -e is supposed to work as a follow-up to zpool replace.

@ThomasRasmussen

@ilovezfs so it is not possible to resize a pool when the virtual disk is expanded?

On Solaris (and I believe it is also true for FreeBSD) this does in fact work without problems... so I would have thought that it was true for Linux as well :-(

@behlendorf behlendorf changed the title from "Pool is not autoexpanded after LUN expansion" to "Test case: zpool_expand_*" Jan 24, 2017
@gaurkuma
Contributor

@behlendorf @ilovezfs @ThomasRasmussen

We had a similar problem with expanding current vdevs. We are running as a VM solution, so the approach we are considering to get around all the expansion hassle is as follows:

  1. We ask for a really large virtual disk.
  2. Instead of setting vdev->asize = vdev->max_asize, we assign vdev->max_asize the device size obtained via bdev_capacity(), but vdev->asize is assigned a smaller value (x% of max_asize).
  3. As we start filling the vdevs, at some point (e.g. when we start allocating from the second-to-last metaslab) we raise a ZED event that will invoke zpool online -e, which will increase the vdev size by some finite amount, not to exceed max_asize.

Initial test results look promising. I'm looking for comments from the experts, as I am not sure, first, why max_asize and asize both exist if they were assigned the same value, and second, what the purpose of the expandsize zpool property is. I don't think it is being used.
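
On that last point: expandsize (the EXPANDSZ column) reports uninitialized space on a vdev that could still be claimed, e.g. via zpool online -e, to grow the pool. A quick way to look at it (the output below is illustrative only):

# zpool list -o name,size,expandsize,free tank
NAME   SIZE  EXPANDSZ   FREE
tank  7.94G        1G  7.94G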

behlendorf added a commit that referenced this issue May 11, 2017
Enable additional test cases, in most cases this required a few
minor modifications to the test scripts.  In a few cases a real
bug was uncovered and fixed.  And in a handful of cases where pools
are layered on pools the test case will be skipped until this is
supported.  Details below for each test case.

* zpool_add_004_pos - Skip test on Linux until adding zvols to pools
  is fully supported and deadlock free.

* zpool_add_005_pos.ksh - Skip dumpadm portion of the test which isn't
  relevant for Linux.  The find_vfstab_dev, find_mnttab_dev, and
  save_dump_dev functions were updated accordingly for Linux.  Add
  O_EXCL to the in-use check to prevent the -f (force) option from
  working for mounted filesystems and improve the resulting error.

* zpool_add_006_pos - Update test case such that it doesn't depend
  on nested pools.  Switch to truncate from mkfile to reduce space
  requirements and speed up the test case.

* zpool_clear_001_pos - Speed up test case by filling filesystem to
  25% capacity.

* zpool_create_002_pos, zpool_create_004_pos - Use sparse files for
  file vdevs in order to avoid increasing the partition size.

* zpool_create_006_pos - 6ba1ce9 allows raidz+mirror configs with
  similar redundancy.  Updating the valid_args and forced_args cases.

* zpool_create_008_pos - Disable overlapping partition portion.

* zpool_create_011_neg - Fix to correctly create the extra partition.
  Modified zpool_vdev.c to use fstat64_blk() wrapper which includes
  the st_size even for block devices.

* zpool_create_012_neg - Updated to properly find swap devices.

* zpool_create_014_neg, zpool_create_015_neg - Updated to use
  swap_setup() and swap_cleanup() wrappers which do the right thing
  on Linux and Illumos.  Removed '-n' option which succeeds under
  Linux due to differences in the in-use checks.

* zpool_create_016_pos.ksh - Skipped test case isn't useful.

* zpool_create_020_pos - Added missing / to cleanup() function.
  Remove cache file prior to test to ensure a clean environment
  and avoid false positives.

* zpool_destroy_001_pos - Removed test case which creates a pool on
  a zvol.  This is more likely to deadlock under Linux and has never
  been completely supported on any platform.

* zpool_destroy_002_pos - 'zpool destroy -f' is unsupported on Linux.
  Mount points must not be busy in order to unmount them.

* zfs_destroy_001_pos - Handle EBUSY error which can occur with
  volumes when racing with udev.

* zpool_expand_001_pos, zpool_expand_003_neg - Skip test on Linux
  until adding zvols to pools is fully supported and deadlock free.
  The test could be modified to use loop-back devices but it would
  be preferable to use the test case as is for improved coverage.

* zpool_export_004_pos - Updated test case such that it doesn't
  depend on nested pools.  Normal file vdevs under /var/tmp are fine.

* zpool_import_all_001_pos - Updated to skip partition 1, which is
  known as slice 2, on Illumos.  This prevents overwriting the
  default TESTPOOL which was causing the failure.

* zpool_import_002_pos, zpool_import_012_pos - No changes needed.

* zpool_remove_003_pos - No changes needed

* zpool_upgrade_002_pos, zpool_upgrade_004_pos - Root cause addressed
  by upstream OpenZFS commit 3b7f360.

* zpool_upgrade_007_pos - Disabled in test case due to known failure.
  Opened issue #6112

* zvol_misc_002_pos - Updated to use ext2.

* zvol_misc_001_neg, zvol_misc_003_neg, zvol_misc_004_pos,
  zvol_misc_005_neg, zvol_misc_006_pos - Moved to skip list, these
  test cases could be updated to use Linux's crash dump facility.

* zvol_swap_* - Updated to use swap_setup/swap_cleanup helpers.
  File creation switched from /tmp to /var/tmp.  Enabled minimal
  useful tests for Linux, skip test cases which aren't applicable.

Reviewed-by: Giuseppe Di Natale <dinatale2@llnl.gov>
Reviewed-by: loli10K <ezomori.nozomu@gmail.com>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue #3484
Issue #5634
Issue #2437
Issue #5202
Issue #4034
Closes #6095
behlendorf added a commit to behlendorf/zfs that referenced this issue Jun 13, 2018
While the autoexpand property may seem like a small feature, it
depends on a significant amount of system infrastructure.  Enough
of that infrastructure is now in place that, with a few customizations
for Linux, the autoexpand property for whole-disk configurations
can be supported.

Autoexpand works as follows: when a block device is resized a
change event is generated by udev with the DISK_MEDIA_CHANGE key.
The ZED, which is monitoring udev events, detects the event for
disks (but not partitions) and hands it off to zfs_deliver_dle().
The zfs_deliver_dle() function appends the expected whole-disk
partition suffix, and if the partition can be matched against
a known pool vdev it re-opens it.

Re-opening the vdev will trigger a re-reading of the partition
table so the maximum possible expansion size can be reported.
Next, if the autoexpand property is set to "on", a vdev expansion
will be attempted.  After performing some sanity checks on the
disk to verify it's safe to expand, the ZFS partition (-part1)
will be expanded and the partition table updated.  The partition
is then re-opened again to detect the updated size, which allows
the new capacity to be used.

Added PHYS_PATH="/dev/zvol/dataset" to vdev configuration for
ZFS volumes.  This was required for the test cases which test
expansion by layering a new pool on top of ZFS volumes.

Enable the zpool_expand_001_pos and zpool_expand_003_pos
test cases which exercise the autoexpand property.

Fixed zfs_zevent_wait() signal handling which could result
in the ZED spinning when a signal was not handled.

Removed vdev_disk_rrpart() functionality which can be abandoned
in favour of re-opening the device, which triggers a re-read of
the partition table as long as no other partitions are in use.
This will be true as long as we're working with whole disks.
As a bonus this allows us to remove two Linux kernel API checks.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue openzfs#120
Issue openzfs#2437
Issue openzfs#5771
Issue openzfs#7582
behlendorf added a commit to behlendorf/zfs that referenced this issue Jun 28, 2018
While the autoexpand property may seem like a small feature, it
depends on a significant amount of system infrastructure.  Enough
of that infrastructure is now in place that, with a few modifications
for Linux, it can be supported.

Auto-expand works as follows: when a block device is modified
(re-sized, closed after being open r/w, etc.) a change uevent is
generated for udev.  The ZED, which is monitoring udev events,
passes the change event along to zfs_deliver_dle() if the disk
or partition contains a zfs_member as identified by blkid.

From here the device is matched against all imported pool vdevs
using the vdev_guid which was read from the label by blkid.  If
a match is found the ZED reopens the pool vdev.  This re-opening
is important because it allows the vdev to be briefly closed so
the disk partition table can be re-read.  Otherwise, it wouldn't
be possible to report the maximum possible expansion size.

Finally, if the property autoexpand=on, a vdev expansion will be
attempted.  After performing some sanity checks on the disk to
verify that it is safe to expand, the primary partition (-part1)
will be expanded and the partition table updated.  The partition
is then re-opened (again) to detect the updated size which allows
the new capacity to be used.

In order to make all of the above possible the following changes
were required:

* Updated the zpool_expand_001_pos and zpool_expand_003_pos tests.
  These tests now create a pool which is layered on a loopback,
  scsi_debug, and file vdev.  This allows for testing of a non-
  partitioned block device (loopback), a partitioned block device
  (scsi_debug), and a file which does not receive udev change
  events.  This provides better test coverage, and by removing
  the layering on ZFS volumes the issues surrounding layering
  one pool on another are avoided.

* zpool_find_vdev_by_physpath() updated to accept a vdev guid.
  This allows for matching by guid rather than path which is a
  more reliable way for the ZED to reference a vdev.

* Fixed zfs_zevent_wait() signal handling which could result
  in the ZED spinning when a signal was not handled.

* Removed vdev_disk_rrpart() functionality which can be abandoned
  in favor of kernel provided blkdev_reread_part() function.

* Added a rwlock which is held as a writer while a disk is being
  reopened.  This is important to prevent errors from occurring
  for any configuration-related IOs which bypass the SCL_ZIO lock.
  The zpool_reopen_007_pos.ksh test case was added to verify IO
  errors are never observed when reopening.  This is not expected
  to impact IO performance.

Additional fixes which aren't critical but were discovered and
resolved in the course of developing this functionality.

* Added PHYS_PATH="/dev/zvol/dataset" to the vdev configuration for
  ZFS volumes.  This serves as a unique physical path; while the
  volumes are no longer used in these test cases for other reasons,
  this improvement was included anyway.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue openzfs#120
Issue openzfs#2437
Issue openzfs#5771
Issue openzfs#7366
Issue openzfs#7582
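
A rough sketch of how the loopback-based expansion described above can be exercised by hand once the ZED support is in place (pool and path names are examples; run as root):

#!/bin/bash
# Illustrative only: create a pool on a loop device, grow the backing file,
# and let autoexpand (via the ZED) or a manual "zpool online -e" pick it up.
truncate -s 1G /var/tmp/autoexpand.img
loopdev=$(losetup -f --show /var/tmp/autoexpand.img)

zpool create -o autoexpand=on expandtest "$loopdev"
zpool list expandtest

# Grow the backing file, then tell the kernel the loop device changed size;
# this is what generates the udev change event the ZED reacts to.
truncate -s 2G /var/tmp/autoexpand.img
losetup -c "$loopdev"

sleep 5                         # give the ZED a moment to react
zpool list expandtest           # SIZE should now reflect the larger device

# Cleanup
zpool destroy expandtest
losetup -d "$loopdev"
rm -f /var/tmp/autoexpand.img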
behlendorf added a commit to behlendorf/zfs that referenced this issue Jun 28, 2018
While the autoexpand property may seem like a small feature it
depends on a significant amount of system infrastructure.  Enough
of that infrastructure is now in place with a few modifications
for Linux it can be supported.

Auto-expand works as follows; when a block device is modified
(re-sized, closed after being open r/w, etc) a change uevent is
generated for udev.  The ZED, which is monitoring udev events,
passes the change event along to zfs_deliver_dle() if the disk
or partition contains a zfs_member as identified by blkid.

From here the device is matched against all imported pool vdevs
using the vdev_guid which was read from the label by blkid.  If
a match is found the ZED reopens the pool vdev.  This re-opening
is important because it allows the vdev to be briefly closed so
the disk partition table can be re-read.  Otherwise, it wouldn't
be possible to report thee maximum possible expansion size.

Finally, if the property autoexpand=on a vdev expansion will be
attempted.  After performing some sanity checks on the disk to
verify that it is safe to expand,  the primary partition (-part1)
will be expanded and the partition table updated.  The partition
is then re-opened (again) to detect the updated size which allows
the new capacity to be used.

In order to make all of the above possible the following changes
were required:

* Updated the zpool_expand_001_pos and zpool_expand_003_pos tests.
  These tests now create a pool which is layered on a loopback,
  scsi_debug, and file vdev.  This allows for testing of non-
  partitioned block device (loopback), a partition block device
  (scsi_debug), and a file which does not receive udev change
  events.  This provided for better test coverage, and by removing
  the layering on ZFS volumes there issues surrounding layering
  one pool on another are avoided.

* zpool_find_vdev_by_physpath() updated to accept a vdev guid.
  This allows for matching by guid rather than path which is a
  more reliable way for the ZED to reference a vdev.

* Fixed zfs_zevent_wait() signal handling which could result
  in the ZED spinning when a signal was not handled.

* Removed vdev_disk_rrpart() functionality which can be abandoned
  in favor of kernel provided blkdev_reread_part() function.

* Added a rwlock which is held as a writer while a disk is being
  reopened.  This is important to prevent errors from occurring
  for any configuration related IOs which bypass the SCL_ZIO lock.
  The zpool_reopen_007_pos.ksh test case was added to verify IO
  error are never observed when reopening.  This is not expected
  to impact IO performance.

Additional fixes which aren't critical but were discovered and
resolved in the course of developing this functionality.

* Added PHYS_PATH="/dev/zvol/dataset" to the vdev configuration for
  ZFS volumes.  This is as good as a unique physical path, while the
  volumes are not used in the test cases anymore for other reasons
  this improvement was included.

Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Issue openzfs#120
Issue openzfs#2437
Issue openzfs#5771
Issue openzfs#7366
Issue openzfs#7582
behlendorf added a commit to behlendorf/zfs that referenced this issue Jun 29, 2018
behlendorf added a commit to behlendorf/zfs that referenced this issue Jul 9, 2018
behlendorf added a commit to behlendorf/zfs that referenced this issue Jul 12, 2018
behlendorf added a commit to behlendorf/zfs that referenced this issue Jul 13, 2018