
LVM RAID #286

Merged: 25 commits merged into storaged-project:master from master-lvm_raid on Jan 7, 2016
Conversation

vpodzime (Contributor)

This adds elementary support for non-linear LVs to Blivet. The first patch is unrelated, but also useful. Tests will follow later.

@vpodzime force-pushed the master-lvm_raid branch 6 times, most recently from c848e67 to e89bf61 on November 25, 2015 at 10:39
```diff
@@ -668,7 +676,7 @@ class Single(ErsatzRAID):

 class Dup(RAIDLevel):

-    """ A RAID level which expresses one way btrfs metadata may be distributed.
+    """ A RAID level which expresses one way btrfs metadata may be distriuted.
```
Contributor

Please fix this small typing error.

vpodzime (Contributor, Author)

Will do, good catch.

@jkonecny12 (Contributor)

Otherwise seems good to me as far as I can tell.

@jkonecny12 added the ACK label on Nov 30, 2015
@vpodzime force-pushed the master-lvm_raid branch 3 times, most recently from 9f9ffa6 to bf55cd9 on December 2, 2015 at 15:08
@dwlehman (Contributor)

Should we support the "striped" and "mirror" segment types? I think at least one of those is the old lvm implementation and not using the md driver. My suspicion is the lvm team would steer you and anyone else away from using that.

@dwlehman (Contributor)

This looks good overall. It would be nice if md, btrfs, and lvm all accepted the same raid_level ctor argument (it could be an alias/alternative to seg_type for lvm).

@vpodzime (Contributor, Author)

> Should we support the "striped" and "mirror" segment types? I think at least one of those is the old lvm implementation and not using the md driver. My suspicion is the lvm team would steer you and anyone else away from using that.

Since there's no such thing as LVM RAID0, striped should definitely be supported. And while mirror is superseded by raid1, it's still quite commonly used, AFAICT, especially because it is the default on RHEL 5 (if you just specify --mirrors=1 and leave the decision about the type to LVM). So I think this decision should probably be left to anaconda and blivet-gui, but not to blivet, which also needs to deal with existing configurations.

@vpodzime (Contributor, Author)

> It would be nice if md, btrfs, and lvm all accepted the same raid_level ctor argument (it could be an alias/alternative to seg_type for lvm).

Yeah, that would be nice. But I'd like to fully support the seg_type argument -- e.g. once all the LV classes are unified by the single class and mixins, the thin-pool segment type will be used to create a thin pool LV.
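
For illustration only, a minimal sketch of how such an alias could work; the class and its parameters are hypothetical stand-ins, not blivet's actual constructor:

```python
# A constructor that accepts seg_type directly and treats raid_level as an
# alias for it, along the lines of the discussion above (purely illustrative).
class FakeLV:
    def __init__(self, name, size, seg_type="linear", raid_level=None):
        if raid_level is not None:
            seg_type = str(raid_level)   # e.g. raid_level="raid1" -> seg_type "raid1"
        self.name = name
        self.size = size
        self.seg_type = seg_type

lv1 = FakeLV("data", size=1024**3, raid_level="raid1")
assert lv1.seg_type == "raid1"
lv2 = FakeLV("pool", size=1024**3, seg_type="thin-pool")
assert lv2.seg_type == "thin-pool"
```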

@vpodzime (Contributor, Author) commented on Jan 6, 2016

Added unit tests and fixed 3 minor things they discovered. Unless somebody complains somehow in the next two days, I'm gonna merge this PR.

Commit messages from the 25 commits in this pull request:

It's just more common these days and the definition is easier to find and recognize in the sources.

This allows for more precise configuration of what the LVM setup should look like. We already have some rudimentary support for specifying PVs for caches, which is a must, but it's useful for all LVs in general.

Thanks to dlehman@redhat.com for pieces of code and ideas for this patch!
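
As a rough illustration of the idea (the class and keyword below are hypothetical, not blivet's actual API), an LV request that pins the LV to specific PVs might look like this:

```python
# Hypothetical request object illustrating "this LV/cache should live on
# these PVs"; blivet's real device objects carry much more state than this.
class LVRequest:
    def __init__(self, name, size, pvs=None):
        self.name = name
        self.size = size          # bytes
        self.pvs = pvs or []      # empty list means "allocate from any PV"

# pin a cache (or any LV) to two fast PVs
req = LVRequest("fast_lv", size=10 * 1024**3, pvs=["/dev/sdb1", "/dev/sdc1"])
```
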
This really only takes care of passing the type down to libblockdev and thus lvcreate. We need to do a lot more to actually fully support various types of LVs.

If we want to support LV types other than linear, we need to know how much space we have on each PV. For example, a 1GiB RAID1 LV requires not only 2GiB of total space in a VG with 2 PVs, it also requires 1GiB of space on each of the PVs.

Striped LVs are essentially RAID0 LVs as far as all the calculations go.
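
A back-of-the-envelope sketch of the calculation described in the two messages above; the helper is purely illustrative and ignores metadata overhead:

```python
# Rough per-PV space requirement for an LV of the given segment type,
# assuming the data is spread evenly over num_pvs PVs (illustrative only).
def per_pv_space(requested_gib, seg_type, num_pvs):
    if seg_type == "raid1":                 # every PV holds a full copy
        return requested_gib
    if seg_type in ("raid0", "striped"):    # data is split across the PVs
        return requested_gib / num_pvs
    raise ValueError("unhandled segment type: %s" % seg_type)

# a 1GiB RAID1 LV on 2 PVs needs 1GiB on *each* PV (2GiB total in the VG)
assert per_pv_space(1.0, "raid1", 2) == 1.0
# a 1GiB striped LV on 2 PVs needs only 0.5GiB on each PV
assert per_pv_space(1.0, "striped", 2) == 0.5
```
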
We already support various segment types for LVs to some extent. However, we need to get a better picture of how much space on such LVs' PVs is required. We already have code for that, so let's just use it.

This unfortunately requires the LVPVSpec to have read-write attributes/fields, and thus it cannot be a namedtuple. We should probably come up with some "read-write namedtuple" thing for cases like this in the future.
LVM creates small internal LVs for RAID LVs that hold the necessary metadata. We
need to account for that in order to be able to do calculations of PV/VG free
space etc. This requires us to do some of the calculations in a more granular
manner -- separating data and metadata parts.

Non-linear LVs require space on specific PVs and thus have stronger restrictions than linear LVs, which can be allocated from anywhere.

Useful for some manual testing as well as for people wondering how to do something like that.

A useful simplification of what we have to check in a few places.

Right now, we only support creation of non-linear LVs under some conditions that allow us to avoid trying to do anything too crazy. Let's make sure these conditions are met when a new LVMLogicalVolumeDevice object is being created.

We need that information in order to do checks when adding more LVs, and users need this information to decide where to place their LVs.

Let's not bother with existing LVs' allocations for now. We can just ignore those and only care about newly added (non-existing) LVs, which we need to place somewhere.
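
A simplified model of that bookkeeping, under the assumption stated above that only newly added LVs are tracked (all names here are illustrative):

```python
# Start from each PV's free space and subtract only the space requested by
# new (non-existing) LVs; existing LVs' allocations are ignored on purpose.
def remaining_pv_space(pv_free, new_lv_requests):
    """pv_free: {pv_name: free_bytes}; new_lv_requests: [(pv_name, bytes), ...]"""
    remaining = dict(pv_free)
    for pv_name, nbytes in new_lv_requests:
        remaining[pv_name] -= nbytes
        if remaining[pv_name] < 0:
            raise ValueError("not enough space on %s" % pv_name)
    return remaining

left = remaining_pv_space({"sdb1": 10, "sdc1": 10}, [("sdb1", 4), ("sdc1", 4)])
assert left == {"sdb1": 6, "sdc1": 6}
```
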
The word "copies" is accurate together with mirror/RAID1 RAID, but it's
misleading with other RAID levels LVM supports. The property doesn't seem to be
accessed anywhere outside the class so let's just replace it with a private
property with a more accurate name.

Also give incomplete/inaccurate information if we have incomplete/inaccurate
information instead of erroring out.

lv.size reports the size of the LV, not the space occupied in the VG; that's what data_vg_space_used is for. By the same logic, lv.metadata_size should report the size of the metadata space the LV has available, leaving lv.metadata_vg_space_used to report how much space in the VG the metadata part(s) of the LV take up.

If the LV exists, we should just go through the internal metadata LVs and sum their sizes, because that's the actual/real value.

Also document the property.

Please note that no changes are needed outside of these two properties, because they are already used properly and this just fixes the values such places in the code calculate with (like the metadata_size passed to blockdev.thpoolcreate() or the calculation of the pmspare LV's size).
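
A condensed sketch of the intended semantics of the two properties; the attribute names and the trivial metadata_vg_space_used here are assumptions, and the real blivet code is more involved:

```python
class SketchLV:
    def __init__(self, requested_md_size=0, internal_md_lv_sizes=()):
        self._requested_md_size = requested_md_size
        # sizes of the internal metadata LVs, known once the LV exists
        self._internal_md_lv_sizes = list(internal_md_lv_sizes)

    @property
    def metadata_size(self):
        """Metadata space the LV has available (actual sizes if it exists)."""
        if self._internal_md_lv_sizes:
            return sum(self._internal_md_lv_sizes)
        return self._requested_md_size

    @property
    def metadata_vg_space_used(self):
        """VG space occupied by the metadata part(s) of the LV."""
        # in the real code this can differ from metadata_size (e.g. RAID copies)
        return self.metadata_size

lv = SketchLV(internal_md_lv_sizes=[4 * 1024**2, 4 * 1024**2])
assert lv.metadata_size == 8 * 1024**2
```
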
Now that we keep track of the available space in the PVs, we need to take LVM caches into account, because those specify PVs and so we need to make sure they really fit somewhere. Also, users need to know how much space they still have available in their PVs if they add a cache to their LV(s).

It is shorter, faster and more reliable. Using pv.name was a remnant of a development version of the LVM cache support that worked with PV names instead of PV (StorageDevice) objects.

LVM complains about a PV appearing multiple times in the list of PVs to use.

Add and use a function for deduplicating things in a list (keeping the ordering
of the items).
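
One way such a helper can be written (an illustration, not blivet's exact function):

```python
# Deduplicate a list while keeping the original ordering of the items,
# so the same PV is never passed to LVM twice.
def dedup_list(items):
    seen = set()
    result = []
    for item in items:
        if item not in seen:
            seen.add(item)
            result.append(item)
    return result

assert dedup_list(["sdb1", "sdc1", "sdb1"]) == ["sdb1", "sdc1"]
```
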
Useful for testing as well as for users wondering how to do something like this.

In order to make sure we are working with something we understand.

Creating an LV means some extents were allocated from its VG's PVs. In order to prevent us from working with stale values, we need to make sure fresh values are fetched.

…e size

When an LVM cache is created, an internal metadata LV is created for it. For LVM, that also means a special pmspare LV with a size greater than or equal to the size of the metadata LV has to exist (and thus may be created) in the same VG. We don't want to bother user code with these calculations, so we should subtract this space from the requested cache's size.
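
Roughly, the adjustment described above amounts to something like the following; the helper and its exact accounting are a sketch, while real blivet works with extent-aligned Size objects:

```python
# Reserve room for the pmspare LV, which must be at least as big as the
# cache's metadata LV, out of the size the user asked for (illustrative;
# whether the metadata LV itself is also taken out of the requested size
# is a detail of the real implementation).
def cache_data_size(requested_mib, md_size_mib):
    return requested_mib - md_size_mib

# e.g. 1024 MiB requested with an 8 MiB metadata LV leaves 1016 MiB of data
assert cache_data_size(1024, 8) == 1016
```
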
We can treat linear LVs as RaidDevice instances using the Linear RAID level, and so make all (non-thin) LVs instances of the RaidDevice class.

RAID LVs have/need internal metadata LVs that are one extent big. Instead of bothering the user code with this, let's just make the RAID LVs one extent smaller than requested and use that space for the metadata.

Also reserve the space for the 'mirror' segment type, which is a different implementation of RAID1 in Device Mapper.
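
As a small worked example of the adjustment (the 4 MiB extent size is just the common LVM default, and the helper is illustrative):

```python
EXTENT = 4 * 1024**2   # 4 MiB, LVM's usual default extent size

# Round the requested size down to whole extents, then give one extent back
# to be used for the RAID LV's internal metadata LV.
def adjusted_data_size(requested):
    extents = requested // EXTENT
    return (extents - 1) * EXTENT

# asking for 1024 MiB yields a 1020 MiB data area plus one 4 MiB metadata extent
assert adjusted_data_size(1024 * 1024**2) == 1020 * 1024**2
```
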
For example "mirror" is the nick of the "RAID1Level" the name of which is
"raid1".
This adds tests for the commits, changes, and new things implemented for the support of non-linear LVs (aka LVM RAID).
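
In the spirit of those tests, a self-contained toy test case (the helper under test is a local stand-in, not blivet's real code):

```python
import unittest

def round_to_extents(size, extent=4 * 1024**2):
    # helper used only by this example: round a size down to whole extents
    return (size // extent) * extent

class RoundToExtentsTestCase(unittest.TestCase):
    def test_exact_multiple_is_unchanged(self):
        self.assertEqual(round_to_extents(8 * 1024**2), 8 * 1024**2)

    def test_partial_extent_is_dropped(self):
        self.assertEqual(round_to_extents(5 * 1024**2), 4 * 1024**2)

if __name__ == "__main__":
    unittest.main()
```
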
@jkonecny12 (Contributor)

My quick look found nothing bad, so I think it should be good for pushing :)

vpodzime added a commit that referenced this pull request Jan 7, 2016
@vpodzime merged commit 0b62859 into storaged-project:master on Jan 7, 2016