pacific: krbd: make sure the device node is accessible after the mapping #39642

Merged
idryomov merged 1 commit into ceph:pacific from wip-rbd-map-sanity-check-pacific on Feb 24, 2021

Conversation

idryomov
Contributor

We have always assumed this to be the case and users' scripts and
orchestration tools have grown to depend on this. Let's add some
enforcement, prompted by [1]:

"I am running my Kubernetes worker node inside of an LXC container
which doesn't benefit from the device node created by the kernel, so
I'm using udev to create the /dev/rbd* device nodes inside of the LXC
container."

which, through an unfortunate interaction with the ceph-csi rbd plugin,
results in data loss for "volumeMode: Filesystem" PVs because it ends
up recreating the filesystem every time the PV is attached to the pod:

"When deleting the pod and re-creating it, I can see that the RBD
image is indeed being reformatted. This seems to be because when
blkid is being run to check if the image is formatted, the /dev/rbd*
device has not yet been created by udev. By the time the code gets
down to running mkfs, the device is there and the damage is done."

[1] ceph/ceph-csi#1820

Fixes: https://tracker.ceph.com/issues/49410
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit f6854ac)
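
For readers following along: the race is that `rbd map` can return before udev has created the `/dev/rbd*` node, so a caller that immediately probes the device (e.g. with blkid) concludes the image is unformatted and runs mkfs. Below is a minimal sketch of the kind of post-map check this is about, assuming a simple stat-based poll; the helper name, device path, and timeout are made up for illustration and this is not the actual krbd change:

```cpp
// Illustrative only: a hypothetical helper, not the real krbd code.
#include <chrono>
#include <cstdio>
#include <string>
#include <thread>
#include <sys/stat.h>

// Poll until the device node exists and is a block device, or the
// (illustrative) timeout expires.
static bool wait_for_device_node(const std::string &devnode,
                                 std::chrono::seconds timeout)
{
  const auto deadline = std::chrono::steady_clock::now() + timeout;
  while (std::chrono::steady_clock::now() < deadline) {
    struct stat st;
    if (stat(devnode.c_str(), &st) == 0 && S_ISBLK(st.st_mode)) {
      return true;  // udev (or the kernel) has made the node usable
    }
    std::this_thread::sleep_for(std::chrono::milliseconds(100));
  }
  std::fprintf(stderr, "%s did not appear in time\n", devnode.c_str());
  return false;
}

int main()
{
  // Hypothetical usage right after a map: fail the mapping instead of
  // letting callers race against udev and mis-detect an unformatted image.
  return wait_for_device_node("/dev/rbd0", std::chrono::seconds(10)) ? 0 : 1;
}
```

The point of the enforcement is that a wait like this happens inside the mapping path itself, so users' scripts and orchestration tools can keep assuming the device node is usable as soon as the map returns.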

@idryomov added this to the pacific milestone Feb 23, 2021
@idryomov changed the title krbd: make sure the device node is accessible after the mapping pacific: krbd: make sure the device node is accessible after the mapping Feb 23, 2021

@dillaman left a comment

👍

@idryomov merged commit c356ba8 into ceph:pacific Feb 24, 2021
@idryomov deleted the wip-rbd-map-sanity-check-pacific branch February 24, 2021 13:03