New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
testiso failing in 4.11 using RHEL 8.5; race mounting rootfs #666
Comments
|
One possibility here is that |
|
We are seeing this also on internal 4.9 CI jobs with the same error, but the |
|
We had one successful build + test of 4.10 using the RHEL 8.4 EUS bits (the repo location we had previously been using switched to RHEL 8.5). I suspect there was something in the 8.5 bits that was causing the error, but the pkgdiff for the CI jobs isn't working, so we don't have a good snapshot of what changed. |
|
We've had additional success building RHCOS 4.10 + 4.9 using RHEL 8.4 EUS sources. Some additional investigation around RHCOS 4.10 + RHEL 8.5 kernel would be a good place to start. |
|
Issues go stale after 90d of inactivity. Mark the issue as fresh by commenting If this issue is safe to close now please do so with /lifecycle stale |
|
This issue seems to have resolved itself, though we don't really understand why. Please re-open if this happens again. /close |
|
@miabbott: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
This showed up again in #718. The kernel message The util-linux message The util-linux commit mentions that the failure only occurs with a non-zero file offset. That's not a common setting, so it's unsurprising that no one noticed. But CoreOS uses a non-zero offset to mount the rootfs squashfs directly out of the cpio archive. (Added in RHCOS 4.6 in coreos/fedora-coreos-config@18a2c51.) A reproducer based on the util-linux commit message triggers pretty quickly for me: truncate -s 100M disk
mkdir point
losetup -o 239 /dev/loop3 disk
mkfs.ext4 /dev/loop3
losetup -d /dev/loop3
while mount -o loop,offset=239 disk point && umount point; do :; doneWe should ask for util-linux/util-linux@eab90ef to be backported to 8.4+. |
|
Awesome work on that investigation 🕵️ !
|
All of the ISO-based `testiso` scenarios are failing due to openshift#666 so we are going to disable them until we get `util-linux` patched. See also: https://bugzilla.redhat.com/show_bug.cgi?id=2058176
|
/remove-lifecycle stale |
|
Issues go stale after 90d of inactivity. Mark the issue as fresh by commenting If this issue is safe to close now please do so with /lifecycle stale |
|
/remove-lifecycle stale |
|
https://bugzilla.redhat.com/show_bug.cgi?id=2058176 is shipped in 8.6 |
Our Live ISO relies on loopback mounting a squashfs.
It's failing in 4.10; looks like this may be a kernel change:
The text was updated successfully, but these errors were encountered: