-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tonistiigi/binfmt:latest
(qemu 9.2.0) segfault on ubuntu-24.04 / ubuntu-latest
#198
Comments
Run qemu on Ubuntu 22.04 See also: - actions/runner-images#11471 - docker/setup-qemu-action#188 - docker/setup-qemu-action#198 Upgrading to QEMU v8.1.5 doesn't seem to help, so closes #1529 I runt the CI multiple times and it always worked, so I think this downgrade really "fixes" the issue. These changes are made under both the "Apache 2.0" and the "GNU Lesser General Public License 2.1 or later" license terms (dual license). SPDX-License-Identifier: Apache-2.0 OR LGPL-2.1-or-later
This is impacting us as well, as of this morning most of our builds that use There is also an issue opened on tonistiigi/binfmt, tonistiigi/binfmt#240 |
This is also effecting us as well. |
Same issue for |
tonistiigi/binfmt:latest
(qemu 9.2.0) segfault on ubuntu-24.04 / ubuntu-latest
This is probably related to a kernel issue on Ubuntu: tonistiigi/binfmt#215 (comment) |
A workaround that we found is to run our docker builds on ARM based VMs with Qemu emulating the AMD64. Minimal modification was needed for our use case (CI runners). |
This worked for me, thanks |
Here's a minimalish repro based on our build that was failing. Maybe worth noting is that our runner is Dockerfile:
Excerpt from our GitHub workflow:
|
I mentioned it in another issue (here) but I found another workaround as well. If you set up your # Use multiarch/qemu-user-static only for non-native architectures. Ending tag is excluded due to specifying platform
# See https://docs.docker.com/reference/dockerfile/#automatic-platform-args-in-the-global-scope for more info
FROM --platform=$BUILDPLATFORM multiarch/qemu-user-static AS qemu
# Start with base image
FROM luxonis/depthai-library:latest
# Copy QEMU binary only when cross-compiling with ARM and ARM64
COPY --from=qemu /usr/bin/qemu-*-static /usr/bin/
# More build statements... This was tested on Windows 10 Pro x64 host machine while building docker images to be used on Raspberry Pi 3s ( I'd like to know why this works, as this seems too simple for my liking but I'm glad this is a decent workaround for now. |
We're not currently running Server on ARM, so we don't really need this. This also allows to workaround QEMU segfaulting on ARM see docker/setup-qemu-action#198.
We're not currently running Server on ARM, so we don't really need this. This also allows to workaround QEMU segfaulting on ARM see docker/setup-qemu-action#198.
Same issue |
At first it was intermitted failures now I consistently with fail every time when attempting to docker buildx |
### What this PR does / why we need it? Backport vllm-project#64 to v0.7.1-dev branch Add container image build ci: - Enable branch, tag docker image publish - branch image: `vllm-ascend:main`, `vllm-ascend:v0.7.1-dev` - tag image: `vllm-ascend:v0.7.1rc1` - Enable PR docker image build check - other changes: - Prepare the `REPO_OWNER` because the ghcr lowerercase required - Add `Free up disk space` step to avoid `No space left on device` like vllm-project#27 - Setup qemu with image to resolve docker/setup-qemu-action#198 ### Does this PR introduce _any_ user-facing change? NO ### How was this patch tested? build: CI passed --------- Signed-off-by: Yikun Jiang <yikunkero@gmail.com> Signed-off-by: angazenn <zengyanjia@huawei.com>
Signed-off-by: deadprogram <ron@hybridgroup.com>
We are mysteriously encountering this issue on ARM64 self-hosted GitHub Actions ( |
We experience the same issue with actions-runners |
* Bump setup-qemu-action to 3.4.0 * Bump Ubuntu version docker/setup-qemu-action#198 caused the Ubuntu Docker image build to fail, and this commit fixes that.
Same issue for linux/arm64 |
Specifically, this is happening during an |
For me, as a workaround I downgraded it to ubuntu-22.04 however the build time takes a huge impact; when building my docker image is 4x slower. Used to take around 10 mins when building using linux/arm64, now it take 40 mins. |
https://github.com/tonistiigi/binfmt/releases/tag/deploy%2Fv9.2.2-52 (QEMU 9.2.2) has been released yesterday: tonistiigi/binfmt#215 (comment) If you encounter something similar, open an issue on https://github.com/tonistiigi/binfmt/issues |
This reverts commit 3dbfd17. This shouldn't be necessary any longer; QEMU in tonistiigi/binfmt:latest has been updated to 9.2.2, which has this bug fixed - see docker/setup-qemu-action#198 (comment) tonistiigi/binfmt#215 (comment) and https://gitlab.com/qemu-project/qemu/-/issues/1913.
Works around issue from https://github.com/espressif/esp-idf/actions/runs/13531037397/job/37813060700 caused by Qemu segmentation fault. Errors were encountered while processing: libc-bin The workaround is from docker/setup-qemu-action#198.
Works around issue from https://github.com/espressif/esp-idf/actions/runs/13531037397/job/37813060700 caused by Qemu segmentation fault. Errors were encountered while processing: libc-bin The workaround is from docker/setup-qemu-action#198.
Contributing guidelines
I've found a bug, and:
Description
As mentioned in #188, the default image is unusable on ubuntu-24.04 since a kernel update a few weeks back.
As reported in that issue, moving to the qemu 8.1.5 image solved the issue for most users (since then, there has been a 9.2.0 / master image published).
It seems more & more unlikely that the latest image will be updated and more likely that setup-qemu-action should change the default image (possibly depending on the runner) to an image that works in most cases rather than what I imagine is almost none => hence this is a bug report setup-qemu-action can do something about without closing saying it's an upstream issue.
Multiple reproducers/logs/... are already available in that issue so not added new ones here.
Expected behaviour
No random segmentation faults.
Actual behaviour
Random segmentation faults in all the repositories I know of when the action is used with the default image on ubuntu-24.04 (latest)
Repository URL
No response
Workflow run URL
No response
YAML workflow
N/C
Workflow logs
No response
BuildKit logs
Additional info
No response
The text was updated successfully, but these errors were encountered: