-
-
Notifications
You must be signed in to change notification settings - Fork 13.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
nixos.ova build fails on Hydra #25901
Comments
Differences between succeeding and failing job:
|
Here's the line from nix giving the error: It's definitely related to NixOS/nix@eba840c. |
Probably the same reason as explained in 6cfb3b6. No idea on how to fix, though. |
This issue seems to have no owner (just stating facts, no offense intended). Is there a process to identify the offending commit and roll it back? |
Apparently it wasn't triggered by a nixpkgs commit but by a nix change. That's why most people won't reproduce it. I don't think a process really exists (for this). /cc @edolstra for the option to roll that change back on the build farm for now, as there's no idea how to fix it properly and the channel is on a ~11 days old commit already. |
Seems it's now timing out building ibus package? |
ibus was updated on master (since the last ova failure) and it seems to build fine on Hydra now, though the ova job doesn't really show that (yet). |
Now the job built successfully, though I can't see why. AFAIK it's possible some build slaves still use an older version of nix, or something... |
Right, probably only the packet machine can succeed. I managed to make Hydra build the tested job successfully now, after two weeks, but this issue remains a channel blocker IMO. |
I guess the packet machine was updated so now the job will never succeed... |
The problem seems to be we're running insstall commands for the ova inside the sandbox, which prevents setuid/gid, however, we do use setsid/setgid in some places: ex in nixos-prepare-root:
|
Here are some additional places:
|
Right, |
Please bear in mind I don't know anything about the low-level details involved. I was able to reproduce the failure in a VM and trying to debug and connect some breadcrumbs I ended up with the below, although again my reasoning here may well be flawed. Hopefully it is of some help and doesn't add to the confusion. Applying the following:
on top of:
makes the test pass for me. Patch and ideas stolen from #10496 |
@pbogdan I'm in no position to evaluate if your patch is the best solution (I have no idea what I'm doing here,) however I can definitely appreciate your great digging and patch. Thank you! |
@grahamc you can safely skip the sticky bit in, when creating the image. It will be fixed by stage-2 automatically: https://github.com/NixOS/nixpkgs/blob/master/nixos/modules/system/boot/stage-2-init.sh#L62 |
Oh, I didn't realize this isn't reproducible with the latest stable nix. It still builds in |
This solution seems OK. I think I now understand. @pbogdan: thank you a lot for finding a solution; I designated you as the author of the modified commit :-) |
Not only did it work okay, it passed: https://hydra.nixos.org/build/54649680#tabs-summary I think we'll get a channel update: https://hydra.nixos.org/job/nixos/trunk-combined/tested#tabs-constituents |
Yes, I do believe we'll finally get a channel bump within several hours 🎉 There are just some heavy packages left, e.g. webkitgtk... |
nixos-unstable bumped! |
Maybe we should just get rid of that fakeroot stuff? I mean, the goal was to git rid of the QEMU VM, but that's still being used, so fakeroot just seems an unnecessary complication... |
Issue description
The ova job has been failing during past week, blocking the nixos-unstable channel. I don't know what's the exact problem.
Steps to reproduce
I'm unable to reproduce the failure locally.
The text was updated successfully, but these errors were encountered: