New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hydra.nixos.org nixpkgs/nixos jobs coordination #117495
Comments
I've aborted the hardening-flags jobs because of IMO more important jobs. #104091 (comment) I am thinking of pausing the staging* builds because of the upcoming openssl cve fix #117191 |
Perhaps we should enable discussions in nixpkgs. Maybe it's better suited for this |
This comment has been minimized.
This comment has been minimized.
Maybe we could've done this on the discourse forum :-) GH discussions sound like a replacement for that, though I don't have any experience with them yet. |
Someone's cancelled the first |
I've canceled all jobs except for release-20.09-small to get that channel update out as quickly as possible. I'm not sure of the cause yet, but "bump to front" doesn't seem to bump to front. I'll do 20.09 after that, then unstable-small. |
"Bump to front" only works when new jobs are to be dispatched iirc :/ I usually bump the jobs to front and restart the queue runner to force it to redispatch all jobs (with the correct order this time). Kills all builds but yeah… |
I don't know... in this state where almost all is cancelled, we get most of the build farm idling. EDIT: well, it should only take a few hours until -small is finished. |
Good point, @vcunat. Now that the 20.09-small build is making good progress, to get more builds running on the unused capacity I restarted the unstable-small jobset. But this shouldn't overwhelm anything and will hopefully keep the 20.09-small jobset highly prioritized. |
I started a couple 20.09 darwin jobs as well, as there are none in the nixos jobsets. |
20.09-small finished its builds: https://hydra.nixos.org/eval/1658031 |
unstable-small is very near completion also. I'm going to wait for it to complete then restart 20.09 |
I restarted the 20.09 jobs. |
nixos:unstable-small:tested finished: https://hydra.nixos.org/eval/1658000 |
I've just bumped the 20.09 jobs to the front of the queue, restarted hydra-queue-runner, and started hydra-evaluator. Hopefully hydra keeps churning on 20.09 as the priority overnight. With that, I'm heading to bed. |
I canceled 20.03 jobs (for now). Those jobsets still have high amount of shares configured; I think we should lower them significantly, as 20.03 isn't officially supported anymore. EDIT: I did that later. |
There are still some strange timeouts on the 20.09 build: https://hydra.nixos.org/build/140063634 |
Occasional timeouts like that do happen. It works locally so I restarted it. |
I've aborted the haskell-updates jobs since the mass rebuild after the openssl update is still ongoing cc @peti |
Well, it was based atop, so perhaps it was targeting merge before the real rebuild of master with new openssl happens. |
20.09 has about 7,000 jobs left, but tested has passed. I'm inclined to cancel all remaining jobs, le the channel advance, and then restart the jobs to backfill the cache. Any opinions? I'll do it in ~30min unless I hear otherwise. |
I wish you hadn't. How am I supposed to do the weekly merge tonight without any results from Hydra? |
Sorry, it may be challenging to get nixos-unstable caught up in time. |
All four supported NixOS channels have updated and contain the new openssl 🎉
|
|
Purpose
Inform one another about Hydra jobs that need to be (re)started, aborted or otherwise adjusted. Note often these kind of things go via IRC as well. This is just another channel. Also, if one needs a jobset they could request here as well.
Added the channel-blocker label so it will show up on status.nixos.org.
cc @roberth @vcunat @mweinelt
The text was updated successfully, but these errors were encountered: