phd: collect core when killing non-booting guest by iximeow · Pull Request #1079 · oxidecomputer/propolis

iximeow · 2026-03-12T19:14:46Z

this may or may not prove useful in practice; if we're lucky something got funky in device emulation and we can see a stuck thread. on the other hand, if we're unlucky the guest is stuck in a loop and all we see is one vCPU was running while everything else was idle. in this case, at least, hopefully the serial console says something about the condition (it usually does, from experience)

this does the immediate thing in #1034. theoretically I've put the core in the right spot to get slurped up when we tar up the rest of the phd run artifacts so I'll rerun the phd job 'til we get a core...?

this may or may not prove useful in practice; if we're lucky something got funky in device emulation and we can see a stuck thread. on the other hand, if we're unlucky the guest is stuck in a loop and all we see is one vCPU was running while everything else was idle.

hawkw

this is great! thank you!

hawkw · 2026-03-12T22:08:30Z

phd-tests/framework/src/test_vm/mod.rs

+                );
+                let proc = self.server.as_ref().unwrap();
+                proc.core();
+                anyhow::bail!("timed out while waiting to boot")


maybe we ought to stuff the core's path in this error so that it gets printed in the test failure as well as in its logs?

i think i've got it so that the warn!("core written to {}", core_path); would be right above this in the rendered buildomat output, so it should be pretty easy to notice if you've gotta look at the logs.. it stubbornly does not want to do the thing though so i guess we'll see?

anyway i'm mostly thinking that until very recently most timed out while waiting to boot really meant that i did something funky with the guest test image or adapter. there the core wouldn't have been nearly as useful as looking at the guest's serial history, so i don't wanna nudge in a misleading direction.

iximeow · 2026-03-12T23:17:01Z

ixi/core-on-stuck-phd-demo didn't cough up a flake after four runs. merging this so we have it whenever the next time it does happen..

iximeow added the testing Related to testing and/or the PHD test framework. label Mar 12, 2026

hawkw approved these changes Mar 12, 2026

View reviewed changes

iximeow merged commit a54b7de into master Mar 12, 2026
12 checks passed

iximeow deleted the ixi/core-on-stuck-phd branch March 12, 2026 23:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

phd: collect core when killing non-booting guest#1079

phd: collect core when killing non-booting guest#1079
iximeow merged 1 commit intomasterfrom
ixi/core-on-stuck-phd

iximeow commented Mar 12, 2026

Uh oh!

hawkw left a comment

Uh oh!

hawkw Mar 12, 2026

Uh oh!

iximeow Mar 12, 2026

Uh oh!

Uh oh!

iximeow commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

iximeow commented Mar 12, 2026

Uh oh!

hawkw left a comment

Choose a reason for hiding this comment

Uh oh!

hawkw Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

iximeow Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

iximeow commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants