spawn the agent in a process group / job #1174

bmc-msft · 2021-08-23T21:10:57Z

When we terminate the agent, any child processes it's created do not get
terminated but should.

By launching in a process group (or job) using the command-group crate,
we can terminate the child and all of it's children automatically.

When we terminate the agent, any child processes it's created do not get terminated but should. By launching in a process group (or job) using the command-group crate, we can terminate the child and all of it's children automatically.

bmc-msft · 2021-08-23T21:17:02Z

Note, on Windows, command-group uses a job object created with winapi::um::jobapi2::CreateJobObjectW with JOB_OBJECT_LIMIT_KILL_ON_JOB_CLOSE set. On kill, the job object is terminated, which should kill all of the children (and grand children).

bmc-msft · 2021-08-23T22:23:09Z

All of the windows tasks are failing during integration testing. Investigating.

…t issue

bmc-msft · 2021-08-25T18:55:39Z

This PR pins to the commit hash associated with a PR that addresses handling timeouts to GetQueuedCompletionStatus.

watchexec/command-group#3

Once this PR, or something like it, has been merged & released, we should move to the released version of command-group.

ranweiler

This overall approach seems reasonable, at least as a short-term improvement.

I guess I'd then ask:

Should we being using this for every child process, or at least many others?
I wouldn't think that the temp dir bug in the onefuzz-agent should be impacted by this at all. In particular, if the supervisor is "done", or knows that it is stopping, it should never be possible for a the onefuzz-agent to cause a task failure, period. Furthermore, if a supervisor is running task A, and we tell it to stop because task A should be stopped, this should already be reflected in the server, and I'd expect any subsequent "task A failed" messages should just be dropped as spurious. If that's not happening, there's a deeper problem we should also be fixing.

An aside, in terms of merging: we should split the process group changes out from the unrelated addition of all the extra context() calls.

stishkin · 2023-02-22T21:59:52Z

Closing.

Will revisit some time later

spawn the agent in a process group / job

4e4e948

When we terminate the agent, any child processes it's created do not get terminated but should. By launching in a process group (or job) using the command-group crate, we can terminate the child and all of it's children automatically.

bmc-msft marked this pull request as draft August 23, 2021 22:22

demoray added 2 commits August 24, 2021 13:56

add context through most of the worker/agent processing in supervisor

d980d23

use a pinned hash of a PR for command-group that addresses the timeou…

bf7c35f

…t issue

bmc-msft and others added 2 commits August 25, 2021 14:56

Merge branch 'main' into spawn-agent-in-process-group

b48f3e1

Merge branch 'main' into spawn-agent-in-process-group

7698f43

bmc-msft linked an issue Sep 7, 2021 that may be closed by this pull request

Terminate fuzzers before node teardown #1227

Closed

demoray and others added 2 commits September 7, 2021 08:48

use released version

0619247

Merge branch 'main' into spawn-agent-in-process-group

f233646

mgreisen assigned chkeita Oct 19, 2021

Merge branch 'main' into spawn-agent-in-process-group

83e8dd3

chkeita requested a review from ranweiler November 9, 2021 19:48

ranweiler reviewed Nov 9, 2021

View reviewed changes

stishkin closed this Feb 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spawn the agent in a process group / job #1174

spawn the agent in a process group / job #1174

bmc-msft commented Aug 23, 2021

bmc-msft commented Aug 23, 2021

bmc-msft commented Aug 23, 2021

bmc-msft commented Aug 25, 2021

ranweiler left a comment

stishkin commented Feb 22, 2023

spawn the agent in a process group / job #1174

spawn the agent in a process group / job #1174

Conversation

bmc-msft commented Aug 23, 2021

bmc-msft commented Aug 23, 2021

bmc-msft commented Aug 23, 2021

bmc-msft commented Aug 25, 2021

ranweiler left a comment

Choose a reason for hiding this comment

stishkin commented Feb 22, 2023