Use waitpid to iterate over all exited child processes #122

Merged
merged 11 commits into master from fix_zombie_processes on Aug 23, 2019

Conversation

s-ludwig (Member)

Fixes #116 and replaces #117 by extending the use of waitpid to also iterate over exited child processes in addition to avoiding zombie processes.

@s-ludwig (Member, Author)

@tchaloupka, @ZombineDev, this is what I had in mind - do you see any issues with this approach?

@tchaloupka

Hi, I just happened to be looking at this too :)

Looking at your changes:

  • Is the behavior that multiple signals can be coalesced into one documented anywhere? I've tried looking at some man pages (e.g. http://man7.org/linux/man-pages/man2/signalfd.2.html) and haven't found it anywhere - it seems pretty bad if that's the case :(
  • If coalesced signals really can be a problem, then this approach is probably the only valid one to ensure that we don't end up with zombies. What I'm afraid of is that when someone uses subprocesses via e.g. std.process or directly, we could end up reaping an exited process that we don't know about, which would lead to unexpected results in the original code (as the process would already have been waited on). Maybe it could be handled using waitid with WNOWAIT, but I guess that using P_ALL as the id type together with that flag would result in an infinite loop.
  • Is while (() @trusted { return read(cast(int)fd, &nfo, nfo.sizeof); } () == nfo.sizeof) really correct? Will read return 0 on the last signal? And is it correct to continue when an error is returned?

@s-ludwig (Member, Author)

Is the behavior that multiple signals can be coalesced into one documented anywhere? I've tried looking at some man pages (e.g. http://man7.org/linux/man-pages/man2/signalfd.2.html) and haven't found it anywhere - it seems pretty bad if that's the case :(

The POSIX standard just says that non-realtime signals may generally get coalesced, and this also applies to signalfd. Searching for "signalfd coalescing" or similar yields quite a lot of evidence that this also happens in practice, including with SIGCHLD in particular.

If coalesced signals really can be a problem, then this approach is probably the only valid one to ensure that we don't end up with zombies. What I'm afraid of is that when someone uses subprocesses via e.g. std.process or directly, we could end up reaping an exited process that we don't know about, which would lead to unexpected results in the original code (as the process would already have been waited on). Maybe it could be handled using waitid with WNOWAIT, but I guess that using P_ALL as the id type together with that flag would result in an infinite loop.

True, I was worried about the same thing, although of course we already had the same issue with just relying on signalfd. I don't really see an efficient way around this - iterating over all known process IDs using WNOHANG, instead of passing -1, seems to be the only safe way - resulting in overall quadratic complexity and the need to centrally keep track of all processes.
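
For illustration, here is a minimal sketch of that polling variant, assuming druntime's core.sys.posix.sys.wait bindings; knownPids and onExit are hypothetical placeholders rather than actual eventcore driver code:

import core.sys.posix.sys.types : pid_t;
import core.sys.posix.sys.wait : waitpid, WNOHANG, WIFEXITED, WEXITSTATUS;

// Hypothetical central list of child PIDs tracked by the driver.
pid_t[] knownPids;

void pollKnownChildren(void delegate(pid_t pid, int exitCode) onExit)
{
    foreach (pid; knownPids) {
        int status;
        // WNOHANG makes waitpid return immediately: 0 means the child is
        // still running, the PID itself means it has exited and been reaped.
        if (waitpid(pid, &status, WNOHANG) == pid && WIFEXITED(status))
            onExit(pid, WEXITSTATUS(status));
    }
}

Since something like this has to run over every tracked PID each time a child-exit notification arrives, it is where the quadratic complexity mentioned above comes from.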

Is while (() @trusted { return read(cast(int)fd, &nfo, nfo.sizeof); } () == nfo.sizeof) really correct? Will read return 0 on the last signal? And is it correct to continue when an error is returned?

It should return -1 with EAGAIN in that case. But it doesn't really matter, as the goal is just to drain the file descriptor; any error during that process is not really relevant, except maybe for logging. For errors other than EAGAIN it's possible that the signalfd will stop working, so recreating it would theoretically be an improvement here, although I'm unsure how realistic that case actually is in the first place.
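
For reference, a rough sketch of such a drain loop in plain D, using druntime's core.sys.linux.sys.signalfd binding (this is not the exact eventcore code):

import core.sys.linux.sys.signalfd : signalfd_siginfo;
import core.sys.posix.unistd : read;

// Read entries from the signalfd until nothing is left. The individual
// payloads are not relied upon, because coalescing means a single SIGCHLD
// entry may stand for several exited children.
void drainSignalFD(int fd)
{
    signalfd_siginfo nfo;
    while (read(fd, &nfo, nfo.sizeof) == nfo.sizeof) {}
    // For a non-blocking fd, read() has now returned -1 with EAGAIN (queue
    // drained) or some other error; either way the descriptor is as empty
    // as it is going to get.
}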

@tchaloupka commented Aug 21, 2019

Signals

Damn, good to know. I found it described here, for reference: https://ldpreload.com/blog/signalfd-is-useless

Worry

Due to signal coalescing, there probably isn't a better way to handle that.
It should probably be noted in the changelog then, so there are no surprises.

Loop

Damn, I just didn't notice the == nfo.sizeof at the end, nevermind..

Regarding the change, I've tried to add a test case for this and ended up with something like:

#!/usr/bin/env dub
/+ dub.sdl:
	name "test"
	dependency "eventcore" path=".."
+/

module test;

import core.sys.posix.sys.wait : waitpid, WNOHANG;
import core.time : Duration, msecs;
import eventcore.core;
import std.process : thisProcessID;
import std.stdio;

int numProc;

void main(string[] args)
{
	if (args.length == 2)
	{
		// Child mode: report the PID, sleep briefly, then exit.
		import core.thread : Thread;
		writeln("Child: ", args[1], " from ", thisProcessID);
		Thread.sleep(100.msecs);
	}
	else {
		// Parent mode: spawn 10 children and count down as each exit callback fires.
		ProcessID[] procs;
		foreach (_; 0..10) {
			auto p = eventDriver.processes.spawn(
				["./test", "hello"],
				ProcessStdinFile(ProcessRedirect.inherit),
				ProcessStdoutFile(ProcessRedirect.inherit),
				ProcessStderrFile(ProcessRedirect.inherit),
				null, ProcessConfig.none, null
			);
			assert(p != Process.init);

			numProc++;
			procs ~= p.pid;
			auto wres = eventDriver.processes.wait(p.pid, (ProcessID pid, int res) nothrow
			{
				numProc--;
				try writefln("Child %s exited with %s", pid, res);
				catch(Exception){}
			});
			if (wres == 0) numProc--;
			writeln("Started child: ", p.pid);
		}

		do eventDriver.core.processEvents(Duration.max);
		while (numProc);

		// All children must already have been reaped by the event driver.
		foreach (p; procs) assert(waitpid(cast(int)p, null, WNOHANG) == -1);
	}
}

It sometimes hangs indefinitely with my implementation, probably due to signal coalescing.
Feel free to add/modify it as needed.

@tchaloupka

Sorry, I should've added that in my local version I've also modified the return value of wait: it returns the index of the last callback, which can be 0 - the same value that is returned when the process has already exited (in which case the callback is not called).
So in the current version, wres in the test is always 0.

@s-ludwig (Member, Author)

Thanks, I modified the test to fail reliably, fixed the wait() return value, and fell back to the variant of iterating over all known processes to avoid the compatibility issue, since I figured that a performance bug is better than a hard-to-debug bad interaction with foreign code...

@tchaloupka

Great, thanks!

@s-ludwig (Member, Author)

After a lot of digging through the thousand puzzle pieces of the Posix API, it became clear that using SIGCHLD, and especially signalfd, is a futile approach as long as the process is not a fully controlled environment in terms of which code causes the process to fork/clone and how (not even mentioning custom signal handling). The most important practical example is that the vibe.core.process test in vibe-core currently hangs for DMD 2.087.x, and I couldn't figure out which change in Phobos/Druntime might be responsible for that.

So instead of SIGCHLD, the new approach is to start a separate wait thread as needed and let that thread call waitpid/waitid to perform the waiting in a blocking way. This also has the advantage that it now works on other Posix systems apart from Linux. And the vibe-core test now passes for DMD 2.087.1.
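
Roughly, the idea looks like the following sketch - a simplified one-thread-per-child variant for illustration only; the actual driver uses a shared wait thread and lives in source/eventcore/drivers/posix/processes.d:

import core.sys.posix.sys.wait : waitpid, WIFEXITED, WEXITSTATUS, WTERMSIG;
import core.thread : Thread;

// Hypothetical callback type; the real driver hands the result back to the
// event loop that owns the process slot instead of invoking it directly.
alias ExitCallback = void delegate(int pid, int exitCode);

void watchChild(int pid, ExitCallback onExit)
{
    auto t = new Thread({
        int status;
        // Plain blocking waitpid - no SIGCHLD handler or signalfd involved.
        if (waitpid(pid, &status, 0) == pid) {
            auto code = WIFEXITED(status) ? WEXITSTATUS(status) : -WTERMSIG(status);
            onExit(pid, code);
        }
    });
    t.isDaemon = true;
    t.start();
}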

Performance wise this is all pretty unfortunate, but it may be possible to make some advances in that regard later on.

@s-ludwig (Member, Author)

Forgot to CC @BenjaminSchaaf

@s-ludwig force-pushed the fix_zombie_processes branch 2 times, most recently from 919c787 to 7689219 on August 22, 2019 at 19:39
s-ludwig and others added 10 commits August 23, 2019 09:35
…eady exited.

Avoids overlap with valid wait IDs, so that a paired cancelWait() doesn't cancel a different wait.
Instead of using waitpid(-1), explicitly waits on all known processes. This is inefficient for large numbers of child processes, but seems to be the only way to ensure that we do not interfere with other code that uses waitpid().
It turns out that in a heterogeneous process where other parts of the code may start processes or threads and may be waiting for those to finish, it is not realistic to rely on signalfd or even SIGCHLD in general to get notified about child process exits. The only solid way appears to be to start a separate waiter thread that uses waitid/waitpid to wait for exited child processes in a blocking way.

This also fixes the hanging vibe.core.process test in vibe-core with DMD 2.087.x.
Integrates the contents of StaticProcesses into PosixEventDriverProcesses to fully hide it from the Windows build. It also changes lockedProcessInfo to be a non-template function, as that led to a linker error on macOS.
@s-ludwig (Member, Author)

BTW, if there are no objections, I'd like to merge this today and tag a new release, so that the vibe-core/vibe.d CI finally passes again on DMD 2.087.x.

@BenjaminSchaaf (Contributor)

@s-ludwig I plan on doing a review of this PR with some production code I have tonight (i.e. in the next couple of hours); if you don't mind waiting till then, that would be great. From a cursory glance it looks fine, though.

@s-ludwig (Member, Author)

Sure!

@BenjaminSchaaf (Contributor) left a comment

Certainly a much simpler and more robust approach for this than my previous code, thanks for all the improvements @s-ludwig!

In terms of performance, there is the option to use the PID returned by waitid instead of checking all processes. However, this could introduce subtle bugs in code that uses std.process alongside eventcore (where we would wait on a process spawned by std.process before the other code can). Not sure whether that's worth the performance gains or not.

Three review comments on source/eventcore/drivers/posix/processes.d (outdated, resolved).
} ();
}

foreach (pid; allprocs) {
@BenjaminSchaaf (Contributor)

I might be wrong here, but couldn't a process go out of reference at this point, causing lockedProcessInfo to get a null ProcessInfo*, resulting in a segfault? We should at least add an assert into lockedProcessInfo to make sure the pointer is not null.

I think this also begs the question of what we should do if a process is not waited on, i.e. the last reference is lost before it exits. Maybe it's worth putting an assert in releaseRef to make sure the last reference is lost after the process has completed, so that zombies are easier to debug.

@s-ludwig (Member, Author)

I might be wrong here, but couldn't a process go out of reference at this point, causing lockedProcessInfo to get a null ProcessInfo*, resulting in a segfault? We should at least add an assert into lockedProcessInfo to make sure the pointer is not null.

There is an if (info is null) check at line 371 (onProcessExitStatic), which should catch that case, if I'm not overlooking something.

I think this also begs the question of what we should do if a process is not waited on, i.e. the last reference is lost before it exits. Maybe it's worth putting an assert in releaseRef to make sure the last reference is lost after the process has completed, so that zombies are easier to debug.

I didn't notice it before starting to work on this PR, but the reference handling in general goes against the usual rules where releaseRef must be the final call to free up the associated slot. The changes required to fix this are large enough that I'd like to split this into a separate PR, though, also considering that this issue already exists in the current master version.

BTW, I think the reason why the sequence spawn -> wait -> releaseRef currently works is that the initial ref count is zero and wraps around to size_t.max after the wait is done, so that the final releaseRef call decrements it to size_t.max - 1 without asserting. This means that currently all slots will leak and eventually crash once a PID gets reused.
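
As a quick standalone illustration of that wraparound (plain D, not the driver code):

void main()
{
    size_t refCount = 0;             // initial ref count, as described above
    refCount--;                      // decrement when the wait completes
    assert(refCount == size_t.max);  // wrapped around instead of asserting
    refCount--;                      // the subsequent releaseRef() decrement
    assert(refCount == size_t.max - 1);
}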

@s-ludwig (Member, Author)

In terms of performance, there is the option to use the PID returned by waitid instead of checking all processes. However, this could introduce subtle bugs in code that uses std.process alongside eventcore (where we would wait on a process spawned by std.process before the other code can). Not sure whether that's worth the performance gains or not.

That was my original approach, but such bugs would be really nasty to track down, so if anything, I'd make that an opt-in behavior. I figured that this is okay for now, considering that I couldn't come up with a use case that has more than maybe a few dozen child processes open.
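
For context, this is roughly what the waitid-based alternative under discussion would look like, assuming druntime's waitid/idtype_t/WEXITED bindings and the si_pid/si_status accessors on siginfo_t (not what this PR merges):

import core.sys.posix.signal : siginfo_t;
import core.sys.posix.sys.wait : WEXITED, idtype_t, waitid;

// Reaps whichever child exits next - including children spawned by other
// code such as std.process, which is exactly the interference concern above.
bool reapAnyChild(out int pid, out int status)
{
    siginfo_t info;
    if (waitid(idtype_t.P_ALL, 0, &info, WEXITED) != 0)
        return false; // no children left, or some other error
    pid = info.si_pid;
    status = info.si_status;
    return true;
}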

@s-ludwig (Member, Author)

I've opened two followup issues for the two remaining points: #124, #125

@s-ludwig s-ludwig merged commit bca94d5 into master Aug 23, 2019
@s-ludwig s-ludwig deleted the fix_zombie_processes branch August 23, 2019 22:38
@schveiguy

Just saw this. Might this help with vibe-d/vibe-core#205 ?

@BenjaminSchaaf (Contributor)

This only affects programs that use the child process functionality, which I very much doubt the "vanilla vibe.d server" from that issue is using.

@schveiguy

Yeah, I figured it out after reading more closely. The "zombie process" thing is what triggered my interest. Sorry for the noise.

Successfully merging this pull request may close these issues: Posix processes driver causes zombie processes.
5 participants