Better handling of os.fork from inside trio #464

njsmith · 2018-03-04T00:42:59Z

If you call os.fork from inside a trio context, and then try to use trio in both the parent and the child, you are going to have a bad time. fork is a very powerful tool, and leaves the process in a strange state: the child is just like the parent, except that threads etc. have disappeared, and they share all fd's (including the self-pipe), etc. This seems essentially unsolvable: if you want to start a new process from trio using fork+exec, that's fine, but anything else is probably not going to work. I guess if you fork and then only execute synchronous code in the child then things might be mostly OK (though you've just leaked a bunch of memory in the child, because it will forever keep copies of all the parent process's data, and copy-on-write won't save you because of Python's well-known issues with GC causing un-sharing).

(See also bpo-21998, which is asyncio's version of this issue.)

We should at least document this. It would also be good to actually catch it and provide some error, since nobody reads the documentation.

One approach is to record the pid when we enter trio, and then each time we touch trio state, verify that os.getpid() still gives the same value. I guess "enter trio" here means any of the @_public methods in trio/_core, yielding to the event loop, and TrioToken.run_sync_soon?

In 3.7 there's os.register_at_fork, which could potentially be useful (maybe to reduce overhead?), but I'm not entirely sure how.

The text was updated successfully, but these errors were encountered:

smurfix · 2018-03-10T20:56:07Z

I don't particularly like fiddling with getpid() all the time, that's 99.999999% nonproductive work trying to catch the one mistake somebody made. I can't think of a way to get a worse usefulness/required_work ratio than that.

IMHO the best solution would be to os.register_at_fork an after_in_child handler which monkeypatch-poisons the current Trio mainloop and/or task so that it'll fall flat on its face the next time it's entered, preferably in such a way that there's a semi-understandable error message. That at least would cause no additional busywork for the common case (no-fork-ever).

njsmith · 2018-03-10T22:17:40Z

Ok, but: how do you propose to poison the main loop if not by setting some flag that we check constantly :-)

…

On Mar 10, 2018 12:56 PM, "Matthias Urlichs" ***@***.***> wrote: I don't particularly like fiddling with getpid() all the time, that's 99.999999% nonproductive work trying to catch the one mistake somebody made. I can't think of a way to get a worse usefulness/required_work ratio than that. IMHO the best solution would be to os.register_at_fork an after_in_child handler which monkeypatch-poisons the current Trio mainloop and/or task so that it'll fall flat on its face the next time it's entered, preferably in such a way that there's a semi-understandable error message. That at least would cause no additional busywork for the common case (no-fork-ever). — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#464 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAlOaGe2Db230c_rrR_LDHuKAQgM6Sa2ks5tdD3ogaJpZM4SbJUl> .

smurfix · 2018-03-10T23:11:59Z

At the very least, checking a flag for True/False is less expensive than a system call.

Another way would be to simply del GLOBAL_RUN_CONTEXT.task which would then trigger an internal Trio error when run_impl() tries to do the same thing, if not sooner.

sseg · 2018-03-15T13:26:42Z

Would that be sufficient to raise a warning on fork? Since the first-order solution was documentation, it seems unnecessary to me to do more than document the undefined behavior at runtime.

njsmith added design discussion polish user happiness labels Mar 4, 2018

oremanj added subprocesses low-level labels Oct 17, 2019

goodboy mentioned this issue Dec 10, 2021

Moar spawning backends (rsyscall, nogil threads) goodboy/tractor#272

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better handling of os.fork from inside trio #464

Better handling of os.fork from inside trio #464

njsmith commented Mar 4, 2018

smurfix commented Mar 10, 2018

njsmith commented Mar 10, 2018 via email

smurfix commented Mar 10, 2018

sseg commented Mar 15, 2018

Better handling of os.fork from inside trio #464

Better handling of os.fork from inside trio #464

Comments

njsmith commented Mar 4, 2018

smurfix commented Mar 10, 2018

njsmith commented Mar 10, 2018 via email

smurfix commented Mar 10, 2018

sseg commented Mar 15, 2018