ENH: Please support subinterpreters #24755

Open
mkostousov opened this issue Sep 20, 2023 · 6 comments
@mkostousov

Proposed new feature or change:

NumPy 1.25.1, Python 3.12.0rc2
After creating a subinterpreter with its own GIL via the Python C API:

PyInterpreterConfig config = {
    /* Reject extension modules that do not declare multi-interpreter support */
    .check_multi_interp_extensions = 1,
    /* Give the subinterpreter its own GIL (PEP 684) */
    .gil = PyInterpreterConfig_OWN_GIL,
};
PyThreadState *tstate = NULL;
PyStatus status = Py_NewInterpreterFromConfig(&tstate, &config);
if (PyStatus_Exception(status)) {
    return -1;
}

importing numpy raises an exception:
ImportError: module numpy.core._multiarray_umath does not support loading in subinterpreters
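The same failure can be reproduced from pure Python, without embedding. A minimal sketch, assuming the private _xxsubinterpreters module that ships with CPython 3.12 (an unstable, internal module that may change in later releases):

import _xxsubinterpreters as interpreters

interp = interpreters.create()  # isolated subinterpreter
try:
    # Fails with RunFailedError wrapping the ImportError quoted above
    interpreters.run_string(interp, "import numpy")
finally:
    interpreters.destroy(interp)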

@mattip changed the title from "Support for subinterpreters" to "ENH: Please support subinterpreters" on Sep 21, 2023
@mattip (Member) commented Sep 21, 2023

PEP 554 states:

To mitigate that impact and accelerate compatibility, we will do the following:

  • be clear that extension modules are not required to support use in multiple interpreters
  • raise ImportError when an incompatible module is imported in a subinterpreter
  • provide resources (e.g. docs) to help maintainers reach compatibility
  • reach out to the maintainers of Cython and of the most used extension modules (on PyPI) to get feedback and possibly provide assistance

The PEP also links to Isolating Extensions, which has a lot of theory but does not clearly state how to migrate a large existing C-extension library like NumPy to support subinterpreters. I think we would need to:

  • move to HeapTypes
  • move all static state into module state
  • carefully analyze code for possible shared state.

I am a bit unclear on whether subinterpreters share a single GIL; if not, we would also have to carefully examine the code for possible race conditions.

This is a lot of work, and may have performance implications. What is your use case for subinterpreters? Do you think you could help with the effort or find funding for this effort?

@seberg (Member) commented Sep 26, 2023

This is a lot of work, and may have performance implications. What is your use case for subinterpreters? Do you think you could help with the effort or find funding for this effort?

I suspect the vast majority of changes will be relatively easy, but there is still the same problem: we need someone to explicitly dedicate time to this, and I doubt it will be one of the current core devs.
We even added a warning a long time back saying exactly that, but it seems the CPython changes made to improve subinterpreter support in the long run now enforce an error rather than a warning.

@a-reich commented Oct 10, 2023

PEP 554 states: …

FWIW, the recent CPython changes should be due to PEP 684 ("Per-Interpreter GIL"); PEP 554, which covers the Python API and subinterpreter-management features, is still in draft status.

@mdekstrand

Since PEP 684 there's a very strong use case for subinterpreters in parallel processing, one that I expect would be useful to a lot of numpy client code: running subinterpreters in separate threads enables shared memory (at least in the read-only case) with significantly less hassle than multiprocessing (a sketch follows below).
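A hypothetical sketch of that pattern, again assuming the private _xxsubinterpreters module from CPython 3.12. Today each worker would fail on import numpy with the error from this issue; once numpy supports subinterpreters, the workers would run in parallel under per-interpreter GILs:

import threading
import _xxsubinterpreters as interpreters

def run_in_subinterpreter(script):
    interp = interpreters.create()  # isolated: its own GIL under PEP 684
    try:
        interpreters.run_string(interp, script)
    finally:
        interpreters.destroy(interp)

script = "import numpy"  # currently raises; see the error at the top of this issue
threads = [threading.Thread(target=run_in_subinterpreter, args=(script,))
           for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()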

@a-reich commented Nov 10, 2023

I’m also very excited about the potential of using subinterpreters with numpy, and agree with what @mdekstrand said. In particular, the latest draft of PEP 734 discusses sharing data via the buffer protocol (already implemented in the private interpreters module since CPython 3.13.0a1). Since ndarrays can export their buffer or be created from one without copies, this could be a very nice pattern:

  • pickle your array with protocol 5 to get some small serialized metadata plus a memoryview of the data,
  • pass that view to a bunch of interpreters (which is basically instant), along with the small metadata,
  • unpickle in each interpreter: now all of them share the underlying data through their own arrays,
  • and if you don’t want to worry about data races, numpy can handle that by setting the read-only flag.

You get concurrency with performant, opt-in data sharing, without the hassle of managing subprocesses or of multiprocessing.shared_memory, where you have to allocate a fixed-size shared buffer ahead of time and can only create arrays backed by it. With interpreters you can take any array you happen to have and share it easily (see the sketch below).
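A minimal in-process sketch of the zero-copy mechanics. The cross-interpreter hand-off of the memoryview would use the PEP 734 draft API; here the round trip stays in one interpreter just to show that no data is copied:

import pickle
import numpy as np

arr = np.arange(1_000_000, dtype=np.float64)
arr.setflags(write=False)  # mark read-only so concurrent readers cannot race

# Protocol 5 splits the pickle into small metadata plus out-of-band buffers.
buffers = []
meta = pickle.dumps(arr, protocol=5, buffer_callback=buffers.append)
# buffers now holds one PickleBuffer viewing arr's memory; nothing was copied.

# Under PEP 734, `meta` (small bytes) and the buffer would be sent to each
# subinterpreter; unpickling reconstructs an array over the shared memory.
shared = pickle.loads(meta, buffers=buffers)
assert np.shares_memory(arr, shared)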
