PARI is not thread-safe from above #28800

embray · 2019-11-25T11:12:00Z

This is a follow-up to #26608. That ticket specifically discussed the issue of multi-threaded PARI causing Sage's docbuild to break. That problem is worked around in #28356, so I decided to close #26608.

However, the general problem remains, which is that PARI is not thread-safe from above, meaning that while threads created and managed by the PARI library itself work fine, threads created in a multi-system environment (like Sage) which happen to use PARI (specifically PARI built with multi-threading support) it will segfault.

This has been discussed in #26608 as well as this discussion on the OpenDreamKit project as well as related in-person conversations for which I unfortunately lack notes.

With #26608 resolved this is fortunately not an immediate problem for Sage, though it would be very easy for someone thinking they can just carelessly use threads (e.g. from the Python level) in their own code and experience similar crashes.

This is also not a problem just of PARI; getting this kind of multi-system multi-level parallelism right is hard, and should require cooperation, and specific guidelines to follow. Although I have not yet hit any other specific examples I have no doubt that the exist; for example I would not be surprised if HPC-GAP has similar problems.

Upstream: Reported upstream. Developers deny it's a bug.

CC: @antonio-rojas @jdemeyer @kiwifb @dimpase @saraedum @embray @timokau

Component: interfaces

Keywords: pari threading parallelism gap

Issue created by migration from https://trac.sagemath.org/ticket/28800

dimpase · 2019-11-25T12:18:47Z

comment:3

Macaulay2 does have these problems with Pari, AFAIK. They are thinking of removing Pari from their dependencies all together due to this.

embray · 2019-12-13T12:26:57Z

comment:4

As I think I mentioned in the original discussion, the issue could be mitigated in PARI somewhat, with a few strategically-placed checks to ensure that important thread-local variables have been initialized, and re-initialize them as needed (using more-or-less existing code for re-initializing PARI in its own, self-managed threads).

embray · 2019-12-13T12:33:44Z

comment:5

An alternative approach (although one that would still be made easier with some internal refactoring of PARI*) would be like Python's PyGILState_Ensure(). This places some onus on users of PARI in multi-threaded code to make sure PARI's interpreter state is properly initialized before using it in a new thread, which is not an unfair thing to ask users to do.

* I should clarify what I mean by this. When multi-threading was added to PARI, a number of global variables were simply converted directly to thread-local variables (sometimes it's not clear to me if all of them need to be thread-local; I don't know). By contrast, to use CPython again as an example (since I know it well), all variables that need to be thread specific (e.g. in PARI this would include things like the main stack pointer) are collected into a single PyThreadState struct, which makes it much easier to manage. Each thread has its own threadstate stored in a thread-local variable, so just one variable instead of a whole bunch (meaning only one call to the TSS APIs to get/set it). A similar reorganization of PARI's thread-specific variables would be helpful.

dimpase · 2020-08-23T10:08:23Z

comment:6

one way around this difficulty is to use spawn, not fork, in multiprocessing, something that is available in Python 3.7 and later.
One just calls

multiprocessing.set_start_method('spawn')

somewhere early enough.

This alone is not enough, one needs to rework various things due to spawn pickling the environment.

embray · 2020-08-31T13:49:04Z

comment:7

Yeah, setting set_start_method('spawn') globally would wreak havoc, though might be useful in some careful cases. I think this specific issue would be better addressed with improvements to how PARI manages its thread-local state.

embray added c: interfaces labels Nov 25, 2019

This comment has been minimized.

Sign in to view

yyyyx4 mentioned this issue Feb 16, 2024

"cysignals.signals.SignalError: Segmentation fault" when using multiprocessing #36370

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PARI is not thread-safe from above #28800

PARI is not thread-safe from above #28800

embray commented Nov 25, 2019

This comment has been minimized.

dimpase commented Nov 25, 2019

embray commented Dec 13, 2019

embray commented Dec 13, 2019

dimpase commented Aug 23, 2020

embray commented Aug 31, 2020

PARI is not thread-safe from above #28800

PARI is not thread-safe from above #28800

Comments

embray commented Nov 25, 2019

This comment has been minimized.

dimpase commented Nov 25, 2019

embray commented Dec 13, 2019

embray commented Dec 13, 2019

dimpase commented Aug 23, 2020

embray commented Aug 31, 2020