-
Notifications
You must be signed in to change notification settings - Fork 2.3k
[WIP] Re-enable Move Subinterpreter test for free-threaded Python 3.14 #5940
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
rwgk
wants to merge
1
commit into
pybind:master
Choose a base branch
from
rwgk:move_subinterpreter_freethreaded
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+0
−7
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
rwgk
added a commit
to XuehaiPan/pybind11
that referenced
this pull request
Dec 21, 2025
This was referenced Dec 21, 2025
rwgk
added a commit
to rwgk/pybind11
that referenced
this pull request
Dec 22, 2025
This commit improves the C++ test infrastructure to ensure test output is visible in CI logs, and disables a test that hangs on free-threaded Python 3.14+. Changes: ## CI/test infrastructure improvements - .github/workflows: Added `timeout-minutes: 3` to all C++ test steps to prevent indefinite hangs. - tests/**/CMakeLists.txt: Added `USES_TERMINAL` to C++ test targets (cpptest, test_cross_module_rtti, test_pure_cpp) to ensure output is shown immediately rather than buffered and possibly lost on crash/timeout. - tests/test_with_catch/catch.cpp: Added a custom Catch2 progress reporter with timestamps, Python version info, and a SIGTERM handler to make test execution and failures clearly visible in CI logs. ## Disabled hanging test - The "Move Subinterpreter" test is disabled on free-threaded Python 3.14+ due to a hang in Py_EndInterpreter() when the subinterpreter is destroyed from a different thread than it was created on. Work on fixing the underlying issue will continue under PR pybind#5940. Context: We were in the dark for months (since we started testing with Python 3.14t) because CI logs gave no clue about the root cause of hangs. This led to ignoring intermittent hangs (mostly on macOS). Our hand was forced only with the Python 3.14.1 release, when hangs became predictable on all platforms. For the full development history of these changes, see PR pybind#5933.
rwgk
added a commit
to rwgk/pybind11
that referenced
this pull request
Dec 22, 2025
Catch2 v2 doesn't have native skip support (v3 does with SKIP()). This macro allows tests to be skipped with a visible message while still appearing in the test list. Use this for the Move Subinterpreter test on free-threaded Python 3.14+ so it shows as skipped rather than being conditionally compiled out. Example output: [ RUN ] Move Subinterpreter [ SKIPPED ] Skipped on free-threaded Python 3.14+ (see PR pybind#5940) [ OK ] Move Subinterpreter
rwgk
added a commit
that referenced
this pull request
Dec 22, 2025
…p hanging Move Subinterpreter test (#5942) * Improve C++ test infrastructure and disable hanging test This commit improves the C++ test infrastructure to ensure test output is visible in CI logs, and disables a test that hangs on free-threaded Python 3.14+. Changes: ## CI/test infrastructure improvements - .github/workflows: Added `timeout-minutes: 3` to all C++ test steps to prevent indefinite hangs. - tests/**/CMakeLists.txt: Added `USES_TERMINAL` to C++ test targets (cpptest, test_cross_module_rtti, test_pure_cpp) to ensure output is shown immediately rather than buffered and possibly lost on crash/timeout. - tests/test_with_catch/catch.cpp: Added a custom Catch2 progress reporter with timestamps, Python version info, and a SIGTERM handler to make test execution and failures clearly visible in CI logs. ## Disabled hanging test - The "Move Subinterpreter" test is disabled on free-threaded Python 3.14+ due to a hang in Py_EndInterpreter() when the subinterpreter is destroyed from a different thread than it was created on. Work on fixing the underlying issue will continue under PR #5940. Context: We were in the dark for months (since we started testing with Python 3.14t) because CI logs gave no clue about the root cause of hangs. This led to ignoring intermittent hangs (mostly on macOS). Our hand was forced only with the Python 3.14.1 release, when hangs became predictable on all platforms. For the full development history of these changes, see PR #5933. * Add test summary to progress reporter Print the total number of test cases and assertions at the end of the test run, making it easy to spot if tests are disabled or added. Example output: [ PASSED ] 20 test cases, 1589 assertions. * Add PYBIND11_CATCH2_SKIP_IF macro to skip tests at runtime Catch2 v2 doesn't have native skip support (v3 does with SKIP()). This macro allows tests to be skipped with a visible message while still appearing in the test list. Use this for the Move Subinterpreter test on free-threaded Python 3.14+ so it shows as skipped rather than being conditionally compiled out. Example output: [ RUN ] Move Subinterpreter [ SKIPPED ] Skipped on free-threaded Python 3.14+ (see PR #5940) [ OK ] Move Subinterpreter * Fix clang-tidy bugprone-macro-parentheses warning in PYBIND11_CATCH2_SKIP_IF
Collaborator
Author
|
@b-pass, it'd be great if we could connect directly. Could you please email me under the email address you see with |
17a8695 to
1bf6e51
Compare
Collaborator
Author
|
@b-pass This is now a real PR. Could you take it from here? |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Continuation of work done initially under PRs #5934 and #5933
The write-up below is a Cursor-generated.
In an earlier draft of the below, Cursor asked "Could this be a bug in cpython?" Cursor was offering to generate a minimal C-only reproducer. We can come back to that if we want to.
__
Investigation: Free-Threaded Python 3.14 Hang in "Move Subinterpreter" Test
To: @b-pass
From: Investigation with Cursor AI assistance
Date: December 20, 2025
Re: PR #5933 - Root cause of
Py_EndInterpreter()hang on free-threaded Python 3.14.2Executive Summary
We've isolated the exact cause of the "Move Subinterpreter" test hang on free-threaded Python 3.14.2. The issue is not in
gil_safe_call_once_and_store, the internals capsule destructors, or any cleanup code.The root cause is a single line in
py::subinterpreter::create():When this is called after
PyThreadState_DeleteCurrent()during subinterpreter creation, it leaves the system in a state where later callingPy_EndInterpreter()from a different thread causes a deadlock.Background
The Failing Test
The "Move Subinterpreter" test (test_subinterpreter.cpp:94-119) does:
sub.reset())This test passes on:
This test hangs on:
Prior Findings
A pure C reproducer (
move_subinterpreter_redux.c) that mimics the same pattern using only CPython C API passes on both 3.14.0t and 3.14.1t. This indicated the issue was in pybind11's internals, not CPython itself.Investigation Methodology
We systematically created minimal test cases, each removing one aspect of pybind11's subinterpreter handling, until we found what causes the hang.
Test Matrix
move_subinterpreter_redux.cdebug_pure_c_with_pb11_main.cppdebug_no_swap_back.cppdebug_with_swap_back.cppPyThreadState_Swap(prev_tstate)after creationdebug_no_get_internals.cppget_internals()calldebug_no_num_interp.cppget_num_interpreters_seen()++py::subinterpreter::create()The critical finding: The only difference between passing and failing tests is
PyThreadState_Swap(prev_tstate)afterPyThreadState_DeleteCurrent().Root Cause Analysis
The Problematic Code Path
In
subinterpreter.h, thecreate()function does:Why This Causes a Hang
On free-threaded Python 3.14.2, the sequence:
PyThreadState_DeleteCurrent()- deletes the subinterpreter's creation tstatePyThreadState_Swap(prev_tstate)- swaps to main interpreter's tstate...appears to leave some internal state inconsistent. Specifically, when later:
Py_EndInterpreter()...the
Py_EndInterpreter()call deadlocks.The Pure C Pattern That Works
The working C reproducer does this instead:
The key difference: no
PyThreadState_Swap()afterPyThreadState_DeleteCurrent().Minimal Reproducer
Here's the minimal code that demonstrates the issue:
To fix: Remove the
PyThreadState_Swap(prev_tstate)line after creation.Attempted Fixes
We tried several quick fixes, none of which fully worked:
Attempt 1: Skip
PyThreadState_Swap(prev_tstate)on free-threaded PythonResult: No longer hangs, but crashes with:
The
subinterpreter_scoped_activate main_guard(main())destructor tries to callPyGILState_Release()but there's no current thread state after we skipped the swap.Attempt 2: Create a fresh thread state for main instead of reusing
prev_tstateResult: Segfault. The fresh tstate doesn't properly integrate with the saved state in
main_guard.Conclusion
The fix is not trivial because the
subinterpreter_scoped_activate main_guard(main())at line 85 saves state (gil_state_and/orold_tstate_) that becomes stale afterPyThreadState_DeleteCurrent(). A proper fix likely requires restructuring howcreate()manages the main interpreter's GIL, possibly:subinterpreter_scoped_activateinsidecreate()on free-threaded PythonFiles Referenced
include/pybind11/subinterpreter.h- lines 79-127 (create()function)tests/test_with_catch/test_subinterpreter.cpp- lines 94-119 ("Move Subinterpreter" test)debug_no_swap_back.cpp- passesdebug_with_swap_back.cpp- hangsAppendix: Debug Output
Passing Test (no swap back)
Failing Test (with swap back)
This investigation was conducted using local builds of Python 3.14.2 (default and free-threaded) from commit
df793163d58.