
Conversation

@bwhitt7 (Contributor) commented Oct 17, 2025

We're in the final stretch of getting the test suite running in parallel threads! This fixes #29552, which describes the project I've been working on in more detail.

This PR introduces a few changes:

  • Runs the test suite under pytest-run-parallel in a CI job. This job takes the place of one of the macOS Accelerate runs, specifically the fast test run with macos-14 and Python 3.14t-dev, so that CI doesn't take much longer and the same number of jobs are active as before.
  • Adds an option to the spin test command to make running under parallel threads easier. Users can now use spin test -p 4, compared to the previous method of spin test -- --parallel-threads=4 (see the sketch after this list).
  • Adds documentation for the spin test change. I was unable to build the docs locally, so please let me know if the syntax is incorrect.
  • Fixes a bug where tests under f2py weren't being properly flagged as thread-unsafe.
  • Updates the hypothesis version and modifies some of the hypothesis tests to be more thread-safe.
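
For context, here is a rough sketch of what the new spin option amounts to. This is not the actual spin configuration in NumPy's repository; the command wiring and option names below are illustrative, assuming a click-based command like the one spin uses:

```python
# Illustrative sketch only: forward a -p/--parallel-threads option to
# pytest-run-parallel's --parallel-threads flag. Not NumPy's actual spin code.
import subprocess
import sys

import click


@click.command()
@click.option("-p", "--parallel-threads", "parallel_threads", type=int, default=0,
              help="Run each test in N parallel threads (needs pytest-run-parallel).")
@click.argument("pytest_args", nargs=-1)
def test(parallel_threads, pytest_args):
    """Assemble and run the pytest invocation."""
    args = list(pytest_args)
    if parallel_threads:
        # pytest-run-parallel consumes this flag and runs each test body
        # concurrently in the requested number of threads.
        args.append(f"--parallel-threads={parallel_threads}")
    sys.exit(subprocess.call([sys.executable, "-m", "pytest", *args]))


if __name__ == "__main__":
    test()
```

With wiring along these lines, spin test -p 4 ends up equivalent to spin test -- --parallel-threads=4.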

With the change to the CI jobs, @ngoldbaum and I will keep an eye on any failing tests related to this PR, so that we can update our thread-unsafe markers and/or fix the failures. I also plan to update the documentation soon with more guidance on how to write thread-safe tests.

@bwhitt7 bwhitt7 changed the title CI: Enable parallel threads testing in macOS CI job CI, TST: Enable parallel threads testing in macOS CI job Oct 17, 2025
@ngoldbaum (Member)

Ugh, it looks like there are issues we'd need to fix in pytest-run-parallel before we can add it to the requirements file. Let's delete it from there for now and then explicitly install pytest-run-parallel in the parallel CI job.

@ngoldbaum (Member)

I opened scientific-python/spin#302 to track upstreaming the changes we're making to the spin configuration here and also making the output for spin test --help a little nicer now that there's support for both pytest-xdist and pytest-run-parallel. I think the way it is here is about as good as we can do from the NumPy side.

@ngoldbaum (Member)

Ping @rgommers - I always appreciate your take on CI changes, and you have a lot of context about spin.

@ngoldbaum (Member) left a comment

LGTM! I think you also need to take HypothesisWorks/hypothesis#4562 (comment) into account. I also think that hypothesis issue can be closed.

I pinged Ralf to take a look since this changes the CI configuration.

@rgommers rgommers added the component: CI and 39 - free-threading (PRs and issues related to support for free-threading CPython, a.k.a. no-GIL, PEP 703) labels Oct 21, 2025
@rgommers rgommers added this to the 2.4.0 release milestone Oct 21, 2025
@rgommers (Member) left a comment

Thanks @bwhitt7 and @ngoldbaum. The CI and test suite changes themselves look good to me. I also tested on Linux with a much higher thread count (-p 17 and -p 21), ran the full test suite several times, and found one failure (which showed up twice):

___________________________________ ERROR at call of TestRegression.test_openblas_threading ___________________________________

self = <numpy.linalg.tests.test_regression.TestRegression object at 0x5f98707d150>

    def test_openblas_threading(self):
        # gh-27036
        # Test whether matrix multiplication involving a large matrix always
        # gives the same (correct) answer
        x = np.arange(500000, dtype=np.float64)
        src = np.vstack((x, -10 * x)).T
        matrix = np.array([[0, 1], [1, 0]])
        expected = np.vstack((-10 * x, x)).T  # src @ matrix
        for i in range(200):
            result = src @ matrix
            mismatches = (~np.isclose(result, expected)).sum()
            if mismatches != 0:
>               assert False, ("unexpected result from matmul, "
                    "probably due to OpenBLAS threading issues")
E               AssertionError: unexpected result from matmul, probably due to OpenBLAS threading issues
E               assert False

expected   = array([[-0.00000e+00,  0.00000e+00],
       [-1.00000e+01,  1.00000e+00],
       [-2.00000e+01,  2.00000e+00],
       ...99997e+06,  4.99997e+05],
       [-4.99998e+06,  4.99998e+05],
       [-4.99999e+06,  4.99999e+05]], shape=(500000, 2))
i          = 1
matrix     = array([[0, 1],
       [1, 0]])
mismatches = np.int64(12)
result     = array([[ 0.00000e+00,  0.00000e+00],
       [-1.00000e+01,  1.00000e+00],
       [-2.00000e+01,  2.00000e+00],
       ...99997e+06,  4.99997e+05],
       [-4.99998e+06,  4.99998e+05],
       [-4.99999e+06,  4.99999e+05]], shape=(500000, 2))
self       = <numpy.linalg.tests.test_regression.TestRegression object at 0x5f98707d150>
src        = array([[ 0.00000e+00, -0.00000e+00],
       [ 1.00000e+00, -1.00000e+01],
       [ 2.00000e+00, -2.00000e+01],
       ...99997e+05, -4.99997e+06],
       [ 4.99998e+05, -4.99998e+06],
       [ 4.99999e+05, -4.99999e+06]], shape=(500000, 2))
x          = array([0.00000e+00, 1.00000e+00, 2.00000e+00, ..., 4.99997e+05,
       4.99998e+05, 4.99999e+05], shape=(500000,))

numpy/linalg/tests/test_regression.py:180: AssertionError


PARALLEL FAILED numpy/linalg/tests/test_regression.py::TestRegression::test_openblas_threading - AssertionError: unexpected result from matmul, probably due to OpenBLAS threading issues

I think that one should be marked as thread_unsafe. There is a maximum number of memory buffers OpenBLAS can use, so testing with a high level of parallelism (resulting in massive oversubscription) is not robust. That's not really relevant for real-world usage, so it's fine to just mark it as thread-unsafe and move on.

With that one change, I think this can be merged.
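
For illustration, here is a minimal sketch of what such a marking can look like, assuming pytest-run-parallel's thread_unsafe marker; the reason string is invented here, not the wording used in the PR:

```python
# Minimal sketch, assuming pytest-run-parallel's thread_unsafe marker;
# the reason text is illustrative and not the actual change made in the PR.
import numpy as np
import pytest


class TestRegression:
    @pytest.mark.thread_unsafe(
        reason="OpenBLAS has a bounded pool of memory buffers; heavy "
               "oversubscription can produce wrong matmul results"
    )
    def test_openblas_threading(self):
        # gh-27036: repeated large matmuls should always give the same answer.
        x = np.arange(500000, dtype=np.float64)
        src = np.vstack((x, -10 * x)).T
        matrix = np.array([[0, 1], [1, 0]])
        expected = np.vstack((-10 * x, x)).T  # src @ matrix
        for _ in range(200):
            result = src @ matrix
            assert np.isclose(result, expected).all(), (
                "unexpected result from matmul, probably due to "
                "OpenBLAS threading issues"
            )
```

With the marker in place, pytest-run-parallel runs the test once in a single thread instead of running it concurrently.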

@charris (Member) commented Oct 21, 2025

@rgommers I've seen that failure several times.

@bwhitt7 (Contributor, Author) commented Oct 21, 2025

@rgommers Thank you for testing this! I've marked the test.

@rgommers rgommers merged commit 97d41bf into numpy:main Oct 21, 2025
76 checks passed
@rgommers (Member)

Great - in it went! Nice work :)

@rgommers (Member)

> @rgommers I've seen that failure several times.

Yeah I think I've seen it before as well; it's just way easier to trigger when running multiple calls in parallel. I have a feeling we will be revisiting that particular test some more later (just not related to free-threading / parallel testing).
