Matmul optim by FrancescAlted · Pull Request #616 · Blosc/python-blosc2

FrancescAlted · 2026-04-12T04:47:18Z

This can detect either specialized BLAS libraries that are used by NumPy (Linux/Windows) or MacOS Accelerate to use dgemm/sgemm in combination with Blosc2 prefilters. The result is a great acceleration in matmul, as can be seen in the plot below.

…tics Implement a runtime-discovered CBLAS backend for the matmul fast path on Linux/Windows, alongside the existing Accelerate/macOS path and naive fallback. Probe BLAS candidates from the active NumPy/conda environment, load providers exporting cblas_sgemm/cblas_dgemm, and fall back cleanly to naive when none fit. Control nested BLAS threading from Python with threadpoolctl around fast blosc2.matmul calls, but only for Linux CBLAS and only for small blocks. Use a benchmark-derived threshold of 192x192 to keep BLAS single-threaded for small GEMMs while avoiding regressions on larger ones; never apply this on macOS. Expose backend introspection via blosc2.get_matmul_library(), returning the loaded CBLAS library path or Accelerate.framework when available. Add BLOSC_TRACE diagnostics for CBLAS candidate probing, rejection, selection, and backend fallback decisions. Extend matmul benchmarks to report the active matmul library, compare against plain NumPy matmul, support warmup iterations, and use larger default problem sizes for steadier out-of-the-box results. Add tests covering backend selection, threadpoolctl usage/skips, threshold behavior, Darwin scoping, and matmul-library introspection; add threadpoolctl as a regular non-wasm dependency.

FrancescAlted and others added 9 commits March 23, 2026 13:34

Use Accelerate for fast matmul blocks on macOS

13063c2

Add fast-naive stats as well in bench

4bedf3d

Add missing matmul kernel C files

8c0890f

Add get_matmul_library to actual sphinx docs

d30956f

Restrict linux test to well, linux

3009e86

Make the number of repeasts 3 by default

ffccab7

Relax matmul thread-limit tests for macOS NumPy warnings

4ca52a2

Merge branch 'main' into matmul-optim

06c1541

FrancescAlted merged commit 8845a05 into main Apr 12, 2026
16 of 17 checks passed

FrancescAlted deleted the matmul-optim branch April 12, 2026 05:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Matmul optim#616

Matmul optim#616
FrancescAlted merged 9 commits intomainfrom
matmul-optim

FrancescAlted commented Apr 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

FrancescAlted commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

FrancescAlted commented Apr 12, 2026 •

edited

Loading