Release v3.3.0 #39

rossarmstrong · 2025-12-19T09:57:57Z

Enhancements

Implemented a fast WER-only path built on a space-optimized two-row dynamic programming algorithm with batch buffer reuse. Four new functions (calculations_wer_only(), _calculations_wer_only_reuse_ptr(), _metrics_batch_wer_only(), metrics_wer_only()) skip the backtrace and use O(n) memory instead of O(m×n). The two rows are swapped by pointer rather than copied, and the DP buffers are reused across an entire batch. This speeds up wer() and wers(), which need only the WER value and not error counts or word lists.
Fixed a portability issue in WER-only batch processing by replacing platform-dependent int* pointers with guaranteed 32-bit cnp.int32_t* pointers. This ensures correct behavior on platforms where sizeof(int) is not 4 bytes and removes unnecessary type casts, in line with NumPy/Cython best practices.
Expanded benchmarking support by adding optional third-party WER libraries (pywer, evaluate, universal-edit-distance, torchmetrics) to pyproject.toml under the benchmarks extra. Updated benchmark_synthetic_data_local.py to safely import optional dependencies, ensure all benchmark functions are always defined, and enforce consistent numeric return types. This fixes static analysis warnings, prevents runtime errors when optional packages are missing, and enables more comprehensive and reliable cross-package performance comparisons.
Standardized all Levenshtein dynamic programming buffers and memoryviews to use cnp.int32_t instead of platform-dependent int. This ensures strict dtype alignment with NumPy int32 arrays, removes undefined behavior on platforms where sizeof(int) != 4, and improves type safety without impacting performance.
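
To illustrate the two-row, buffer-reusing approach described above, here is a minimal pure-Python sketch. It is not the project's Cython code: the names `_distance_two_row` and `wers_sketch` are hypothetical, `ctypes.c_int32` arrays stand in for the `cnp.int32_t` buffers, and rejecting empty references is an assumption made only to keep the example short.

```python
import ctypes
from typing import List, Sequence


def _distance_two_row(ref: Sequence[str], hyp: Sequence[str], prev, curr) -> int:
    """Word-level Levenshtein distance using only two DP rows.

    `prev` and `curr` are caller-owned 32-bit integer buffers with at least
    len(hyp) + 1 cells, so no allocation happens per sentence pair.
    """
    n = len(hyp)
    # Row 0: turning an empty reference prefix into hyp[:j] costs j insertions.
    for j in range(n + 1):
        prev[j] = j
    for i in range(1, len(ref) + 1):
        curr[0] = i  # turning ref[:i] into an empty hypothesis costs i deletions
        ref_word = ref[i - 1]
        for j in range(1, n + 1):
            substitution = prev[j - 1] + (ref_word != hyp[j - 1])
            curr[j] = min(prev[j] + 1, curr[j - 1] + 1, substitution)
        # Swap the row references instead of copying values between rows.
        prev, curr = curr, prev
    return prev[n]


def wers_sketch(refs: Sequence[str], hyps: Sequence[str]) -> List[float]:
    """Per-pair WER for a batch, reusing one pair of DP rows throughout."""
    ref_tokens = [r.split() for r in refs]
    hyp_tokens = [h.split() for h in hyps]
    # Size the rows once for the longest hypothesis in the batch.
    width = max((len(h) for h in hyp_tokens), default=0) + 1
    row_type = ctypes.c_int32 * width  # fixed 32-bit cells, unlike C `int`
    prev, curr = row_type(), row_type()
    scores = []
    for ref, hyp in zip(ref_tokens, hyp_tokens):
        if not ref:
            # Assumption for this sketch only: empty references are rejected.
            raise ValueError("reference must contain at least one word")
        scores.append(_distance_two_row(ref, hyp, prev, curr) / len(ref))
    return scores
```

Because only the previous and current rows are kept, memory grows with the hypothesis length rather than with the full m×n matrix. The trade-off is that no backtrace is available, so the sketch can report the WER value but not which words were inserted, deleted, or substituted.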

rossarmstrong added 8 commits December 19, 2025 11:38

⬆️ chore(version): update version to 3.3.0 across all config files

b3f498e

fix(metrics): use cnp.int32_t* for portable buffer pointer types

f6345f0

chore(benchmarks): add optional WER libraries and harden benchmark script

d7d0445

📝 docs(changelog): document benchmark dependency additions and script hardening

105b617

🔧 docs(readme): fix FOSSA badge link URL encoding

616e076

fix(cython): use cnp.int32_t typed buffers for Levenshtein DP matrices

c19ca61

fix(changelog): standardize Levenshtein DP buffers to use cnp.int32_t for improved type safety

7d5c260

fix(pyproject): add missing newline at end of benchmarks section

4a81160

rossarmstrong merged commit 4badd1c into main Dec 19, 2025
11 of 12 checks passed

rossarmstrong deleted the development branch December 19, 2025 10:01
