You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
v1.3.0 — Candidate Fusion Pipeline
Highlights
Candidate fusion pipeline: Calibrated Noisy-OR fusion replaces the single-winner selection. All validation strategies can now fire at every position, and a confidence-based arbiter determines which errors to surface — significantly reducing false positives.
Confidence gates: Word error suppression and context evidence guards filter out low-confidence results on structurally-clean text.
Pipeline conflict resolution: Override matrix and candidate arbiter for resolving cross-strategy conflicts.
Added
StatisticalConfusableStrategy (priority 24): Bidirectional bigram ratio for detecting confusable word pairs in context.
ConfusableCompoundClassifierStrategy (priority 47): MLP-based compound word detection using ONNX.
Data-driven calibration: Per-strategy breakpoints and reliability weights loaded from YAML.
Error.source_strategy field tracks which strategy produced each error.
SpellCheckerBuilder.with_candidate_fusion() for fluent configuration.
Expanded confusable pairs from 37 to 87, mined from production database.