You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
v1.2.0
New Features
Statistical confusable gate — bigram ratio detection at priority 24 (within fast-path cutoff), achieving 78% confusable detection rate with zero ML models
MLP classifier for confusable/compound detection (F1=0.900) with ONNX export
CMS multi-signal scoring for confusable detection (ngram + collocation thresholds)
Neural reranker v2 — 3-gate reranking with MLM logit wiring and 19-feature MLP
Mandatory compound expansion from 63 → 3,315 via template mining
Title/suffix compound detection layer
POS-based V+particle detection for broken compounds
28 confusable pair benchmark sentences added
Improvements
Error budget relaxed from per-sentence skip to heavy-error-only guard
CMS threshold reduction extended to curated confusable pairs
Dual MLP/LightGBM training pipelines with configurable MLM logit wiring
Benchmark confidence gap flag for analysis
Pipeline hardening to reduce FPR on expanded benchmark