You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Meta-classifier post-filter: A learned logistic regression model (41 features including one-hot error type, word frequency, and context signals) replaces manual per-strategy confidence thresholds. FPR dropped from 34.5% to 18.6%.
ConfusableSemanticStrategy (priority 48): MLM-enhanced confusable detection using masked language model logit comparison with asymmetric thresholds.
Rich Suggestion objects: Suggestion class with confidence and source metadata, backward-compatible (inherits from str).
Per-request CheckOptions: Runtime overrides for context_checking, grammar_checking, max_suggestions, and use_semantic.
Added
Kinzi (င်္) and consonant stacking variant support in confusable candidate generation.
MLM post-filter to suppress invalid_word and dangling_word false positives.
Expanded confusable pairs from 87 to 124+ with 9 linguistics-audit additions.
Expanded colloquial variants from 83 to 91 entries; removed 20 standard modern Burmese words incorrectly classified as colloquial.
Homophone morphological guard expanded to 2 prefixes and 4 compound suffixes.
Error.severity property with computed severity based on action type.
Candidate fusion enabled by default.
Changed
Config split: algorithm_configs.py split into 4 focused modules (algorithm_configs.py, text_configs.py, strategy_configs.py, infra_configs.py). All existing imports continue to work.
Benchmark consolidated to 1,146 sentences with 18 duplicate IDs fixed.