v0.1.2
Embedded POS Tagger
Replaced the spaCy subprocess (uv run) POS tagger with an embedded averaged perceptron tagger. No more Python/uv/spaCy dependency for tokenization.
What changed
- New tagger module in voice-g2p: averaged perceptron POS tagger ported from misaki-rs, weights embedded at compile time via include_str!
- Removed uv_path from G2PConfig: no more shelling out to spaCy for POS tags
- Tagger weights (5.7 MB) stored with git LFS
- espeak-ng subprocess kept as OOV pronunciation fallback (no C deps)
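
For readers unfamiliar with the approach, here is a minimal sketch of how an embedded perceptron tagger can score tags at inference time. It is not the voice-g2p implementation: the weight format (feature, tag, weight separated by tabs), the feature set, the tag names, and the functions parse_weights, features, predict and tag are all assumptions made for illustration. The weights are written as an inline string so the example compiles on its own; a real build would more likely pull them in with include_str! on a weights file.

```rust
use std::collections::HashMap;

// Hypothetical weight format: one "feature<TAB>tag<TAB>weight" triple per line.
// Inlined here so the sketch is self-contained; a real crate could instead use
// include_str!("...") on a weights file to embed it at compile time.
const WEIGHTS: &str = "\
bias\tNOUN\t0.1
suffix=ing\tVERB\t2.0
suffix=ing\tNOUN\t0.5
prevtag=DET\tNOUN\t1.5
prevtag=DET\tVERB\t-1.0
word=the\tDET\t3.0
";

// feature -> (tag -> weight)
type Weights = HashMap<String, HashMap<String, f32>>;

// Parse the embedded text into a lookup table, skipping malformed lines.
fn parse_weights(raw: &str) -> Weights {
    let mut weights = Weights::new();
    for line in raw.lines() {
        let mut parts = line.split('\t');
        if let (Some(feat), Some(tag), Some(w)) = (parts.next(), parts.next(), parts.next()) {
            if let Ok(w) = w.parse::<f32>() {
                weights
                    .entry(feat.to_string())
                    .or_default()
                    .insert(tag.to_string(), w);
            }
        }
    }
    weights
}

// Illustrative feature set: bias, lowercased word, 3-char suffix, previous tag.
fn features(word: &str, prev_tag: &str) -> Vec<String> {
    let lower = word.to_lowercase();
    let chars: Vec<char> = lower.chars().collect();
    let start = chars.len().saturating_sub(3);
    let suffix: String = chars[start..].iter().collect();
    vec![
        "bias".to_string(),
        format!("word={lower}"),
        format!("suffix={suffix}"),
        format!("prevtag={prev_tag}"),
    ]
}

// Sum the weights of every active feature per tag and return the best tag.
// Ties are broken by tag name so the result is deterministic.
fn predict(weights: &Weights, feats: &[String]) -> Option<String> {
    let mut scores: HashMap<&str, f32> = HashMap::new();
    for f in feats {
        if let Some(per_tag) = weights.get(f) {
            for (tag, w) in per_tag {
                *scores.entry(tag.as_str()).or_insert(0.0) += *w;
            }
        }
    }
    scores
        .into_iter()
        .max_by(|a, b| a.1.total_cmp(&b.1).then_with(|| b.0.cmp(a.0)))
        .map(|(tag, _)| tag.to_string())
}

// Greedy left-to-right tagging; each prediction feeds the next word's features.
fn tag(weights: &Weights, words: &[&str]) -> Vec<String> {
    let mut prev_tag = "<s>".to_string();
    let mut out = Vec::with_capacity(words.len());
    for w in words {
        let t = predict(weights, &features(w, &prev_tag)).unwrap_or_else(|| "X".to_string());
        prev_tag = t.clone();
        out.push(t);
    }
    out
}

fn main() {
    let weights = parse_weights(WEIGHTS);
    println!("{:?}", tag(&weights, &["the", "running"]));
}
```

The "averaged" part of an averaged perceptron applies to training, where the stored weights are averages over all updates; inference is just the weighted sum and argmax shown here. With the toy weights above, "running" is tagged VERB on its own but NOUN after "the", because the previous-tag feature shifts the scores.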
Previous release
No prior GitHub releases. All crates were at 0.1.1 on crates.io.