yasbd v0.1.0 (First public release) 🎉
yasbd-lib is now on PyPI. Pure Python sentence boundary detection, 5 languages, drop-in pysbd adapter.
pip install yasbd-libWhat's inside
- 2-pass pointer-based engine: abbreviation safe list + main splitter. No ML, no models, no bloat.
- 5 languages: en, fr, es, ht, ja. Add yours by copying a template.
- pysbd adapter: swap without changing a line of pipeline code.
- Streaming:
detect()yields integer offsets,segment()yields strings. Lazy generators, zero materialization. - Benchmarked against 6 competitors across 7 edge cases. #1 in accuracy.
What's next
- More languages
- spaCy pipeline component
StreamCleanerskip flags (issue #19)- Stabilize API for v0.2.0
Full changelog at CHANGELOG.md.