Skip to content

yasbd v0.1.0 (First public release) 🎉

Choose a tag to compare

@speedyk-005 speedyk-005 released this 29 May 21:48
· 153 commits to main since this release

yasbd-lib is now on PyPI. Pure Python sentence boundary detection, 5 languages, drop-in pysbd adapter.

pip install yasbd-lib

What's inside

  • 2-pass pointer-based engine: abbreviation safe list + main splitter. No ML, no models, no bloat.
  • 5 languages: en, fr, es, ht, ja. Add yours by copying a template.
  • pysbd adapter: swap without changing a line of pipeline code.
  • Streaming: detect() yields integer offsets, segment() yields strings. Lazy generators, zero materialization.
  • Benchmarked against 6 competitors across 7 edge cases. #1 in accuracy.

What's next

  • More languages
  • spaCy pipeline component
  • StreamCleaner skip flags (issue #19)
  • Stabilize API for v0.2.0

Full changelog at CHANGELOG.md.