Releases: 4thel00z/konan
Releases · 4thel00z/konan
v0.2.4
0.2.4 (2026-06-07)
Bug Fixes
- drop dead code left by the hot-path change (7e0e426)
Performance Improvements
- core: ICU4X sentence segmentation + lazy memchr unit scanners (276678d)
- core: stream fixed-size units lazily (49ffe7c)
Documentation
- refresh benchmark figures after the hot-path fixes (eadb77e)
v0.2.3
v0.2.2
v0.2.1
v0.2.0
0.2.0 (2026-06-07)
Features
- async-openai embedder backend, richer configs, config-revealing reprs (6caf2c2)
- benchmarks — bench.py, criterion benches, README results (5ed3bfa)
- core: Chunk, KonanError, span/offset utilities (3f40368)
- core: Chunker port and rayon chunk_many (f6a62d1)
- core: Embedder port and OpenAI-compatible adapter (7a7d7c2)
- core: FixedSizeChunker with overlap and sentence awareness (fe36188)
- core: MarkdownChunker with breadcrumbs and atomic code fences (c4bd62b)
- core: NaiveChunker (word-based) (99f2738)
- core: RecursiveChunker with separator hierarchy (3a1cbb1)
- core: SemanticChunker over the Embedder port (22621f8)
- core: SentenceChunker (unicode segmentation) (bf57195)
- core: TokenChunker (tiktoken cl100k/o200k) (d24ef27)
- py: bind six chunkers with sync, parallel and async methods (f115929)
- py: package surface with typed stubs and py.typed (b5fb787)
- py: SemanticChunker, OpenAIEmbedder, Python-callable embedder port (a8324ba)
- scaffold konan workspace (konan-core + konan-py, maturin) (8e2e74f)
Bug Fixes
- absolute logo URL so it renders on PyPI (618e185)
- clippy is_multiple_of lint in benches (CI stable toolchain) (1e7bb44)
- core: MarkdownChunker setext heading breadcrumbs (a242ce4)
- harden semantic chunking edge cases from review backlog (9e8e673)
Performance Improvements
- bpe-openai tokenizer + ASCII OffsetMap fast path (e877ea2)