Fixed
PDF image extraction panic on mismatched buffer lengths (#552): Replaced assert! with graceful error handling. Malformed PDF images are now skipped instead of panicking. Regression from v4.5.0.
pdf feature compilation without layout-detection (#550): config.layout reference gated behind #[cfg(feature = "layout-detection")].
WASM module resolution in Supabase/Deno edge functions (#551): Added explicit package.json exports and Deno detection in wasm-loader.
zip dependency pinned below 7.4: Avoids let-chain build failures on some stable Rust toolchains (#549).
Vendored HWP text extraction: Replaced external hwpers crate with vendored subset (~1,650 lines). Eliminates zip 2.x transitive dependency that caused WASM/CI build failures.
Ruby binding missing table_model field in LayoutDetectionConfig initializer.
Clippy/unused variable warnings in table recognition and pipeline modules.
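The PDF image fix (#552) swaps a panicking assert! for an error path that skips bad images. A minimal sketch of that pattern follows, assuming a simple raw-image record; `RawImage`, `ImageError`, `validate`, and `extract_valid` are hypothetical names for illustration and are not taken from the library's source.

```rust
/// Hypothetical error type for this sketch (not the library's real type).
#[derive(Debug, PartialEq)]
enum ImageError {
    /// The pixel buffer does not match width * height * channels.
    LengthMismatch { expected: usize, actual: usize },
    /// The declared dimensions overflow usize when multiplied.
    DimensionOverflow,
}

/// Hypothetical raw image record as it might come out of a PDF stream.
struct RawImage {
    width: usize,
    height: usize,
    channels: usize,
    data: Vec<u8>,
}

/// Returns an error instead of panicking when the buffer length is wrong.
/// The old behaviour would be roughly: assert!(data.len() == expected).
fn validate(img: &RawImage) -> Result<(), ImageError> {
    let expected = img
        .width
        .checked_mul(img.height)
        .and_then(|n| n.checked_mul(img.channels))
        .ok_or(ImageError::DimensionOverflow)?;
    if img.data.len() != expected {
        return Err(ImageError::LengthMismatch {
            expected,
            actual: img.data.len(),
        });
    }
    Ok(())
}

/// Keeps well-formed images and silently drops malformed ones,
/// so one bad image does not abort the whole extraction.
fn extract_valid(images: Vec<RawImage>) -> Vec<RawImage> {
    images
        .into_iter()
        .filter(|img| validate(img).is_ok())
        .collect()
}
```

In a real pipeline the skipped image would probably be logged or reported as a warning rather than dropped silently; that detail is left out of the sketch.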
Added
prepend_heading_context chunking option: When true and chunker_type is Markdown, prepends the heading hierarchy path (e.g. # Title > ## Section) to each chunk's content string. Useful for RAG pipelines where chunks need self-contained structural context. Available across all 10 language bindings, CLI, and WASM. Includes fixture-driven e2e tests and documentation for all languages.
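To illustrate what prepending the heading hierarchy does to a chunk, here is a rough standalone sketch. It does not use the library's chunker or its API; `chunk_with_heading_context`, `heading_level`, and `flush` are hypothetical helpers, the split-at-every-heading strategy is an assumption, and `#` lines inside fenced code blocks are not handled.

```rust
/// Returns the ATX heading level (1..=6) of a line, or None for non-headings.
fn heading_level(line: &str) -> Option<usize> {
    let trimmed = line.trim_start();
    let hashes = trimmed.chars().take_while(|c| *c == '#').count();
    // '#' is one byte in UTF-8, so `hashes` is also a valid byte offset.
    if (1..=6).contains(&hashes) && trimmed[hashes..].starts_with(' ') {
        Some(hashes)
    } else {
        None
    }
}

/// Emits the accumulated body as one chunk, prefixed with the heading path.
fn flush(stack: &[(usize, String)], body: &mut String, chunks: &mut Vec<String>) {
    let text = body.trim();
    if !text.is_empty() {
        let path: Vec<&str> = stack.iter().map(|(_, h)| h.as_str()).collect();
        if path.is_empty() {
            chunks.push(text.to_string());
        } else {
            // Path format follows the example in the release note:
            // "# Title > ## Section"
            chunks.push(format!("{}\n\n{}", path.join(" > "), text));
        }
    }
    body.clear();
}

/// Splits Markdown at headings and prepends the heading hierarchy to each chunk.
fn chunk_with_heading_context(markdown: &str) -> Vec<String> {
    let mut stack: Vec<(usize, String)> = Vec::new(); // (level, heading line)
    let mut chunks = Vec::new();
    let mut body = String::new();

    for line in markdown.lines() {
        if let Some(level) = heading_level(line) {
            flush(&stack, &mut body, &mut chunks);
            // Drop headings at the same or a deeper level before pushing.
            while stack.last().map_or(false, |(l, _)| *l >= level) {
                stack.pop();
            }
            stack.push((level, line.trim().to_string()));
        } else {
            body.push_str(line);
            body.push('\n');
        }
    }
    flush(&stack, &mut body, &mut chunks);
    chunks
}
```

With this sketch, the text under `## Section` inside `# Title` becomes a chunk that starts with `# Title > ## Section`, so a retrieved chunk still carries its position in the document.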