Four formats. One engine. Converts PDF, DOCX, XLSX, and HTML to Markdown and typed JSON, 15–40× faster than open-source tools of equivalent quality. Rust core with strictly typed Python bindings.
python html markdown rust pdf ai xlsx pdf-converter docx documents pdf-to-text tables document-parser rag pdf-to-json llm document-parsing langchain pdf-to-markdown tables-extraction
Updated Apr 21, 2026 - Rust