You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add Docling-native PPTX and common image input support (png, jpg, jpeg, tif, tiff, bmp, webp).
Add targeted CJK Markdown normalization for agent-facing source.md while preserving raw source.docling.json.
Add text_normalization and text_integrity quality signals for CJK cleanup, replacement characters, formula placeholders, and residual compatibility glyphs.
Refine PDF page-quality aggregation so isolated long-document page failures become medium-risk warnings, while short documents and high failed-page ratios remain failed_for_agent/high.
Refactor non-PDF conversion internals to share the sidecar attempt builder while preserving format-specific routing and PDF remediation behavior.
Add lightweight project-owned TypedDict contracts and a ruff dev check for import/unused/syntax hygiene.