v0.6.9

@Mihailorama Mihailorama tagged this 04 Mar 10:23
PyMuPDFEngine now uses page.get_text("dict") to extract block-level
bounding boxes (type, coordinates, page number, text content) for
every processed PDF. This enables bbox overlay in the UI for all
digital PDFs, not just OCR-processed ones via Marker.
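
As a rough illustration of the kind of extraction described above, the sketch below flattens the dictionary returned by `page.get_text("dict")` into per-block records holding type, coordinates, page number, and text. It is not the project's actual PyMuPDFEngine code: the function names `blocks_from_page_dict` and `extract_blocks`, the record field names, and the 1-based page numbering are all assumptions made for this example.

```python
from typing import Any


def blocks_from_page_dict(page_dict: dict[str, Any], page_number: int) -> list[dict[str, Any]]:
    """Flatten one page's get_text("dict") output into block-level records.

    Hypothetical helper: the record layout is an assumption, not the
    project's real schema.
    """
    records = []
    for block in page_dict.get("blocks", []):
        # In PyMuPDF's dict output, type 0 is a text block and type 1 an image block.
        is_text = block.get("type") == 0
        text = ""
        if is_text:
            # Join the spans within each line, then join the lines with newlines.
            text = "\n".join(
                "".join(span["text"] for span in line.get("spans", []))
                for line in block.get("lines", [])
            )
        records.append(
            {
                "type": "text" if is_text else "image",
                "bbox": tuple(block["bbox"]),  # (x0, y0, x1, y1) in page coordinates
                "page": page_number,
                "text": text,
            }
        )
    return records


def extract_blocks(pdf_path: str) -> list[dict[str, Any]]:
    """Collect block records for every page of a PDF (requires PyMuPDF)."""
    import fitz  # PyMuPDF, imported lazily so the helper above works without it

    records = []
    with fitz.open(pdf_path) as doc:
        # Page numbers start at 1 here; that choice is an assumption.
        for page_number, page in enumerate(doc, start=1):
            records.extend(blocks_from_page_dict(page.get_text("dict"), page_number))
    return records
```

Records in this shape could be serialized and handed to a UI layer to draw overlays; because the coordinates come straight from the PDF's text layer, no OCR pass is needed for digital PDFs.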

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>