v2.0.0rc1
Pre-release
Pre-release
First release candidate for Camelot 2.0 — please test and report issues before the final 2.0.0.
Highlights
- New PDF backend —
pypdf+pdfminer.six→playa-pdf: smaller install, accurate encrypted-PDF handling, faster hot paths. - Optional neural
flavor="ml"(Table Transformer) for borderless tables, with optional OCR for scanned / image-only PDFs —pip install "camelot-py[ml,ocr]". Structure from the model, text from the PDF (no hallucinated values). engine="combined"(default) and render-freeengine="vector"lattice detection;flavor="auto"per-page routing;flavor="hybrid".- Performance — lattice render skips the PNG round-trip,
get_table_index/text_in_bboxvectorised, one PDF open per call. - Quality-of-life:
Table.confidence,TableList.filter()/stack_contiguous(),per_page=,replace_text=, bytes/file-like input.
Upgrading from 1.0.x
See the migration guide for breaking changes (Python 3.10+, line_scale 40→15, to_excel defaults, default engine="combined", …). Full details in CHANGELOG.md.
Install
pip install --pre camelot-py==2.0.0rc1