v2.0.0rc1

Pre-release

@bosd bosd released this 25 May 13:15
· 7 commits to master since this release
c227afd

First release candidate for Camelot 2.0 — please test and report issues before the final 2.0.0.

Highlights

  • New PDF backend: pypdf + pdfminer.six → playa-pdf, giving a smaller install, accurate encrypted-PDF handling, and faster hot paths.
  • Optional neural flavor="ml" (Table Transformer) for borderless tables, with optional OCR for scanned / image-only PDFs: pip install "camelot-py[ml,ocr]". Structure comes from the model, text from the PDF (no hallucinated values).
  • engine="combined" (default) and render-free engine="vector" lattice detection; flavor="auto" per-page routing; flavor="hybrid".
  • Performance — lattice render skips the PNG round-trip, get_table_index/text_in_bbox vectorised, one PDF open per call.
  • Quality-of-life: Table.confidence, TableList.filter()/stack_contiguous(), per_page=, replace_text=, bytes/file-like input.
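As a rough illustration of how the flavor and engine options above might be chosen, here is a minimal sketch. It assumes that engine= is passed to camelot.read_pdf as a keyword argument alongside flavor=, as the notation in the highlights suggests; the helper names read_kwargs and extract are hypothetical and not part of Camelot.

```python
from typing import Any


def read_kwargs(borderless: bool = False, render_free: bool = False) -> dict[str, Any]:
    """Pick keyword arguments using only the option values named in the highlights."""
    if borderless:
        # Neural flavor; needs the optional extra: pip install "camelot-py[ml]"
        return {"flavor": "ml"}
    # Lattice detection: "combined" is the 2.0 default, "vector" is render-free.
    # Assumption: engine= is accepted by camelot.read_pdf for lattice detection.
    return {"flavor": "lattice", "engine": "vector" if render_free else "combined"}


def extract(path: str, **choices: bool):
    """Hypothetical wrapper: read page 1 of `path` with the chosen options."""
    import camelot  # imported lazily so read_kwargs works without Camelot installed

    return camelot.read_pdf(path, pages="1", **read_kwargs(**choices))
```

For example, extract("report.pdf", render_free=True) would request lattice detection with engine="vector"; report.pdf is a placeholder file name. Check the CHANGELOG for the exact signature before relying on this.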

Upgrading from 1.0.x

See the migration guide for breaking changes (Python 3.10+, line_scale 40→15, to_excel defaults, default engine="combined", …). Full details in CHANGELOG.md.

Install

pip install --pre camelot-py==2.0.0rc1