Skip to content

Releases: krockxz/pdf2docx-healer

v0.1.4 - Lean README

17 Jun 23:48

Choose a tag to compare

Streamlined README. Removed unnecessary sections, kept the essential info for PyPI landing page.

v0.1.3 - Improved Documentation

17 Jun 23:48

Choose a tag to compare

Improved README with badges, comparison table, and full documentation.

v0.1.2 - Initial Release

17 Jun 23:48

Choose a tag to compare

First public release of pdf2docx-healer.

What's included:

  • Bullet/numbered list detection (Unicode + ASCII) with Word styles and OOXML numbering injection
  • Hyperlink injection for http/https/www./mailto URLs (multi-URL runs)
  • CJK font fallback mapping (SimSun, MS Gothic, Malgun Gothic, etc.)
  • OCR pipeline via PyMuPDF + Tesseract with graceful fallback
  • CLI: pdf2docx-heal with --ocr, --no-lists, --no-hyperlinks, --no-font-fix, --aggressive, --quiet flags
  • GitHub Actions trusted publishing workflow to PyPI