Releases: krockxz/pdf2docx-healer
Releases · krockxz/pdf2docx-healer
v0.1.4 - Lean README
v0.1.3 - Improved Documentation
Improved README with badges, comparison table, and full documentation.
v0.1.2 - Initial Release
First public release of pdf2docx-healer.
What's included:
- Bullet/numbered list detection (Unicode + ASCII) with Word styles and OOXML numbering injection
- Hyperlink injection for http/https/www./mailto URLs (multi-URL runs)
- CJK font fallback mapping (SimSun, MS Gothic, Malgun Gothic, etc.)
- OCR pipeline via PyMuPDF + Tesseract with graceful fallback
- CLI: pdf2docx-heal with --ocr, --no-lists, --no-hyperlinks, --no-font-fix, --aggressive, --quiet flags
- GitHub Actions trusted publishing workflow to PyPI