Convert PDF files to simple, readable HTML using a command-line tool.
- Converts single PDFs or entire folders
- Retains original filenames
- Simple, semantic HTML output
- CLI-friendly and pip-installable

Install with pip or pipx:

    pip install .
    pipx install path/to/pdf2html/

Convert a single file:

    pdf2html path/to/file.pdf -o output_folder

Convert all PDFs in a folder:

    pdf2html path/to/folder -o output_folder

Requirements:

- Python 3.8+
- pdfminer.six
- beautifulsoup4

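
For readers curious how the single-file and folder modes could fit together, here is a minimal sketch. It is not the tool's actual source. The function names `plan_outputs`, `text_to_html`, and `convert` are hypothetical, the HTML is built with the standard library's `html.escape` rather than beautifulsoup4, and the only third-party call is `pdfminer.high_level.extract_text` from pdfminer.six.

```python
from html import escape
from pathlib import Path


def plan_outputs(source, out_dir):
    """Pair each input PDF with an HTML path that keeps the original file stem."""
    source, out_dir = Path(source), Path(out_dir)
    if source.is_dir():
        # Folder mode: pick up every PDF directly inside the folder (no recursion).
        pdfs = sorted(p for p in source.iterdir() if p.suffix.lower() == ".pdf")
    else:
        # Single-file mode.
        pdfs = [source]
    return [(pdf, out_dir / (pdf.stem + ".html")) for pdf in pdfs]


def text_to_html(text, title):
    """Wrap blank-line-separated blocks of text in <p> tags inside a bare page."""
    blocks = [b.strip() for b in text.split("\n\n") if b.strip()]
    body = "\n".join("<p>" + escape(b) + "</p>" for b in blocks)
    head = '<head><meta charset="utf-8"><title>' + escape(title) + "</title></head>"
    return "<!DOCTYPE html>\n<html>" + head + "\n<body>\n" + body + "\n</body></html>\n"


def convert(source, out_dir):
    """Convert one PDF, or every PDF in a folder, into out_dir."""
    # Imported lazily so the helpers above work without pdfminer.six installed.
    from pdfminer.high_level import extract_text

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    for pdf, html_path in plan_outputs(source, out_dir):
        html = text_to_html(extract_text(str(pdf)), pdf.stem)
        html_path.write_text(html, encoding="utf-8")
```

Under these assumptions, `convert("path/to/folder", "output_folder")` would write one `.html` file per PDF, each named after its source file.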
License: MIT