Convert text from PDF to XML.
JavaScript Python
Pull request Compare This branch is 2 commits ahead, 23 commits behind zejn:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
.gitignore
README.rst
pdf2xml
pdfxml2csv
setup.py

README.rst

pypdf2xml

This project started as an alternative to poppler's pdftoxml, which didn't properly decode CID Type2 fonts in PDFs. This script requires pdfminer.

License

Public domain.