Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Convert text from PDF to XML.
JavaScript Python
Branch: master
Pull request Compare This branch is 2 commits ahead, 21 commits behind zejn:master.

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
.gitignore
README.rst
pdf2xml
pdfxml2csv
setup.py

README.rst

pypdf2xml

This project started as an alternative to poppler's pdftoxml, which didn't properly decode CID Type2 fonts in PDFs. This script requires pdfminer.

License

Public domain.

Something went wrong with that request. Please try again.