Convert PDF to HTML without losing text or format.

pdf2htmlEX

A beautiful demo is worth a thousand words

pdf2htmlEX renders PDF files in HTML, utilizing modern Web technologies. Academic papers with lots of formulas and figures? Magazines with complicated layouts? No problem!

pdf2htmlEX is also a flexible online publishing tool that suits many different use cases.

Learn more about who should use pdf2htmlEX and why.

Features

  • Native HTML text with precise font and location.
  • Flexible output: all-in-one HTML or on-demand page loading (needs JavaScript).
  • Moderate file size, sometimes even smaller than the original PDF.
  • Support for links, outlines (bookmarks), printing, SVG backgrounds, Type 3 fonts, and more.
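
To illustrate the two output modes listed above, here is a minimal sketch of a Python wrapper around the `pdf2htmlEX` command-line tool. The helper names `build_command` and `convert` are hypothetical and not part of pdf2htmlEX. The sketch assumes the executable is on your `PATH` and that your build accepts the `--dest-dir` and `--split-pages` options; check `pdf2htmlEX --help` for the options your version supports.

```python
import shutil
import subprocess
from pathlib import Path


def build_command(pdf_path, dest_dir=".", split_pages=False):
    """Build the argument list for a pdf2htmlEX run (hypothetical helper)."""
    cmd = ["pdf2htmlEX", "--dest-dir", str(dest_dir)]
    if split_pages:
        # Assumed option: write pages to separate files for on-demand loading.
        cmd += ["--split-pages", "1"]
    cmd.append(str(pdf_path))
    return cmd


def convert(pdf_path, dest_dir=".", split_pages=False):
    """Run pdf2htmlEX if it is installed; raise a clear error otherwise."""
    if shutil.which("pdf2htmlEX") is None:
        raise FileNotFoundError("pdf2htmlEX executable not found on PATH")
    subprocess.run(build_command(pdf_path, dest_dir, split_pages), check=True)
    return Path(dest_dir)
```

Calling `convert("paper.pdf", "out")` would aim for a single all-in-one HTML file in `out`, while passing `split_pages=True` requests per-page files that the viewer loads on demand.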

Compare to others

Wiki Portals

Get in Touch

Get quick answers for common questions:

Don't miss the latest development news:

Discuss with the developers and the users of pdf2htmlEX

Chat with the main author: 王璐 (Lu Wang)

Want to help without coding? Thank you!

LICENSE

pdf2htmlEX, as a whole package, is licensed under GPLv3+. Some resource files are released under more relaxed licenses; see LICENSE for details.

Acknowledgements

pdf2htmlEX is made possible thanks to the following projects:

pdf2htmlEX is inspired by the following projects:

  • pdftohtml from poppler
  • MuPDF
  • PDF.js
  • Crocodoc
  • Google Docs

Special Thanks

  • Hongliang Tian
  • Wanmin Liu