Convert PDF to HTML without losing text or format.


A beautiful demo is worth a thousand words:

Browser requirements


pdf2htmlEX renders PDF files in HTML using modern Web technologies. It aims for accurate rendering while keeping the output optimized for Web display.

pdf2htmlEX works best on text-based PDF files, for example scientific papers with complicated formulas and figures. Text, fonts and formatting are preserved natively in HTML, so you can still search and copy. The generated HTML file is static, with optional features powered by JavaScript.
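For readers who want to script the conversion described above, here is a minimal sketch in Python. It assumes the `pdf2htmlEX` binary is installed and on `PATH`, and it uses only the `--zoom` and `--dest-dir` command-line options. The helper names `build_command` and `convert` are hypothetical and are not part of pdf2htmlEX.

```python
import shutil
import subprocess


def build_command(pdf_path, zoom=None, dest_dir=None):
    """Assemble a pdf2htmlEX argument list (hypothetical helper)."""
    cmd = ["pdf2htmlEX"]
    if zoom is not None:
        # --zoom scales the rendered pages by the given ratio.
        cmd += ["--zoom", str(zoom)]
    if dest_dir is not None:
        # --dest-dir selects the directory that receives the output files.
        cmd += ["--dest-dir", str(dest_dir)]
    cmd.append(str(pdf_path))
    return cmd


def convert(pdf_path, zoom=None, dest_dir=None):
    """Run the conversion; raises if the binary is missing or exits non-zero."""
    if shutil.which("pdf2htmlEX") is None:
        raise FileNotFoundError("pdf2htmlEX was not found on PATH")
    subprocess.run(build_command(pdf_path, zoom, dest_dir), check=True)
```

Calling `convert("paper.pdf", zoom=1.3, dest_dir="out")` would run the equivalent of `pdf2htmlEX --zoom 1.3 --dest-dir out paper.pdf`. Building the argument list separately keeps the command easy to inspect without running the converter.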

Learn more about who should use pdf2htmlEX and why.


  • Precise and native text in HTML
  • Flexible Output
  • Moderate Size
  • More PDF features that you love: links, outlines & printing
  • SVG background output & Type 3 font conversion

Learn more
Compare with others

Wiki Portals

Get in Touch


pdf2htmlEX, as a whole package, is licensed under GPLv3. Some resource files are released under more relaxed licenses; read LICENSE for more details.


pdf2htmlEX is made possible thanks to the following projects:

pdf2htmlEX is inspired by the following projects:

  • pdftops & pdftohtml from poppler
  • MuPDF
  • PDF.js
  • Crocodoc
  • Google Doc

Special Thanks

  • Hongliang Tian
  • Wanmin Liu