natlaughlin/pdftohtml5

pdftohtml5

A PDF document translator that produces an equivalent HTML5 document with near pixel-perfect accuracy.

  • Character positions are grouped into logical block elements.
  • Embedded PDF fonts are converted with FontForge and inlined in the HTML header as base64-encoded OpenType fonts.
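
To illustrate the font step above, here is a minimal sketch of how OpenType font bytes could be inlined as a base64 data URI inside a CSS @font-face rule. The class name FontFaceSketch and the method fontFaceRule are hypothetical and are not taken from this project's source. The sketch assumes FontForge has already produced the OpenType bytes, and it uses a four-byte placeholder instead of a real font.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper, for illustration only; not part of pdftohtml5.
public class FontFaceSketch {

    // Builds a @font-face rule whose source is the font itself,
    // base64-encoded into a data URI so the HTML needs no external files.
    static String fontFaceRule(String family, byte[] otfBytes) {
        String b64 = Base64.getEncoder().encodeToString(otfBytes);
        return "@font-face { font-family: '" + family + "'; "
             + "src: url(data:font/otf;base64," + b64 + ") format('opentype'); }";
    }

    public static void main(String[] args) {
        // Placeholder bytes (the "OTTO" tag that begins CFF-flavoured OpenType files).
        // A real run would pass the full converted font file here.
        byte[] placeholder = "OTTO".getBytes(StandardCharsets.US_ASCII);
        System.out.println("<style>" + fontFaceRule("F1", placeholder) + "</style>");
    }
}
```

Text in the generated page can then reference the font by its family name (here F1) through an ordinary font-family declaration, which keeps the output a single self-contained HTML file.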

Examples

Requirements

Instructions

  • Build the jar: mvn clean package
  • Run the converter: java -cp ./target/pdftohtml5-0.0.1-SNAPSHOT.jar com.natlaughlin.pdftohtml5.PdfToHtml5 <OPTIONS> <PDF file>

TODO

  • Some special font types are not currently supported.
  • Add more PDF test cases.
