Skip to content

greipfrut/pdftohtml5canvas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

HTML5 files using canvas elements may sometimes be an easier format to extract useful elements (e.g. time series waveforms).

We found that waveforms embedded in pdfs keep a predictable structure when going from PDF->SVG->HTML5 Canvas.

Previously, this required opening up in Inkscape and saving as an HTML5 Canvas file. Not bad for a few files, but >5k files, it becomes useful to automate.

Usage

python pdftohtml5canvas.py DIRECTORYofPDFs

About

Convert a folder of many PDFs that have embedded time-series waveforms into HTML5 Canvas for easier extraction.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages