jeremyjbowers/pdftable


Python module and command-line utility that analyzes the XML output of the
program pdftohtml in order to extract tables from PDF files. Outputs CSV.

For example:

pdftohtml -xml -stdout file.pdf | pdftable -f file%d.csv
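
To give a feel for the kind of input this pipeline works with, here is a
minimal sketch of turning pdftohtml's XML into CSV rows. It is not pdftable's
actual algorithm: it only groups <text> elements whose "top" coordinates lie
close together into a row and orders each row by "left". The function names,
the tolerance parameter, and the sample document are assumptions made for
illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET


def rows_from_pdf2xml(xml_text, tolerance=2):
    """Return, per page, a list of rows built from <text> elements.

    Elements whose 'top' is within `tolerance` units of the first element
    of the current row are treated as part of that row (a simplifying
    assumption, not pdftable's real table detection).
    """
    root = ET.fromstring(xml_text)
    pages = []
    for page in root.iter("page"):
        cells = []
        for node in page.iter("text"):
            # itertext() also picks up text nested in <b>, <i>, <a>.
            content = "".join(node.itertext()).strip()
            if content:
                cells.append((int(node.get("top")), int(node.get("left")), content))
        cells.sort()  # top to bottom, then left to right

        rows = []
        row_top = None
        for top, left, content in cells:
            if row_top is None or top - row_top > tolerance:
                rows.append([])
                row_top = top
            rows[-1].append((left, content))
        # Order each row by horizontal position and drop the coordinates.
        pages.append([[text for _, text in sorted(row)] for row in rows])
    return pages


def to_csv(rows):
    """Serialize a list of rows to a CSV string."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


# Hand-written sample in the shape of pdftohtml -xml output.
SAMPLE = """<pdf2xml>
<page number="1" top="0" left="0" height="1188" width="918">
<text top="100" left="50" width="40" height="12" font="0">Name</text>
<text top="101" left="200" width="30" height="12" font="0">Qty</text>
<text top="120" left="200" width="10" height="12" font="0">3</text>
<text top="120" left="50" width="40" height="12" font="0"><b>Apple</b></text>
</page>
</pdf2xml>"""

if __name__ == "__main__":
    for page_rows in rows_from_pdf2xml(SAMPLE):
        print(to_csv(page_rows), end="")
```

Real tables need more than this (column alignment, multi-line cells, text
outside the table), which is the part pdftable itself handles.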


See also 'pdftable -h' and http://sourceforge.net/projects/pdftable

Author: Kyle Cronan <kyle@pbx.org>

About

A fork of the pdftable library, unmaintained since 2009.
