Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.

domoritz/pdffigures

 
 

pdffigures

pdffigures is a command line tool that can be used to extract figures, tables, and captions from scholarly documents. See the project website.

Usage

  1. Compile the command line tools:

make DEBUG=0

  2. Run on a new PDF document and display the results:

pdffigures -f /path/to/pdf

See pdffigures -help for a list of additional command line arguments.

Dependencies

pdffigures requires leptonica and poppler to be installed. On Mac OS X, both dependencies can be installed through Homebrew:

brew install leptonica poppler

On Ubuntu, these dependencies can be installed through apt-get:

sudo apt-get install libpoppler-dev libleptonica-dev

pdffigures uses std::regex, which libstdc++ did not fully implement before GCC 4.9, so compiling on Ubuntu requires g++ >= 4.9.
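To check the g++ requirement before running make, a small script along these lines may help. It is only a sketch and not part of pdffigures: the helper name version_ge is made up for illustration, and it assumes a sort that supports version ordering with -V (as GNU coreutils does).

```shell
#!/bin/sh
# Hypothetical helper (not part of pdffigures): succeeds when version $1
# is at least version $2. Assumes `sort -V` (version sort) is available.
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Minimum g++ version stated in this README.
required="4.9"

if command -v g++ >/dev/null 2>&1; then
    # -dumpversion prints the compiler version (some builds print only the major number).
    found="$(g++ -dumpversion)"
    if version_ge "$found" "$required"; then
        echo "g++ $found is new enough (>= $required)"
    else
        echo "g++ $found is too old; pdffigures needs >= $required"
    fi
else
    echo "g++ not found on PATH"
fi
```

If the check reports an older compiler, install a newer g++ before compiling.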

Support

pdffigures has been tested on Mac OS X 10.9 and Ubuntu 14.04. Windows is not supported.
