little node script to parse a few hundred pdfs of the same form and save results in a csv
JavaScript
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
pdfs initial commit Nov 28, 2014
.gitignore changed gitignore Feb 3, 2015
LICENSE Initial commit Nov 28, 2014
README.md Update README.md Feb 11, 2015
package.json
parse.js initial commit Nov 28, 2014

README.md

node-pdfparser-example

little node script to parse a few hundred pdfs of the same form and save results in a csv. see corresponding blog post: http://timogrossenbacher.ch/2014/11/parsing-thousands-of-pdfs-with-javascript/

to install and run:

git clone https://github.com/wnstnsmth/node-pdfparser-example.git
npm install
node parse.js

output will be written to output.csv and there are some example pdfs in the pdfs folder.