little node script to parse a few hundred pdfs of the same form and save results in a csv
pdfs
.gitignore
LICENSE
README.md
package.json
parse.js

node-pdfparser-example

little node script that parses a few hundred pdfs sharing the same form layout and saves the results to a csv file. see the corresponding blog post: http://timogrossenbacher.ch/2014/11/parsing-thousands-of-pdfs-with-javascript/

to install and run:

git clone https://github.com/wnstnsmth/node-pdfparser-example.git
cd node-pdfparser-example
npm install
node parse.js

output will be written to output.csv and there are some example pdfs in the pdfs folder.
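
for orientation, here is a minimal sketch of the general shape such a script can take: list the pdfs in a folder, extract a flat set of fields from each one, and write one csv row per file. it is not the code in parse.js. the names `csvEscape`, `toCsv`, `listPdfs`, `parseAll` and the `extractFields` callback are all hypothetical and exist only for illustration; the actual pdf parsing is deliberately left to the callback, so the sketch uses nothing beyond node's built-in `fs` and `path` modules.

```javascript
const fs = require('fs');
const path = require('path');

// Quote a value for CSV only when needed: wrap it in double quotes and
// double any embedded quotes if it contains a comma, quote, or line break.
function csvEscape(value) {
  const text = String(value);
  return /[",\n\r]/.test(text) ? '"' + text.replace(/"/g, '""') + '"' : text;
}

// Turn an array of row objects into CSV text, header line first.
// Missing fields become empty cells.
function toCsv(rows, columns) {
  const lines = [columns.join(',')];
  for (const row of rows) {
    const cells = columns.map((c) => csvEscape(row[c] === undefined ? '' : row[c]));
    lines.push(cells.join(','));
  }
  return lines.join('\n') + '\n';
}

// List the PDF files directly inside a folder, sorted for stable output.
function listPdfs(dir) {
  return fs.readdirSync(dir)
    .filter((name) => name.toLowerCase().endsWith('.pdf'))
    .sort()
    .map((name) => path.join(dir, name));
}

// Parse every PDF in `dir` one after another and write the results to `outFile`.
// ASSUMPTION: extractFields is a hypothetical callback, (filePath) => Promise<object>,
// that returns the form values of a single PDF keyed by column name.
async function parseAll(dir, columns, extractFields, outFile) {
  const rows = [];
  for (const file of listPdfs(dir)) {
    const fields = await extractFields(file);
    rows.push(Object.assign({ file: path.basename(file) }, fields));
  }
  fs.writeFileSync(outFile, toCsv(rows, ['file'].concat(columns)));
  return rows.length;
}
```

a call could look like `parseAll('pdfs', ['name', 'date'], extractFields, 'output.csv')`, where `extractFields` wraps whichever pdf library is in use and the column names are placeholders. processing the files sequentially keeps memory use flat across a few hundred pdfs, at the cost of some speed.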