This script is used to read in the budget data of the Canton of Bern from PDF.
Requires Python 2+ and pdfminer
Use virtualbox
to create a local env, then
$ pip install -r requirements.txt
Run the script appending the PDF filename
$ python mine.py data/sample.pdf > output/sample.json
Download and compile pdfminer
$ git clone https://github.com/euske/pdfminer.git
Make a symlink in this folder
$ ln -s ../pdfminer/build/lib/pdfminer
Run the script appending the PDF filename
$ python mine.py data/sample.pdf > output/sample.json