This program converts a folder of PDF documents into a dictionary of 100-element arrays containing binary strings, which can be fed into a neural network.
Make sure these are installed from your terminal. You will need the package managers Brew and Pip. Brew installation instruction can be found here: https://brew.sh/ and pip3 will be bundled with the Homebrew python installation.
- brew install python
- brew install pkg-config poppler
- pip3 install glob2
- pip3 install pdftotext
- pip3 install numpy
Run in Atom using the Script plugin.