A script to upload documents from the National Labor Relations Board to Scribd
I will be using some of these resources, I imagine:
The long-term future of this project will be to be a better version of nlrb.gov
I would like to create a front-end that allows you to pull cases by union name, employer name, type (ULP, DFR, election, etc.), legal counsel, etc.
We will need to install mongodb to keep track of the files already uploaded. This will document the process in Ubuntu Server, but you can find more detailed instructions here http://docs.mongodb.org/manual/tutorial/install-mongodb-on-ubuntu/
sudo apt-key adv --keyserver hkp://keyserver.ubuntu.com:80 --recv 7F0CEB10 echo 'deb http://downloads-distro.mongodb.org/repo/ubuntu-upstart dist 10gen' | sudo tee /etc/apt/sources.list.d/10gen.list sudo apt-get update sudo apt-get install mongodb-10gen
Enter mongo shell by typing
pip install pymongo