forked from dbarlett/USAID-DEC
Data from the United States Agency for International Development (USAID) Development Experience Clearinghouse (DEC).
forked from jsfenfen/whatwordwhere
Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.
forked from opensecrets/OCRToolkit
Tools for working with Optical Character Recognition output
This project will liberate data from pdf files found on http://www.cityofjerseycity.com/pub-info.aspx?id=2430 and will create .csv and .json files to be uploaded on https://data.openjerseycity.org/dataset/jersey-city-2013-budget-adopted-spending
This uses regular expressions (in php, but can be any language) get data from the NYC EDC newsletters
(DC team) experimenting with available options for extracting info from PFDs
forked from palamago/pdf-hacks-2014
PDF liberation Hackaton - http://pdfliberation.wordpress.com/
forked from mroswell/pdf-liberation-examples
displaying various pdf liberation tools, at PDF Liberation Hackathon