Data + Narrative 2019 Advanced Liberating Data from Documents: Free and Reliable Methods

This repo contains information used in an advanced class on extracting data from PDFs taught at the Boston University Data + Narrative workshop on 6/4/19.

Our objective was to convert three files from PDFs into machine-readable data using two Python packages, and clean and analyze that data in pandas.

Required Software

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
data		data
pdfs		pdfs
.gitignore		.gitignore
README.md		README.md
pdf-conversion-with-python.ipynb		pdf-conversion-with-python.ipynb