Skip to content

raphaelpalacio/DSFSG_PERMIT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 

Repository files navigation

DSFSG_PERMIT

alt text This GitHub hosts the code for the permit group of the Data Science for Social Good Practicum (Summer 2023) held at Emory University

Tabel of contents

Description

This GitHub repository serves as a resource for managing building permits in the Metro Atlanta area, facilitating the end-to-end process through various functionalities. It begins with a web scraping component that efficiently extracts relevant permit data from local government websites, converting PDF to CSV in order for information to be in a standardized format. The integrated file conversion utilities then transform these data sets into universally accessible formats, ensuring that they can be readily employed across various systems. Additionally, the repository features a robust database management tool that organizes and maintains the permit data, making it easy to access and update. In essence, this GitHub repository presents a holistic solution for the systematic collection, conversion, and management of building permit data in the Metro Atlanta area. This project helps manage the building permits for the city of Atlanta as well as the following counties: Clayton, Cobb, DeKalb, and Gwinnett

Installation

Use the package manager pip to install the required Python Packages and then import the following libraries

pip install pdfplumber
import os
import re
import pdfplumber
import pandas as pd

Methods

File Conversion

Some of the counties in the metro Atlanta Area - such as Gwinnett - upload their building permit data as a PDF and not as a cleaned CSV. Therefore, we needed a pipeline to convert their PDFs to a CSV and then clean that CSV. This way it will be in a rectangular format and allow us to properly query the data into our database

Authors and Acknowledgment

  • Dr. Kevin McAlister
  • Raphael Palacio
  • Eric Xue
  • Feiyu Xiang
  • John Jaquez
  • Latifa Tan

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published