
DavidWellsTheDeveloper/AirQualityAndDiseaseMortalityAnalysisInSpark


Geographical Air Quality Analysis of Disease Prevalence


Analytics Process

Data Selection

Datasets Found:

Respiratory Disease Mortality Rates by County Annual Timestep

Cardiovascular Disease Mortality Rates by County Annual Timestep

Infectious Disease Mortality Rates by County Annual Timestep

Air Quality By County Annual Timestep from EPA AirData

Data Cleaning

The Air Quality datasets were separate zipped CSV files, one for every year between 1980 and 2014. There is a script for extracting all of the air quality data here
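As an illustration of the extraction step, here is a minimal sketch in Python. It is not the repository's own script. It assumes the yearly archives have already been downloaded into one local directory. The function name `extract_all` and the directory names are hypothetical.

```python
import zipfile
from pathlib import Path


def extract_all(zip_dir, out_dir):
    """Extract the CSV members of every .zip archive in zip_dir into out_dir.

    Returns the names of the extracted CSV files, in archive order.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    extracted = []
    # Sort so that yearly archives are processed in a stable order.
    for archive in sorted(Path(zip_dir).glob("*.zip")):
        with zipfile.ZipFile(archive) as zf:
            for name in zf.namelist():
                # Skip anything in the archive that is not a CSV file.
                if name.lower().endswith(".csv"):
                    zf.extract(name, out)
                    extracted.append(name)
    return extracted


if __name__ == "__main__":
    # Hypothetical directory names; adjust to wherever the archives live.
    for csv_name in extract_all("air_quality_zips", "air_quality_csv"):
        print(csv_name)
```

Unpacking every year into a single directory leaves one CSV per year side by side, so later steps can read the whole 1980 to 2014 range from a single path.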

Data Transformation

Analysis

Interpretation and Evaluation
