us-county-disaster-clustering

Clustering Algorithm to group US counties based on how at-risk they are to natural disasters

Part of the Data Mining class at the University of Chicago's Master of Science in Analytics program

Project Intro/Objective

For this project we have clustered US counties based on how at-risk they are to natural disasters. The risk calculation is partially based on FEMA's risk metric, which includes:

Expected Annual Loss: Hazard’s risk component measuring the expected loss of building value, population, and/or agriculture value each year due to natural hazards
Social Vulnerability Index: Consequence enhancing component and analyzes demographic characteristics to measure the susceptibility of social groups to the adverse impacts of natural hazards
Community Resilience: Consequence reduction component and uses demographic characteristics to measure a community’s ability to prepare for, adapt to, withstand, and recover from the effects of natural hazards

Risk = (Expected Annual Loss * Social Vulnerability Index) / Community Resilience

These metrics currently only focus on the tangible costs of natural disasters. However, we know there are intangible costs as well. Indirect effects are the subsequent or secondary results of the initial destruction, like the impact or the lasting effects on mental health. Therefore, our calculation of risk factors has been properly modified to include the risk and effects on mental health that prior environmental disasters have caused on the United States population. With this information, our calculation of risk has been modified accordingly to the below:

Risk = (Expected Annual Loss * Social Vulnerability Index * Mental Health Index) / Community Resilience

Methods Used

Clustering

Technologies

Python

Dataset

The data used in this analysis have originated from multiple national agencies. The primary sources are listed below:

National Risk Index - Federal Emergency Management Agency (FEMA)
- The primary dataset is built and maintained by FEMA in close collaboration with various stakeholders and partners in academia; local, state and federal government; and private industry.
- Link: Data Resources | National Risk Index (fema.gov)
- The dataset was the source of information for social vulnerability index, community resiliency, and expected annual loss data used to calculate “Risk”
FEMA Disaster Declaration Summaries - FEMA
- Link: [Disaster Declarations Summaries - v2 | FEMA.gov] (https://www.fema.gov/openfema-data-page/disaster-declarations-summaries-v2)
Social Vulnerability Index - Agency for Toxic Substances and Disease Registry (ASTDR)
- Link: CDC/ATSDR SVI Data and Documentation Download | Place and Health | ATSDR
- Used to analyze composition of what comprises the social vulnerability index used to assess demographic information that could predict the vulnerability of particular areas to environmental hazards based on demographic / social information
County Health Rankings and Roadmaps (CHR&R) – University of Wisconsin
- The CHR&R program of the University of Wisconsin publishes various measures related to the quality of life for each community including Mental health data.
- The Mental health data is sourced from the Behavioral Risk Factor Surveillance System (BRFSS). BRFSS is a state-based random digit dial (RDD) telephone survey that is conducted annually in all states.
- Link: County Health Rankings & Roadmaps
- Web-scraping: The only source for the Data was the information dynamically loaded onto the webpages using JavaScript. Simple requests of the webpage

Results

County clusters where red indicates the highest risk due to a high enhancing factor
County clusters where red indicates the highest risk due to low reduction factor

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Extra Resources		Extra Resources
Images		Images
data		data
.DS_Store		.DS_Store
README.md		README.md
Scrapping _MentalHealthData.ipynb		Scrapping _MentalHealthData.ipynb
code.zip		code.zip
mental_health_index.py		mental_health_index.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

us-county-disaster-clustering

Project Intro/Objective

Methods Used

Technologies

Dataset

Results

About

Releases

Packages

Languages

shroffp05/us-county-disaster-clustering

Folders and files

Latest commit

History

Repository files navigation

us-county-disaster-clustering

Project Intro/Objective

Methods Used

Technologies

Dataset

Results

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages