
# 📊 README: **Exploratory Data Analysis of Alzheimer’s Disease and Healthy Aging Trends in the U.S.**

**Author:** Jose Pantaleon Hernandez  
**Date:** October 2024  

---

## 🌟 **Project Description**

This project presents an exploratory data analysis (EDA) of Alzheimer’s disease and healthy aging in older Americans, using data compiled by the CDC. The dataset, sourced from the Behavioral Risk Factor Surveillance System (BRFSS), the National Health and Nutrition Examination Survey (NHANES), and the National Health Interview Survey (NHIS), provides insights into the health conditions and behaviors that affect Alzheimer’s and related dementias. The analysis aims to reveal trends in aging-related health metrics across U.S. states, with a focus on identifying geographic disparities and examining demographic variations in Alzheimer’s risk factors. Additional data was accessed through Perplexity’s Search Tool.


The primary objective of this project is to provide a comprehensive view of cognitive health and aging trends to guide public health efforts and inform strategies for promoting healthy aging in diverse communities.

---

## ⭐ **Primary Hypothesis of the Project**

- There is a measurable relationship between declines in key health metrics—cognitive health, mental health, physical activity, and overall health—and an increased risk of Alzheimer's disease. Furthermore, certain age and ethnic groups may display a higher susceptibility to developing Alzheimer’s disease.

## ⭐⭐⭐ **Secondary Hypotheses**
- There is significant variation in health metrics related to Alzheimer's disease and aging across different states, indicating potential geographic disparities in risk factors and health outcomes.


👥 **Demographic Variation**  
- The impact of Alzheimer’s varies significantly by demographic factors, with older adults in lower-income and minority groups displaying elevated risk markers.

📊 **Predictive Health Factors**  
- Key metrics like physical activity, smoking, and alcohol consumption are predictive indicators of cognitive decline and Alzheimer’s risk.

🔍 **Targeted Health Recommendations**  
- Tailored health interventions based on individual risk factors could potentially improve long-term health outcomes in at-risk populations.

🌍 **Geographic Disparities in Health**  
- Certain regions, particularly southern states and Puerto Rico, show disproportionately high metrics in cognitive decline and related health issues, highlighting the need for regionalized health strategies.

---

## 📁 **Project Content**

1. **Data Analyzed**:
   - **Cognitive Health and Aging Trends**: Analyzed from 2011 to 2020, focusing on trends in cognitive health, Alzheimer’s prevalence, and incidence in older adults.
   - **State-Level Cognitive Decline Comparisons**: Examined variations across states to understand regional differences in Alzheimer's and related dementias.
   - **Demographic Segmentation**: Explored factors such as age, gender, income, and education to see how socio-economic conditions influence Alzheimer’s risk.
   - **Behavioral and Health Risk Factors**: Assessed smoking, alcohol use, physical inactivity, and obesity as they relate to Alzheimer’s and cognitive decline risk.

2. **Caregiving and Support**:
   - **Resource Availability and Quality**: Reviewed caregiving resources by state, identifying areas with high or low caregiving infrastructure.
   - **Caregiver Burden Distribution**: Highlighted regions where caregiving responsibilities may be heavier, identifying areas that could benefit from additional support.

3. **Health Comparisons by State**:
   - **Alzheimer’s-Related Health Metrics**: Compared physical activity, nutrition, and overall health to understand their impact on cognitive aging.
   - **Substance Use Trends**: Analyzed smoking and alcohol use patterns across states and their association with cognitive health.
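
The state-level comparisons and year-over-year trends listed above could be computed along the following lines. This is a minimal sketch, not the code from the project's notebooks: it assumes the data has been reshaped into a long-format table, and the column names (`state`, `year`, `topic`, `value`), the helper functions, and the numbers are all hypothetical placeholders.

```python
import pandas as pd

# Hypothetical long-format records. Column names and values are
# placeholders for illustration, not figures from the CDC dataset.
records = pd.DataFrame(
    {
        "state": ["State A", "State A", "State B", "State B", "State C", "State C"],
        "year": [2015, 2016, 2015, 2016, 2015, 2016],
        "topic": ["cognitive_decline"] * 6,
        "value": [10.0, 12.0, 8.0, 9.0, 15.0, None],
    }
)


def state_averages(df: pd.DataFrame, topic: str) -> pd.Series:
    """Mean value per state for one topic, sorted from highest to lowest."""
    subset = df[df["topic"] == topic].dropna(subset=["value"])
    return subset.groupby("state")["value"].mean().sort_values(ascending=False)


def yearly_trend(df: pd.DataFrame, topic: str) -> pd.Series:
    """Mean value per year for one topic, in chronological order."""
    subset = df[df["topic"] == topic].dropna(subset=["value"])
    return subset.groupby("year")["value"].mean().sort_index()


averages = state_averages(records, "cognitive_decline")
trend = yearly_trend(records, "cognitive_decline")
print(averages)
print(trend)
```

Dropping missing values before aggregating keeps states with incomplete reporting in the comparison; whether that is appropriate depends on how the gaps arose in the real data.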

---

## 🛠️ **Methodology**

The project employed EDA techniques to identify significant patterns and trends within the Alzheimer’s-related data. Tools and libraries utilized included:

- **Python**: The primary programming language used for data cleaning and analysis.
- **Visualization Libraries**: A combination of `Matplotlib`, `Seaborn`, `Plotly`, and `Plotly Express` was used to create both static and interactive visualizations. `Matplotlib` and `Seaborn` allowed for clear static visuals of general trends, while `Plotly` facilitated interactive exploration of regional differences in caregiving, cognitive health, and overall health metrics.
- **Geospatial Analysis**: State-level and regional data visualizations highlighted geographic trends in Alzheimer’s prevalence and associated health factors, using geospatial data to locate areas with high caregiving needs and cognitive health challenges.

These tools provided a comprehensive understanding of the dataset, helping to pinpoint essential health trends and regional variations in Alzheimer’s-related indicators.
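
As an illustration of the interactive, state-level mapping described above, the sketch below builds a choropleth with Plotly Express. It is one possible approach under stated assumptions rather than the project's actual plotting code: the `state_code` and `value` columns, both helper functions, and the sample numbers are hypothetical.

```python
import pandas as pd


def prepare_map_frame(df: pd.DataFrame, value_col: str = "value") -> pd.DataFrame:
    """Average the metric per state code so each state maps to one color."""
    return df.groupby("state_code", as_index=False)[value_col].mean()


def build_state_map(map_df: pd.DataFrame, value_col: str = "value"):
    """Return an interactive U.S. choropleth figure for the prepared frame."""
    # Imported here so the data-preparation step runs without Plotly installed.
    import plotly.express as px

    return px.choropleth(
        map_df,
        locations="state_code",        # two-letter postal abbreviations
        locationmode="USA-states",
        color=value_col,
        scope="usa",
        color_continuous_scale="Reds",
        title="Hypothetical metric by state",
    )


# Placeholder numbers only, not results from the analysis.
sample = pd.DataFrame(
    {
        "state_code": ["OH", "OH", "CO", "ME"],
        "value": [1.0, 3.0, 2.0, 4.0],
    }
)
map_frame = prepare_map_frame(sample)
print(map_frame)
```

Calling `build_state_map(map_frame).show()` would open the figure in a browser or notebook. Territories such as Puerto Rico and Guam may not render in the `USA-states` location mode and might need separate handling.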

---

## ✨ **Key Findings**

- **Caregiving Support**: Texas, California, and Tennessee have the strongest caregiving metrics, indicating robust resources, while states like North Dakota and Connecticut reflect lower availability.
- **Cognitive Health Risks**: Puerto Rico and Mississippi exhibit high cognitive decline metrics, suggesting pressing cognitive health challenges in these regions.
- **Mental Health Challenges**: Puerto Rico, Washington, and Alaska show the highest mental health metrics, potentially reflecting greater mental health needs or higher reporting.
- **Physical Activity and Obesity**: Nebraska, Arkansas, and Mississippi indicate high levels of physical inactivity and obesity, implying a need for health programs targeting diet and exercise.
- **Overall Health Disparities**: Puerto Rico, Oklahoma, and Mississippi rank high in health concerns, whereas Hawaii and Nevada show comparatively better health metrics.
- **Smoking and Alcohol Use**: Guam and Minnesota lead in smoking and alcohol usage, indicating elevated lifestyle risk behaviors, while Utah and Puerto Rico show lower levels, suggesting healthier habits.
- **Regional Health Disparities**: Many southern states, particularly Mississippi, Alabama, and Louisiana, display elevated values on adverse health metrics, from cognitive decline to smoking, pointing to pronounced health disparities.
- **Persistent Health Challenges**: States like Maryland and Alabama frequently appear with high values across several metrics, indicating complex health issues requiring multi-faceted solutions.
- **Healthier Lifestyle States**: Utah, Nevada, and Hawaii consistently rank low in unhealthy lifestyle indicators, suggesting better health outcomes.
- **Emerging Concerns in Puerto Rico**: Puerto Rico’s high scores across metrics underscore significant health needs, particularly in mental and cognitive health.
- **Mental Health and Substance Use Link**: States with elevated mental health metrics, such as Alaska and Washington, also report high levels of smoking and alcohol use, suggesting potential correlations.
- **Public Health Focus**: States such as Mississippi, Puerto Rico, and Oklahoma, which rank high across multiple metrics, may benefit most from targeted interventions to address overlapping health challenges.
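
Several findings above refer to states that rank high across multiple metrics. One way to make that idea concrete is to count how often each state falls among the top values per metric. The sketch below assumes a wide table with one row per state and one column per metric; the state labels, metric names, values, and the `top_n_appearances` helper are hypothetical, and the notebooks may use a different method.

```python
import pandas as pd

# Hypothetical wide table: one row per state, one column per metric.
# Higher values are assumed to indicate a greater health concern.
scores = pd.DataFrame(
    {
        "cognitive_decline": [9.0, 4.0, 7.0, 2.0],
        "mental_health": [8.0, 3.0, 9.0, 1.0],
        "smoking": [2.0, 8.0, 7.0, 3.0],
    },
    index=["State A", "State B", "State C", "State D"],
)


def top_n_appearances(df: pd.DataFrame, n: int = 2) -> pd.Series:
    """Count, for each state, how many metrics place it among the n highest values."""
    ranks = df.rank(ascending=False, method="min")  # rank 1 = highest value
    return (ranks <= n).sum(axis=1).sort_values(ascending=False)


counts = top_n_appearances(scores, n=2)
print(counts)
```

A count like this only flags overlap between metrics. It says nothing about why a state scores high, so it is best read as a screening step before closer analysis.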

---

## ✅ **Conclusion**

This analysis highlights critical disparities in health outcomes and resource accessibility across the United States and territories. Regions in the South and Puerto Rico consistently display elevated metrics across caregiving, cognitive health, and general health, underscoring the need for enhanced public health funding and targeted support. States like Utah, Nevada, and Hawaii, by contrast, show healthier averages, indicating stronger health outcomes and lifestyle factors.

Recurring health challenges in states like Maryland, Minnesota, and Alaska point to a need for multifaceted public health strategies that address both prevention and support. These insights could be pivotal in helping policymakers allocate resources effectively to reduce health disparities and improve aging-related health outcomes across diverse populations.

---

**Contact**:  
Jose Pantaleon Hernandez  
✉️ Email: j_pantaleon@hotmail.com  

This README provides an overview of the analysis. For a deeper look at the charts and data, explore the notebooks and visual resources in the repository. 📊





