A Python-based data analysis project exploring weekly death counts in the U.S., categorized by causes such as COVID-19, heart disease, cancer, and more. This project focuses on cleaning, transforming, and analyzing real-world mortality data to reveal meaningful patterns and trends using visual storytelling techniques.
This analysis leverages publicly available datasets to understand mortality trends across the United States, focusing on:
- Cause-specific death rates (COVID-19, heart disease, cancer, etc.)
- State-wise impact and jurisdictional comparisons
- Temporal trends from 2020 to 2021
- Identification of vulnerable populations
Through insightful visualizations and exploratory data analysis (EDA), the project delivers actionable insights into one of the most crucial public health topics.
- Python
- Pandas – Data cleaning & manipulation
- Matplotlib & Seaborn – Data visualization
- NumPy – Numerical operations
- Jupyter Notebook – Interactive analysis
The dataset used contains weekly provisional counts of deaths by select causes and jurisdiction in the U.S.
Source: CDC National Center for Health Statistics
- Stacked Area Charts – To compare death trends over time
- Heatmaps – To analyze intensity by state and cause
- Pie Charts – To show percentage distribution of deaths by cause
- Correlation Matrices – To identify statistical relationships between variables
- COVID-19 mortality spiked significantly during 2020–2021.
- Heart disease remains a consistently leading cause of death.
- Certain states were disproportionately affected across various causes.
- Visualizations help highlight how public health crises evolve over time.
- Advanced data wrangling with Pandas
- Effective visual storytelling techniques
- Handling missing/incomplete data
- Crafting insights from complex real-world datasets
Big thanks to my mentors, peers, and the online data science community for their support and guidance throughout this project.
-
Clone the repository:
git clone https://github.com/your-username/mortality-trends-analysis.git cd mortality-trends-analysis
-
Install required libraries:
pip install -r requirements.txt
-
Launch the Jupyter Notebook:
jupyter notebook
-
Open and run
mortality_trends_analysis.ipynb
.
Feel free to reach out or connect with me on LinkedIn for feedback, collaboration, or just to say hi!
This project is licensed under the MIT License - see the LICENSE file for details.