This project is a data analysis and visualization task using a real-world traffic accidents dataset. It explores patterns, trends, and critical insights to help understand factors that lead to different types of accidents.
The goal of this project is to:
- Perform Exploratory Data Analysis (EDA) on traffic accident data
- Identify key factors that influence accident severity
- Visualize the impact of conditions like weather, lighting, and damage level
- Use statistical plots like KDE plots, bar charts, and correlation heatmaps
File: traffic_accidents.csv
This dataset includes columns such as:
Crash Severity
Weather Condition
Lighting Condition
Damage Level
Crash Type
Speed Limit
, etc.
(Use df.columns
in code to list all columns.)
- Python 3.x
- Pandas β for data cleaning and manipulation
- Matplotlib & Seaborn β for data visualization
- NumPy β numerical operations
- KDE plots to analyze distributions (e.g., crash severity)
- Bar plots to compare categories like damage levels
- Correlation matrix for numerical attributes
- Clone the repository
- Install dependencies:
pip install pandas seaborn matplotlib numpy