# Traffic Accident Analysis
### Author: DIKSHA KUMARI
### Date: 2024-10-07

This notebook analyzes traffic accident data to identify patterns related to road conditions,
weather, and time of day. It also visualizes accident hotspots and contributing factors.

In [None]:
# 1. Load Libraries
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Set Seaborn style (set_theme is the current name; sns.set is an alias for it)
sns.set_theme(style='whitegrid')

In [None]:
# 2. Load Dataset
data = pd.read_csv('traffic_accidents.csv')

# Display the first few rows of the dataset
data.head()

In [None]:
# 3. Data Cleaning
# Check for missing values
print(data.isnull().sum())

# Drop every row that has at least one missing value
data = data.dropna()

# Convert date column to datetime
data['date'] = pd.to_datetime(data['date'])
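
Dropping every incomplete row can discard a large share of the data when a single sparse column is responsible. The sketch below is one way to check that before cleaning. It runs on a small made-up frame rather than `traffic_accidents.csv`, and the `notes` column and the `required` list are assumptions for illustration only.

```python
import pandas as pd

# Hypothetical sample standing in for the real dataset
sample = pd.DataFrame({
    'date': ['2024-01-01 08:15', '2024-01-01 17:40', None, '2024-01-02 23:05'],
    'weather': ['Clear', None, 'Rain', 'Rain'],
    'notes': [None, None, None, 'minor'],  # hypothetical sparse column
})

# Share of missing values per column, highest first
missing_share = sample.isnull().mean().sort_values(ascending=False)
print(missing_share)

# Drop rows only when a column the analysis actually uses is missing
required = ['date', 'weather']  # assumed list of needed columns
cleaned = sample.dropna(subset=required)
print(f'Kept {len(cleaned)} of {len(sample)} rows')
```

With `subset`, a mostly empty column such as `notes` no longer removes otherwise usable rows.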

In [None]:
# 4. Exploratory Data Analysis
# Analyze accidents by time of day
data['hour'] = data['date'].dt.hour
# Reindex so every hour 0-23 appears, even hours with no recorded accidents
accidents_by_hour = data.groupby('hour').size().reindex(range(24), fill_value=0)

# Plot accidents by hour
plt.figure(figsize=(10, 6))
sns.barplot(x=accidents_by_hour.index, y=accidents_by_hour.values)
plt.title('Accidents by Hour of Day')
plt.xlabel('Hour')
plt.ylabel('Number of Accidents')
# Keep horizontal grid lines for reading bar heights
plt.grid(True, axis='y')
plt.show()
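
Hourly totals can hide differences between days of the week. As a possible extension, the sketch below counts accidents per weekday and hour on a small made-up frame. The `weekday` column is an assumption added for illustration and is not part of the original dataset.

```python
import pandas as pd

# Hypothetical timestamps standing in for data['date']
sample = pd.DataFrame({'date': pd.to_datetime([
    '2024-01-01 08:15',  # Monday
    '2024-01-01 08:50',  # Monday
    '2024-01-02 17:40',  # Tuesday
    '2024-01-06 23:05',  # Saturday
])})

sample['hour'] = sample['date'].dt.hour
sample['weekday'] = sample['date'].dt.day_name()  # assumed helper column

# Rows are weekdays, columns are hours, cells are accident counts
by_day_hour = sample.groupby(['weekday', 'hour']).size().unstack(fill_value=0)
print(by_day_hour)
```

The resulting table can be passed to `sns.heatmap` to show weekday and hour patterns in one figure.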

In [None]:
# 5. Identify Patterns
# Analyze by road conditions and weather
road_conditions = data['road_condition'].value_counts()
weather_conditions = data['weather'].value_counts()

# Display road conditions and weather
print('Road Conditions:')
print(road_conditions)
print('\nWeather Conditions:')
print(weather_conditions)
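
The two `value_counts()` calls describe each factor on its own. To see how road condition and weather combine, a cross-tabulation is one option. The sketch below uses a small made-up frame with the same column names; the category values are assumptions.

```python
import pandas as pd

# Hypothetical sample with the same column names as the dataset
sample = pd.DataFrame({
    'road_condition': ['Dry', 'Wet', 'Wet', 'Dry', 'Icy', 'Wet'],
    'weather': ['Clear', 'Rain', 'Rain', 'Clear', 'Snow', 'Clear'],
})

# Accident counts for each road condition / weather pair
counts = pd.crosstab(sample['road_condition'], sample['weather'])
print(counts)

# Share of each weather type within a road condition (rows sum to 1)
shares = pd.crosstab(sample['road_condition'], sample['weather'], normalize='index')
print(shares.round(2))
```

Row-normalized shares make it easier to compare conditions that have very different totals.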

In [None]:
# 6. Visualize Hotspots
plt.figure(figsize=(10, 6))
sns.scatterplot(data=data, x='longitude', y='latitude', hue='weather', alpha=0.5)
plt.title('Accident Hotspots by Weather Conditions')
plt.xlabel('Longitude')
plt.ylabel('Latitude')
plt.legend(title='Weather')
plt.grid(True)
plt.show()
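
A scatter plot shows where accidents happened, but overlapping points make it hard to rank locations. One simple way to turn the coordinates into a ranked list is to round them to a coarse grid and count accidents per cell. The sketch below does this on made-up coordinates; the two-decimal grid size and the `lat_bin`/`lon_bin` column names are assumptions.

```python
import pandas as pd

# Hypothetical coordinates standing in for the latitude/longitude columns
sample = pd.DataFrame({
    'latitude':  [40.7128, 40.7131, 40.7135, 40.7580, 40.7583, 40.6892],
    'longitude': [-74.0060, -74.0062, -74.0058, -73.9855, -73.9861, -74.0445],
})

# Round to 2 decimal places (an assumed grid size, roughly 1 km in latitude)
sample['lat_bin'] = sample['latitude'].round(2)
sample['lon_bin'] = sample['longitude'].round(2)

# Count accidents per grid cell and list the busiest cells first
hotspots = (
    sample.groupby(['lat_bin', 'lon_bin'])
    .size()
    .sort_values(ascending=False)
    .reset_index(name='accidents')
)
print(hotspots)
```

A finer or coarser grid changes which cells stand out, so the rounding precision is worth adjusting to the area covered by the data.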

# Conclusion
This analysis highlights the times of day and the road and weather conditions under which the most traffic accidents were recorded.
These counts are a starting point for further analysis aimed at developing strategies to improve road safety.