# 📊 Phase 2: Traffic Accident Analysis in Turkey (TUIK + EGM)
This notebook visualizes and statistically tests traffic accident trends in Turkey, using data from TUIK (Turkish Statistical Institute, 2022) and EGM (General Directorate of Security, 2023–2024).

In [None]:
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from scipy.stats import ttest_ind

sns.set_theme(style='whitegrid')  # set_theme is the current name; sns.set is an older alias
plt.rcParams['figure.figsize'] = (10, 6)

In [None]:
# 📈 TUIK 2022: Monthly Accidents and Injuries
tuik_monthly_df = pd.DataFrame({
    'Month': ['January', 'February', 'March', 'April', 'May', 'June', 'July', 'August', 'September', 'October', 'November', 'December'],
    'Accidents': [12439, 11418, 13460, 14898, 18664, 17373, 19514, 19699, 19230, 18318, 16599, 15649],
    'Injuries': [18158, 16170, 19206, 20547, 27347, 25354, 30834, 30266, 27912, 26363, 24114, 22425]
})
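# sort=False keeps the rows in calendar order instead of sorting the month names.
sns.lineplot(data=tuik_monthly_df, x='Month', y='Accidents', marker='o', label='Accidents', sort=False)
sns.lineplot(data=tuik_monthly_df, x='Month', y='Injuries', marker='o', label='Injuries', sort=False)
plt.ylabel('Count')  # both series share the y-axis, so use a neutral label
plt.title('[TUIK] Monthly Traffic Accidents and Injuries (2022)')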
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()
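
The two lines in the plot above move together, so a ratio can make the relationship easier to read. The following is a minimal sketch, assuming injuries per accident is an acceptable rough proxy for severity; the variable names are illustrative and the numbers are copied from the TUIK 2022 table. It uses plain Python so the arithmetic stays visible.

```python
# Monthly counts copied from the TUIK 2022 table above.
months = ['January', 'February', 'March', 'April', 'May', 'June',
          'July', 'August', 'September', 'October', 'November', 'December']
accidents = [12439, 11418, 13460, 14898, 18664, 17373,
             19514, 19699, 19230, 18318, 16599, 15649]
injuries = [18158, 16170, 19206, 20547, 27347, 25354,
            30834, 30266, 27912, 26363, 24114, 22425]

# Injuries per accident for each month (assumed severity proxy, not a TUIK metric).
injuries_per_accident = {
    month: round(inj / acc, 3)
    for month, acc, inj in zip(months, accidents, injuries)
}

# Month with the highest ratio.
peak_month = max(injuries_per_accident, key=injuries_per_accident.get)

for month, ratio in injuries_per_accident.items():
    print(f'{month:<10} {ratio:.3f}')
print('Highest injuries-per-accident ratio:', peak_month)
```

The ratio stays within a fairly narrow band across the year, which suggests that the injury curve mostly tracks the accident curve rather than showing a separate seasonal pattern.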

In [None]:
# 🌞 TUIK 2022: Accidents by Daylight Condition
sunlight_df = pd.DataFrame({
    'Daylight_Condition': ['Daytime', 'Night', 'Twilight'],
    'Accidents': [132009, 60415, 4837]
})
sns.barplot(data=sunlight_df, x='Daylight_Condition', y='Accidents')
plt.title('[TUIK] Traffic Accidents by Daylight Condition (2022)')
plt.tight_layout()
plt.show()
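
Raw counts are easier to compare as shares of the total. A minimal sketch, with the counts copied from the cell above and illustrative variable names; note that these shares say nothing about risk per vehicle on the road, since traffic volume by time of day is not part of this dataset.

```python
# Counts copied from the TUIK 2022 daylight table above.
daylight_counts = {'Daytime': 132009, 'Night': 60415, 'Twilight': 4837}

total = sum(daylight_counts.values())

# Percentage share of all recorded accidents per daylight condition.
shares = {cond: 100 * n / total for cond, n in daylight_counts.items()}

print('Total accidents:', total)
for cond, pct in shares.items():
    print(f'{cond:<9} {pct:5.1f}%')
```

The total matches the sum of the twelve monthly accident counts in the TUIK monthly table, which is a useful consistency check between the two tables.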

In [None]:
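# 🧪 Hypothesis Testing on TUIK Data
from scipy.stats import chisquare

day, night = sunlight_df.iloc[0, 1], sunlight_df.iloc[1, 1]
summer = tuik_monthly_df[tuik_monthly_df['Month'].isin(['June', 'July', 'August'])]
winter = tuik_monthly_df[tuik_monthly_df['Month'].isin(['December', 'January', 'February'])]
# Test 1: Welch's t-test on monthly accident counts (only 3 months per group, so low power).
t_stat1, p_val1 = ttest_ind(summer['Accidents'], winter['Accidents'], equal_var=False)
# Test 2: day and night are single aggregate counts. Repeating each value 10 times gives
# zero variance, so a t-test on them is not meaningful. A chi-square goodness-of-fit test
# against an equal split is used instead; it does not account for traffic volume.
chi2_stat2, p_val2 = chisquare([day, night])
print('T-Test 1 (Summer vs Winter Accidents) p-value:', p_val1)
print('Chi-Square Test 2 (Daytime vs Nighttime Accidents) p-value:', p_val2)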
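
To make the summer vs. winter comparison less of a black box, the Welch statistic can be reproduced by hand. This is a sketch under the assumption that the three monthly counts per season are treated as independent samples; `welch_t` is a hypothetical helper written for illustration, not a SciPy function.

```python
import math
import statistics

# Monthly accident counts copied from the TUIK 2022 table above.
summer = [17373, 19514, 19699]  # June, July, August
winter = [15649, 12439, 11418]  # December, January, February


def welch_t(a, b):
    """Return Welch's t statistic and the Welch-Satterthwaite degrees of freedom."""
    # Variance of each group mean: sample variance (n - 1 denominator) divided by n.
    va = statistics.variance(a) / len(a)
    vb = statistics.variance(b) / len(b)
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


t_stat, dof = welch_t(summer, winter)
print(f't = {t_stat:.3f}, df = {dof:.2f}')
```

With only three observations per group the degrees of freedom are small, so the resulting p-value is sensitive to any single month and is best read as indicative.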

In [None]:
# 🚗 EGM May 2023: Top Vehicle Types
vehicle_may_df = pd.DataFrame({
    'Vehicle_Type': ['Car', 'Motorcycle', 'Pickup', 'Truck', 'Bus', 'Tow Truck', 'Minibus', 'Tractor', 'Bicycle', 'Other'],
    'Count': [17085, 6595, 4936, 627, 641, 831, 844, 231, 924, 741]
})
sns.barplot(data=vehicle_may_df, x='Vehicle_Type', y='Count')
plt.title('[EGM May 2023] Top 10 Vehicle Types Involved in Accidents')
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()

In [None]:
# ❄️ EGM February 2024: Top Vehicle Types
vehicle_feb_df = pd.DataFrame({
    'Vehicle_Type': ['Car', 'Motorcycle', 'Pickup', 'Truck', 'Bus', 'Tow Truck', 'Minibus', 'Tractor', 'Bicycle', 'Other'],
    'Count': [13334, 4829, 3724, 422, 489, 620, 642, 163, 572, 519]
})
sns.barplot(data=vehicle_feb_df, x='Vehicle_Type', y='Count')
plt.title('[EGM Feb 2024] Top 10 Vehicle Types Involved in Accidents')
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()

In [None]:
# 🎄 EGM December 2024: Top Vehicle Types
vehicle_dec_df = pd.DataFrame({
    'Vehicle_Type': ['Car', 'Motorcycle', 'Pickup', 'Tow Truck', 'Bus', 'Truck', 'Minibus', 'Bicycle', 'Tractor', 'Other'],
    'Count': [16255, 6175, 4687, 766, 585, 568, 764, 833, 214, 706]
})
sns.barplot(data=vehicle_dec_df, x='Vehicle_Type', y='Count')
plt.title('[EGM Dec 2024] Top 10 Vehicle Types Involved in Accidents')
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()
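
The three EGM charts are easier to compare when the counts are ranked side by side. A minimal sketch, with counts copied from the three cells above and illustrative variable names; the shares are computed over the ten listed categories only, which is an assumption about the denominator.

```python
# Counts copied from the three EGM cells above.
egm_counts = {
    'May 2023': {'Car': 17085, 'Motorcycle': 6595, 'Pickup': 4936, 'Truck': 627,
                 'Bus': 641, 'Tow Truck': 831, 'Minibus': 844, 'Tractor': 231,
                 'Bicycle': 924, 'Other': 741},
    'Feb 2024': {'Car': 13334, 'Motorcycle': 4829, 'Pickup': 3724, 'Truck': 422,
                 'Bus': 489, 'Tow Truck': 620, 'Minibus': 642, 'Tractor': 163,
                 'Bicycle': 572, 'Other': 519},
    'Dec 2024': {'Car': 16255, 'Motorcycle': 6175, 'Pickup': 4687, 'Tow Truck': 766,
                 'Bus': 585, 'Truck': 568, 'Minibus': 764, 'Bicycle': 833,
                 'Tractor': 214, 'Other': 706},
}

top3 = {}
car_share = {}
for month, counts in egm_counts.items():
    total = sum(counts.values())
    # Vehicle types ordered from most to least involved.
    ranked = sorted(counts, key=counts.get, reverse=True)
    top3[month] = ranked[:3]
    # Share of cars among the listed categories, in percent.
    car_share[month] = round(100 * counts['Car'] / total, 1)
    print(f"{month}: top 3 = {top3[month]}, cars = {car_share[month]}%")
```

The same three vehicle types lead in all three months, and cars account for roughly half of the listed involvement each time.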

## ✅ Conclusion
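- TUIK 2022 data show accidents peaking in summer (19,699 in August versus 11,418 in February), and about two-thirds of recorded accidents (132,009 of 197,261) occurring in daytime.
- The summer vs. winter difference is statistically significant at the 5% level under Welch's t-test, although it rests on only three months per group. The daytime vs. night difference is large, but both figures are single annual totals that do not account for traffic volume.
- EGM monthly breakdowns (May 2023, February 2024, December 2024) show cars, followed by motorcycles and pickups, leading accident involvement in every month.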

**All visualizations and analysis for Phase 2 are complete.**