# West Midlands Crime Analysis – Notebook 1

This notebook performs an exploratory analysis of West Midlands Police 
street-level crime data (Aug 2022 – Jul 2025).  

## Notebook Roadmap
1. Data preparation – load and combine the monthly CSV files
2. Data cleaning – handle missing values, remove duplicates, standardise formats
3. Exploratory Data Analysis (EDA) – summary statistics, crime by type
4. Visualisation – time series trends and crime category breakdown
5. Insights & limitations
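
As a rough illustration of what the cleaning step in the roadmap might involve, the sketch below runs the three operations on a tiny made-up frame. The column names (`Month`, `Crime type`) and the helper `clean_crime_data` are assumptions for illustration only; the real column names should be checked against `combined_df.columns` before reusing any of this.

```python
import pandas as pd

def clean_crime_data(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical cleaning pass; column names are assumed, not confirmed."""
    out = df.copy()
    # Standardise formats: snake_case column names, parse the assumed YYYY-MM month
    out.columns = [c.strip().lower().replace(" ", "_") for c in out.columns]
    if "month" in out.columns:
        out["month"] = pd.to_datetime(out["month"], format="%Y-%m", errors="coerce")
    # Remove exact duplicate rows
    out = out.drop_duplicates()
    # Handle missing values: drop rows without a crime type (one possible policy)
    if "crime_type" in out.columns:
        out = out.dropna(subset=["crime_type"])
    return out.reset_index(drop=True)

# Made-up sample data, not taken from the police files
sample = pd.DataFrame({
    "Month": ["2023-01", "2023-01", "2023-02", "2023-02"],
    "Crime type": ["Burglary", "Burglary", None, "Robbery"],
})
cleaned = clean_crime_data(sample)
print(cleaned)
```

Dropping rows with a missing crime type is only one option; depending on the analysis, keeping them and flagging the gap may be preferable.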


In [2]:
import pandas as pd
import glob
import os

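# Path to the raw data folder
data_folder = '../data/raw'
# Collect all CSV files in the folder; sort so the load order is reproducible
csv_files = sorted(glob.glob(os.path.join(data_folder, '*.csv')))
if not csv_files:
    raise FileNotFoundError(f"No CSV files found in {data_folder}")

# Load each file and concatenate them into a single DataFrame
df_list = [pd.read_csv(f) for f in csv_files]
combined_df = pd.concat(df_list, ignore_index=True)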

print(f"Loaded {len(csv_files)} files. Combined shape: {combined_df.shape}")

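# Save the combined DataFrame to the processed data folder
processed_path = '../data/processed/combined.csv'
os.makedirs(os.path.dirname(processed_path), exist_ok=True)  # create the folder if it is missing
combined_df.to_csv(processed_path, index=False)
print(f"Combined CSV saved to {processed_path}")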

Loaded 14 files. Combined shape: (392363, 12)
Combined CSV saved to ../data/processed/combined.csv
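
Because the number of files loaded determines which months the analysis actually covers, it can help to check the combined data against the intended date range. The following is a minimal sketch, assuming the data has a month column holding `YYYY-MM` strings; the helper `missing_months` and the sample values are hypothetical.

```python
import pandas as pd

def missing_months(months: pd.Series, start: str, end: str) -> list:
    """Return the YYYY-MM labels between start and end that never appear in `months`."""
    expected = pd.period_range(start=start, end=end, freq="M")
    present = set(pd.PeriodIndex(months.dropna().unique(), freq="M"))
    return [str(p) for p in expected if p not in present]

# Made-up month labels for demonstration only
sample_months = pd.Series(["2022-08", "2022-09", "2022-11", "2022-11"])
gaps = missing_months(sample_months, "2022-08", "2022-12")
print(gaps)
```

On the notebook's data this could be called as `missing_months(combined_df["Month"], "2022-08", "2025-07")`, assuming the column is named `Month`.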
