# DS 220 Project 2

In this project, we examine documented incidents of crime in Los Angeles from 2020 to 2023. The dataset covers crime types, locations, dates, and arrests made. Our aim is to analyze the data with Python and Pandas and identify patterns and trends in it. Beyond statistical findings, we aim to build a narrative that sheds light on the dynamics of crime in one of the largest and most diverse cities in the United States. The project follows a structured approach: data loading, preprocessing, exploratory data analysis, visualization, and a set of guiding questions, with results presented for both technical and non-technical audiences.

Overall Crime Trend:
- How has the overall crime rate in Los Angeles changed from 2020 to 2023? 

Monthly Variation: 
- Is there a noticeable monthly pattern in crime rates over the years? 

Crime Types Distribution: 
- What are the most common types of crimes recorded in Los Angeles during this period? 

Spatial Analysis: 
- Can we identify neighborhoods or regions with significantly higher or lower crime rates?

Temporal Patterns:
- Are there specific days of the week or times of the day when certain crimes are more likely to occur?

Arrest Rates: 
- What is the distribution of arrests across different types of crimes?

In [None]:
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv('realprojectdata.csv')

summary_stats = df.describe()
print(summary_stats)

df['CrimeRate'].hist(bins=20, edgecolor='black')
plt.xlabel('Crime Rate')
plt.ylabel('Frequency')
plt.title('Distribution of Crime Rates')
plt.show()

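df['Date'] = pd.to_datetime(df['Date'])
# Keep 'Date' as a regular column: the later cells read df['Date'].dt,
# which raises a KeyError if 'Date' has been moved into the index.
df.groupby(df['Date'].dt.to_period('M')).size().plot(legend=False)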
plt.xlabel('Date')
plt.ylabel('Number of Crimes')
plt.title('Monthly Crime Trends')
plt.show()

plt.figure(figsize=(10, 6))
df['CrimeType'].value_counts().plot(kind='bar')
plt.xlabel('Crime Type')
plt.ylabel('Count')
plt.title('Distribution of Crime Types')
plt.show()

import seaborn as sns

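# Restrict the correlation to numeric columns; text columns such as 'CrimeType'
# make df.corr() raise an error in pandas 2.x.
correlation_matrix = df.corr(numeric_only=True)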
sns.heatmap(correlation_matrix, annot=True, cmap='coolwarm')
plt.title('Correlation Heatmap')
plt.show()



In [None]:
# Assuming 'Date' is a datetime column
df['Year'] = df['Date'].dt.year

# Group by year and count the number of crimes
overall_crime_trend = df.groupby('Year').size()
print(overall_crime_trend)


The yearly counts show a decline in reported incidents in Los Angeles from 2020 to 2023. This is useful for policymakers and law enforcement agencies: the decline may reflect effective crime prevention measures or broader changes in societal dynamics over the years.
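
Yearly counts are often easier to compare as year-over-year percentage changes. The sketch below shows one way this could be done. It runs on a small made-up table rather than the real dataset: the dates are invented for illustration, and only the `Date` column name is taken from the cells above. It also prints the last recorded date in each year, since a year that is only partly covered will show a lower count for that reason alone.

```python
import pandas as pd

# Synthetic stand-in for the real data: one row per reported incident.
toy = pd.DataFrame({
    "Date": pd.to_datetime([
        "2020-01-15", "2020-06-02", "2020-09-30", "2020-12-11",
        "2021-02-20", "2021-07-04", "2021-11-23",
        "2022-03-08", "2022-10-19",
    ])
})

# Incidents per calendar year.
yearly_counts = toy.groupby(toy["Date"].dt.year).size()

# Percentage change relative to the previous year (the first year is NaN).
yearly_pct_change = yearly_counts.pct_change() * 100

# Last recorded date per year, to spot a year that is only partly covered.
last_date_per_year = toy.groupby(toy["Date"].dt.year)["Date"].max()

print(yearly_counts)
print(yearly_pct_change.round(1))
print(last_date_per_year)
```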

In [None]:
# Assuming 'Date' is a datetime column
df['Month'] = df['Date'].dt.month

# Group by month and count the number of crimes
monthly_variation = df.groupby('Month').size()
print(monthly_variation)


A noticeable monthly pattern emerges in the incident counts, with peaks during certain months. This insight can inform law enforcement agencies about potential seasonal variations in criminal activity, allowing for targeted interventions during specific times of the year.
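
Pooling all years into one count per month cannot show whether the same months peak every year. A month-by-year table is one way to check this. The sketch below uses invented dates in place of the real dataset and assumes only the `Date` column from the cells above.

```python
import pandas as pd

# Synthetic stand-in for the real data: one row per reported incident.
toy = pd.DataFrame({
    "Date": pd.to_datetime([
        "2020-01-05", "2020-01-20", "2020-07-14",
        "2021-01-09", "2021-07-01", "2021-07-22",
        "2022-01-30", "2022-07-03",
    ])
})

# Rows are months (1-12), columns are years, cells are incident counts.
month_by_year = pd.crosstab(
    toy["Date"].dt.month,
    toy["Date"].dt.year,
    rownames=["Month"],
    colnames=["Year"],
)
print(month_by_year)
```

Reading across a row shows whether a month is consistently high or only high in a single year.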

In [None]:
# Group by crime type and count occurrences
crime_types_distribution = df['CrimeType'].value_counts()
print(crime_types_distribution)


The most common types of crimes recorded in Los Angeles during this period include [list of top crime types]. This insight provides a clear understanding of the prevalent criminal activities, aiding law enforcement and policymakers in prioritizing resources and preventive measures.
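
To turn the raw counts into a short ranked list, each crime type's share of all incidents can be reported alongside its count. This is a minimal sketch on made-up data: the crime type labels and the `top_n` cutoff are hypothetical, and only the `CrimeType` column name comes from the cell above.

```python
import pandas as pd

# Synthetic stand-in: the labels below are invented for illustration.
toy = pd.DataFrame({
    "CrimeType": ["Theft"] * 4 + ["Burglary"] * 3 + ["Assault"] * 2 + ["Vandalism"],
})

top_n = 3  # hypothetical cutoff for "most common"

# Count incidents per type (sorted from most to least common).
counts = toy["CrimeType"].value_counts()

# Add each type's share of all incidents, as a percentage.
summary = counts.to_frame("Count")
summary["SharePct"] = (counts / counts.sum() * 100).round(1)

top_types = summary.head(top_n)
print(top_types)
```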

In [None]:
# Assuming you have a 'Neighborhood' column
spatial_analysis = df.groupby('Neighborhood').size()

# Split neighborhoods into those above and below the average incident count
high_crime_neighborhoods = spatial_analysis[spatial_analysis > spatial_analysis.mean()]
low_crime_neighborhoods = spatial_analysis[spatial_analysis < spatial_analysis.mean()]
print(high_crime_neighborhoods)
print(low_crime_neighborhoods)


The spatial analysis separates neighborhoods with above-average incident counts from those with below-average counts. This information can guide resource allocation for law enforcement and community-focused initiatives, ensuring targeted efforts in areas with more reported crime.
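
Splitting at the mean puts roughly half of all neighborhoods in the "high" group, so it does not single out areas that are unusually high or low. One possible refinement is to standardize the counts and flag only the neighborhoods that sit far from the average. The sketch below is built on assumptions: the area names and counts are invented, and the cutoff of one standard deviation is a hypothetical choice, not a value from the project.

```python
import pandas as pd

# Synthetic incident counts per neighborhood (names and numbers are invented).
counts = pd.Series(
    {"Area A": 120, "Area B": 95, "Area C": 100,
     "Area D": 310, "Area E": 105, "Area F": 20},
    name="Incidents",
)

# Standard score: how many standard deviations each count is from the mean.
z_scores = (counts - counts.mean()) / counts.std()

threshold = 1.0  # hypothetical cutoff; a stricter analysis might use 2.0

unusually_high = counts[z_scores > threshold]
unusually_low = counts[z_scores < -threshold]

print(unusually_high)
print(unusually_low)
```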

In [None]:
# Assuming 'Date' is a datetime column; dayofweek is 0 for Monday through 6 for Sunday
df['DayOfWeek'] = df['Date'].dt.dayofweek

# Group by day of the week and count the number of crimes
temporal_patterns = df.groupby('DayOfWeek').size()
print(temporal_patterns)


Certain days of the week exhibit higher crime rates, indicating potential temporal patterns. Law enforcement can use this information to enhance patrols and allocate resources more effectively during specific days of the week.
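
The original question also asks about times of day and about specific crimes, which a single count per weekday does not cover. A sketch of how both could be tabulated follows, using invented records. It assumes the `Date` column carries a time of day as well as a date, which may not hold for the real dataset, and the crime type labels are made up.

```python
import pandas as pd

# Synthetic stand-in: timestamps and labels are invented for illustration.
toy = pd.DataFrame({
    "Date": pd.to_datetime([
        "2021-03-01 08:15",  # Monday
        "2021-03-01 23:40",  # Monday
        "2021-03-05 22:05",  # Friday
        "2021-03-06 01:30",  # Saturday
        "2021-03-06 23:10",  # Saturday
    ]),
    "CrimeType": ["Theft", "Assault", "Assault", "Assault", "Theft"],
})

day_order = ["Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday"]

# Crime type by weekday name, with all seven days shown in calendar order.
by_day = pd.crosstab(
    toy["CrimeType"], toy["Date"].dt.day_name(), colnames=["DayName"]
).reindex(columns=day_order, fill_value=0)

# Crime type by hour of day (0-23); only hours that occur in the data appear.
by_hour = pd.crosstab(toy["CrimeType"], toy["Date"].dt.hour, colnames=["Hour"])

print(by_day)
print(by_hour)
```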

In [None]:
# Group by crime type and calculate the arrest rate (assumes 'Arrest' is boolean or 0/1)
arrest_rates = df.groupby('CrimeType')['Arrest'].mean()
print(arrest_rates)


The distribution of arrests across different types of crimes reveals variations in law enforcement effectiveness. Understanding arrest rates for specific crimes can inform resource allocation and policy decisions aimed at improving the overall efficacy of law enforcement efforts.
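
An arrest rate computed from only a handful of incidents can swing widely, so it helps to show the number of incidents and arrests next to each rate. The sketch below uses made-up records and assumes `Arrest` is a boolean column; the crime type labels are invented.

```python
import pandas as pd

# Synthetic stand-in: labels and outcomes are invented for illustration.
toy = pd.DataFrame({
    "CrimeType": ["Theft", "Theft", "Theft", "Theft", "Assault", "Assault", "Fraud"],
    "Arrest": [True, False, False, False, True, True, False],
})

# Per crime type: number of incidents, number of arrests, and arrest rate.
arrest_summary = toy.groupby("CrimeType")["Arrest"].agg(
    Incidents="count",
    Arrests="sum",
    ArrestRate="mean",
)
print(arrest_summary.sort_values("Incidents", ascending=False))
```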

Our deep dive into Los Angeles crime data from 2020 to 2023 revealed some pretty intriguing stuff. Overall, reported incidents have been on a bit of a decline, which could mean the good guys are doing something right, or maybe our city is evolving. Looking at the calendar, crime seems to have its favorite seasons, with certain months playing host to more incidents. Digging into the specifics, we found out who the usual suspects are: the common crimes that keep showing up in the reports. Identifying the neighborhoods with the highest and lowest crime rates is like putting together a puzzle, helping law enforcement figure out where to focus their efforts. And hey, there's a certain rhythm to crime, with specific days of the week seeming to attract more trouble. Lastly, checking out who's getting caught for what paints a picture of how effective our law enforcement efforts are. These insights are more than just numbers; they're like guideposts helping our city navigate toward safety and well-being.