# Project Proposal and MVP

## Predicting non-NYPD Subject Injuries during NYPD Force Incidents
***
### Question/Need

My client is the New York City Police Department (NYPD). In this scenario, the department wants to improve its relationship with the communities it serves. One way it seeks to do this is by reducing the injury rate among non-NYPD subjects involved in force incidents, which was approximately one in three during 2020 and 2021.

To that end, I propose building a classification model that predicts whether a non-NYPD subject will be injured during a force incident involving one or more NYPD members of service, and that identifies the factors most useful in making these predictions. The NYPD can use this information to modify its training and use-of-force policy in order to decrease the percentage of non-NYPD subjects who are injured during force incidents.
***
### Impact Hypothesis

If the NYPD can predict whether a non-NYPD subject will be injured based on aspects of a force incident and the people involved, such as the type of force used by one or more members of service and the type of force (if any) used by the subject, it can identify which aspects contribute most to the likelihood of injury. It can then change its training and use-of-force policy accordingly to reduce the rate of non-NYPD subject injuries.
***
### Data

- <a href="https://data.cityofnewyork.us/Public-Safety/NYPD-Use-of-Force-Incidents/f4tj-796d">NYPD Use of Force Incidents</a> - This dataset contains information about use of force incidents involving NYPD members of service and non-NYPD subjects between January 2020 and December 2021.
- <a href="https://data.cityofnewyork.us/Public-Safety/NYPD-Use-of-Force-Subjects/dufe-vxb7">NYPD Use of Force: Subjects</a> - This dataset contains information about non-NYPD subjects involved in use of force incidents between January 2020 and December 2021.
- <a href="https://data.cityofnewyork.us/Public-Safety/NYPD-Use-of-Force-Members-of-Service/v5jd-6wqn">NYPD Use of Force: Members of Service</a> - This dataset contains information about NYPD members of service involved in use of force incidents between January 2020 and December 2021.
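
To model injuries at the subject level, the three tables need to be combined into one row per subject. The sketch below shows one way this could be done with pandas on tiny in-memory stand-ins. Every column name here (`incident_id`, `subject_injured`, `member_injured` and so on) is a hypothetical placeholder rather than the datasets' actual schema, and the values are made up.

```python
import pandas as pd

# Tiny in-memory stand-ins for the three tables. All column names and values
# are hypothetical placeholders, not the datasets' actual schema or contents.
incidents = pd.DataFrame({
    "incident_id": [1, 2, 3],
    "force_type": ["Physical Force", "Electrical Weapon", "Firearm"],
})
subjects = pd.DataFrame({
    "incident_id": [1, 1, 2, 3],
    "subject_injured": [True, False, False, True],
    "subject_used_force": [True, False, False, True],
})
members = pd.DataFrame({
    "incident_id": [1, 2, 2, 3],
    "member_injured": [False, False, True, False],
})

# Collapse members of service to one row per incident so the join below
# does not duplicate subject rows.
members_per_incident = (
    members.groupby("incident_id")
    .agg(
        n_members=("member_injured", "size"),
        any_member_injured=("member_injured", "any"),
    )
    .reset_index()
)

# One row per subject, enriched with incident-level and member-level context.
# validate="many_to_one" raises if the right-hand key is not unique.
model_table = (
    subjects
    .merge(incidents, on="incident_id", how="left", validate="many_to_one")
    .merge(members_per_incident, on="incident_id", how="left", validate="many_to_one")
)

injury_rate = model_table["subject_injured"].mean()
print(model_table)
print(f"Subject injury rate: {injury_rate:.3f}")
```

Aggregating the members table before joining keeps the row count equal to the number of subjects, which matters because the injury rate is computed per subject.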
***
### Tools

- NumPy, Pandas, Seaborn and Matplotlib for data cleaning and EDA
- Scikit-learn for modeling
- Tableau and/or Seaborn/Matplotlib for data visualizations
- SQL for storage
***
### MVP

As a baseline, I built a logistic regression model with one binary feature: whether or not the non-NYPD subject used force during the force incident. Since false negatives (predicting that a non-NYPD subject involved in a force incident will not be injured when in fact they are) are more costly for the NYPD than false positives, I chose recall as my evaluation metric. The mean cross-validated recall for this baseline model was 0.000, meaning it correctly flagged essentially none of the subjects who were injured.
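
One possible reading of the 0.000 recall, offered as an assumption rather than a confirmed diagnosis: with a single binary feature, a logistic regression can only output two distinct probabilities, and if the injury rate is below 50% in both groups, the default 0.5 threshold labels every subject "not injured". The sketch below reproduces that behavior on synthetic data. The column names, group sizes and injury rates are made up for illustration and are not taken from the NYPD data.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Synthetic data (not the NYPD data): 1,000 subjects who used force, 40% of
# them injured, and 2,000 who did not, 30% of them injured. Both rates are
# assumptions chosen so the overall injury rate is about one third.
used_force = np.repeat([1, 0], [1000, 2000])
injured = np.concatenate([
    np.repeat([1, 0], [400, 600]),    # subjects who used force
    np.repeat([1, 0], [600, 1400]),   # subjects who did not
])
X = pd.DataFrame({"subject_used_force": used_force})
y = pd.Series(injured, name="subject_injured")

# Shuffle inside the splitter because the synthetic rows are sorted by group.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
baseline = LogisticRegression()
recall_scores = cross_val_score(baseline, X, y, cv=cv, scoring="recall")

# Fit once on all rows to inspect the two probabilities the model can output.
baseline.fit(X, y)
both_groups = pd.DataFrame({"subject_used_force": [0, 1]})
group_probabilities = baseline.predict_proba(both_groups)[:, 1]

print("Mean recall:", recall_scores.mean())
print("P(injured) for [no force, force]:", group_probabilities.round(3))
```

If this is what happened in the real baseline, lowering the decision threshold or reweighting the classes would be natural things to try, though neither is part of the MVP described here.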

After experimenting with different combinations of 11 features, I found that the model with the highest mean cross-validated recall used three features:
- the type of force used by the NYPD member(s) of service
- the type of force (if any) used by the non-NYPD subject
- the subject's race

The mean recall score was 0.402.
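
A minimal sketch of how a three-feature model like this could be set up with scikit-learn is shown below. It one-hot encodes each categorical feature, drops an explicit reference category per feature, and scores a logistic regression with cross-validated recall. The data are synthetic, the column names and the `Group A`/`Group B` labels are hypothetical, and the injury probabilities are invented, so the printed recall has no relation to the 0.402 reported above.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder

rng = np.random.default_rng(0)
n = 3000

# Synthetic stand-in data; column names and category labels are hypothetical.
X = pd.DataFrame({
    "member_force_type": rng.choice(
        ["Electrical Weapon", "Physical Force", "Firearm"], size=n
    ),
    "subject_force_type": rng.choice(
        ["No Force", "Physical Force", "Impact Weapon"], size=n
    ),
    "subject_race": rng.choice(["Other/Unknown", "Group A", "Group B"], size=n),
})

# Made-up injury probabilities that depend only on the member's force type.
p_injured = X["member_force_type"].map(
    {"Electrical Weapon": 0.2, "Physical Force": 0.35, "Firearm": 0.8}
).to_numpy()
y = pd.Series((rng.random(n) < p_injured).astype(int), name="subject_injured")

# Drop one reference category per feature, listed in column order, so each
# coefficient is measured relative to that reference.
encoder = OneHotEncoder(drop=["Electrical Weapon", "No Force", "Other/Unknown"])
model = make_pipeline(encoder, LogisticRegression(max_iter=1000))

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
recall_scores = cross_val_score(model, X, y, cv=cv, scoring="recall")

print("Recall per fold:", recall_scores.round(3))
print("Mean recall:", recall_scores.mean().round(3))
```

Putting the encoder inside the pipeline means it is refit on each training fold, so no information from the held-out fold leaks into the encoding.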

Below is a chart of the importance level for each non-reference category in the three features:

![mvp.png](attachment:mvp.png)

Some interpretations of the importance levels of the non-reference categories of the features:

- Non-NYPD subjects who experienced force in the form of a police canine or firearm are more likely to have gotten injured than those who experienced force in the form of an electrical weapon (i.e., taser)
- Non-NYPD subjects who experienced force in the form of an impact weapon, OC spray, restraining mesh blanket or physical force are more likely to have gotten injured than those who experienced force in the form of an electrical weapon
- Non-NYPD subjects who exerted force in the form of an impact weapon, physical force or firearm on NYPD members of service are more likely to have gotten injured than those who exerted no force
- Non-NYPD subjects who exerted force in the form of a displayed weapon or cutting instrument on NYPD members of service are more likely to have gotten injured than those who exerted no force
- Non-NYPD subjects of any race other than Other/Unknown are more likely to have gotten injured than those categorized as Other/Unknown