# Analysis of crime trends in New York City - "Is the next crime predictable?"

## Problem statement

We are going to study the patterns of crime incidents in New York City and build a set of models to predict whether a crime is likely to happen at a given time and location.

## Audience

* Law enforcement agencies
* Media outlets
* General population

## Data

NYC OpenData hosts a dataset of felony, misdemeanor, and violation crime incidents, provided by the New York Police Department in 2016:

* https://data.cityofnewyork.us/Public-Safety/NYPD-Complaint-Data-Current-YTD/5uac-w243 (03/26/2017)

This dataset holds complaint incidents with date, time, police department, description, law category, borough, and geographical information (latitude, longitude). It does not include street name, street number, zip code, or census tract.
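As a rough illustration of how these fields might be read, the sketch below parses a few rows with the standard library. The column names (`CMPLNT_FR_DT`, `CMPLNT_FR_TM`, `LAW_CAT_CD`, `BORO_NM`, `Latitude`, `Longitude`) are assumptions about the CSV export, and the sample rows are made up for the example.

```python
import csv
import io
from datetime import datetime

# Made-up sample rows; the column names are assumed, not confirmed by this README.
SAMPLE_CSV = """CMPLNT_FR_DT,CMPLNT_FR_TM,LAW_CAT_CD,BORO_NM,Latitude,Longitude
03/14/2016,23:15:00,FELONY,BROOKLYN,40.6782,-73.9442
07/04/2016,01:30:00,MISDEMEANOR,MANHATTAN,40.7831,-73.9712
11/20/2016,,VIOLATION,QUEENS,,
"""


def parse_complaints(lines):
    """Yield one dict per usable row, skipping rows without a time or coordinates."""
    for row in csv.DictReader(lines):
        try:
            when = datetime.strptime(
                f"{row['CMPLNT_FR_DT']} {row['CMPLNT_FR_TM']}",
                "%m/%d/%Y %H:%M:%S",
            )
            lat = float(row["Latitude"])
            lon = float(row["Longitude"])
        except ValueError:
            continue  # missing or malformed date, time, or coordinates
        yield {
            "when": when,
            "category": row["LAW_CAT_CD"],
            "borough": row["BORO_NM"],
            "lat": lat,
            "lon": lon,
        }


complaints = list(parse_complaints(io.StringIO(SAMPLE_CSV)))
```

Rows that cannot be placed in time and space are dropped here because both are needed for the prediction task; a real pipeline might prefer to keep and flag them.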

To augment the data, using each incident's (latitude, longitude) as the lookup key, the following datasets will be used:

* OpenAddresses: http://results.openaddresses.io/ (03/26/2017), which includes street-level data
* 2000 Census Tracts: https://data.cityofnewyork.us/City-Government/2000-Census-Tracts/ysjj-vb9j (03/26/2017), which includes boroughs and census tracts
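One way to attach a census tract to an incident is a point-in-polygon lookup on its coordinates. The following is a minimal sketch using the ray-casting test; the tract identifiers and rectangular boundaries are hypothetical stand-ins for the real tract geometries, and the linear scan is only meant to show the idea, not to be efficient.

```python
def point_in_polygon(lon, lat, polygon):
    """Ray-casting test; polygon is a list of (lon, lat) vertices."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge straddle the horizontal line through the point?
        if (y1 > lat) != (y2 > lat):
            # Longitude at which the edge crosses that line
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


# Hypothetical tracts, drawn as simple rectangles for illustration only.
TRACTS = {
    "tract_A": [(-74.00, 40.70), (-73.95, 40.70), (-73.95, 40.75), (-74.00, 40.75)],
    "tract_B": [(-73.95, 40.70), (-73.90, 40.70), (-73.90, 40.75), (-73.95, 40.75)],
}


def find_tract(lat, lon, tracts=TRACTS):
    """Return the id of the first tract containing the point, or None."""
    for tract_id, polygon in tracts.items():
        if point_in_polygon(lon, lat, polygon):
            return tract_id
    return None
```

For example, `find_tract(40.72, -73.97)` falls inside the first rectangle, while a point outside both rectangles yields `None`.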

An initial processing of the data can be found [here](data/complaints-data.ipynb). A visualization of the results can be found [here](maps.ipynb).

## Approach

* Determine which variables correlate most strongly with crime incidents
    * Visualize incidents on a map for different choices of variables and filters
    * Heatmaps and kernel density estimate plots to correlate variables
    * Feature selection
* Time series analysis on variables
    * Identify seasonal effects
    * Detect anomalies
* Prediction models
    * Regression (to determine the evolution of patterns over time)
    * Clustering (to determine common patterns across census tracts, boroughs, etc.)
    * Classification (to predict whether a given crime is likely to happen at a given time and location)
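The time series step above could start from something as simple as the sketch below: count incidents per day, average them by weekday to expose weekly seasonality, and flag days whose count sits far from the mean. The z-score rule, the threshold of 3, and the generated incident dates are all assumptions made for the example, not choices taken from the notebooks.

```python
from collections import Counter
from datetime import date, timedelta
from statistics import mean, pstdev


def daily_counts(incident_dates):
    """Count incidents per calendar day."""
    return Counter(incident_dates)


def weekday_profile(counts):
    """Average incidents per weekday (0 = Monday), a crude view of weekly seasonality."""
    by_weekday = {}
    for day, n in counts.items():
        by_weekday.setdefault(day.weekday(), []).append(n)
    return {wd: mean(ns) for wd, ns in sorted(by_weekday.items())}


def anomalies(counts, threshold=3.0):
    """Days whose count is more than `threshold` standard deviations from the mean."""
    values = list(counts.values())
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return []
    return sorted(d for d, n in counts.items() if abs(n - mu) / sigma > threshold)


# Made-up data: 10 incidents per day for four weeks, with a spike on one Saturday.
start = date(2016, 1, 4)  # a Monday
incidents = []
for offset in range(28):
    day = start + timedelta(days=offset)
    incidents.extend([day] * (40 if offset == 12 else 10))

counts = daily_counts(incidents)
profile = weekday_profile(counts)
outliers = anomalies(counts)
```

A global z-score ignores the seasonal pattern itself, so a more careful version would likely compare each day against its own weekday or monthly baseline before flagging it.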

## Deliverables

A notebook including:
* Analysis of crime patterns
* Models for predicting crime