# More Crimes are Reported on Weekdays

This submission examines the Seattle incident dataset for summer 2014.
As a first step, it is usually a good idea to inspect the raw data.

In [None]:
# import the necessary tools
import numpy as np
import pandas as pd
from bokeh.plotting import figure, show, output_notebook
from bokeh.charts import Bar, Histogram
import datetime

In [None]:
# use pandas `read_csv` function to load the csv into a DataFrame
crime = pd.read_csv("seattle_incidents_summer_2014.csv")
# just ignore the warning for now

In [None]:
# the first five rows of the DataFrame
crime.head(5)

## Dates

There are three columns with dates: `Date Reported`, `Occurred Date or Date Range Start` and `Occurred Date Range End`. For the sake of simplicity, the `Occurred Date Range End` column will be ignored for now, so every event is treated as a single point in time.

Next, we will look at the weekdays of `Date Reported` and `Occurred Date or Date Range Start`.


In [None]:
# extract the column, parse each date string into a datetime.datetime object and get its weekday
# (%I reads the hour on a 12-hour clock, so the AM/PM marker %p is taken into account)
day_reported = [
    datetime.datetime.strptime(date, '%m/%d/%Y %I:%M:%S %p').weekday()
    for date in crime["Date Reported"]
]
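
As a side note on the parsing step: Python's `strptime` only applies the `%p` (AM/PM) marker when the hour is read with `%I`; with `%H` the marker is ignored. The weekday is unaffected either way, but the hour is not. The sketch below illustrates this with the standard library only. The helper name `weekday_of` and the sample timestamps are hypothetical and are not rows from the dataset; the timestamp layout is assumed to match the one used in this notebook.

```python
from datetime import datetime

# Assumed timestamp layout: month/day/year, 12-hour clock, AM/PM marker
TIMESTAMP_FORMAT = "%m/%d/%Y %I:%M:%S %p"


def weekday_of(timestamp):
    """Return the weekday (Monday = 0, Sunday = 6) of a timestamp string."""
    return datetime.strptime(timestamp, TIMESTAMP_FORMAT).weekday()


# Hypothetical sample values, not taken from the dataset
sample = [
    "06/28/2014 10:31:00 PM",
    "06/30/2014 09:15:00 AM",
    "07/04/2014 12:05:00 AM",
]
sample_weekdays = [weekday_of(ts) for ts in sample]

# With %I the AM/PM marker is honoured: 10:31 PM is parsed as hour 22
hour_12h = datetime.strptime(sample[0], TIMESTAMP_FORMAT).hour
# With %H the marker is ignored: the same string is parsed as hour 10
hour_24h = datetime.strptime(sample[0], "%m/%d/%Y %H:%M:%S %p").hour

print(sample_weekdays, hour_12h, hour_24h)
```

The distinction only matters once the time of day is analysed, for example when grouping incidents by hour.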


In [None]:

df = pd.DataFrame({'reported': pd.Series(day_reported)})
plot = Bar(df, 'reported', values='reported', agg='count', title='crimes reported by weekday (Monday = 0, Sunday = 6)', xlabel='weekday', ylabel='number of crimes reported')
output_notebook()
show(plot)
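
The numbers behind such a bar chart can also be tabulated directly, which makes the numeric weekday axis easier to read. This is a minimal sketch using only the standard library. The function `count_by_weekday` and the toy input are hypothetical; in the notebook, the list of weekday indices computed earlier would take the place of `toy_weekdays`.

```python
import calendar
from collections import Counter


def count_by_weekday(weekdays):
    """Return (day name, count) pairs ordered from Monday to Sunday."""
    counts = Counter(weekdays)
    # Days that never occur are reported with a count of zero
    return [(calendar.day_name[day], counts.get(day, 0)) for day in range(7)]


# Hypothetical weekday indices (Monday = 0, Sunday = 6), not real data
toy_weekdays = [0, 0, 1, 4, 4, 4, 6]
table = count_by_weekday(toy_weekdays)

for name, count in table:
    print(f"{name:<9} {count}")
```

Note that `calendar.day_name` follows the current locale, so the day names may not be in English on every system.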

It looks as if fewer crimes are reported on weekends. Why could that be? Do criminals take weekends off and commit more crimes during the week? Or do people prefer to enjoy the weekend and wait until Monday to report a crime? Who knows. In any case, more analysis would be needed to determine whether the difference between weekdays and weekends is statistically significant, but that is outside the scope of this assignment.
Instead, let's do the same for the `Occurred Date or Date Range Start` column.

In [None]:
occured = crime["Occurred Date or Date Range Start"]
# parse each date string and get its weekday (%I so that the AM/PM marker %p is taken into account)
occured_day = [
    datetime.datetime.strptime(date, '%m/%d/%Y %I:%M:%S %p').weekday()
    for date in occured
]
df = pd.DataFrame({'occured': pd.Series(occured_day)})
plot = Bar(df, 'occured', values='occured', agg='count', title='crimes committed by weekday (Monday = 0, Sunday = 6)', xlabel='weekday', ylabel='number of crimes committed')
show(plot)

The pattern from the `Date Reported` column (if there was one) does not seem to repeat itself here.
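
One possible way to check whether a weekday pattern is more than noise is a chi-square goodness-of-fit test against a uniform distribution. The sketch below is illustrative only and is not part of the original analysis. It assumes that every weekday occurs equally often in the period covered and that incidents are independent of each other. The function name `weekday_chi_square` and the toy counts are hypothetical. The constant 12.592 is the tabulated chi-square critical value for 6 degrees of freedom at the 5% level.

```python
from collections import Counter

# Tabulated critical value: chi-square, 6 degrees of freedom, alpha = 0.05
CHI2_CRITICAL_DF6_ALPHA05 = 12.592


def weekday_chi_square(weekdays):
    """Chi-square statistic of weekday counts against a uniform expectation."""
    counts = Counter(weekdays)
    observed = [counts.get(day, 0) for day in range(7)]
    # Assumption: each weekday is equally likely, so each expects 1/7 of all events
    expected = len(weekdays) / 7.0
    return sum((obs - expected) ** 2 / expected for obs in observed)


# Hypothetical toy data: Monday over-represented, Sunday absent
toy_weekdays = [0] * 20 + [1] * 10 + [2] * 10 + [3] * 10 + [4] * 10 + [5] * 10
statistic = weekday_chi_square(toy_weekdays)
deviates_from_uniform = statistic > CHI2_CRITICAL_DF6_ALPHA05

print(statistic, deviates_from_uniform)
```

With real data, the lists of weekday indices computed in this notebook would replace `toy_weekdays`. A statistic above the critical value would suggest that the counts are not uniform across weekdays, but it would not say which days differ or why.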