# Applied Data Science Capstone - The Battle of Neighborhoods

## 1. Introduction - A data journey to London

The other day I received an email from a friend in Japan telling me that he had received an attractive offer from a company in London. He is now planning to relocate with his wife and their 4-year-old boy.
However, he has no experience of living abroad and does not have a clue where to start looking for the right information about the neighborhoods there.
So I have decided to help him with the data analysis skills I recently acquired from the data science course at Coursera.

### Goal of the project: To find suitable residential areas in or around London for a family with a small child.

## 2. Data

I first decided to check the crime rate, as it is usually the most important condition for a family with small children. I also know that they love Asian cuisine and cannot live without it, so this needs to be taken into account as well.

### Target: Find one or more areas with 1) a low crime rate and 2) good access to Asian cuisine (Extra, if possible: good access to Metro stations)

### Data source

#### 1) Crime records in London - https://data.london.gov.uk/dataset/recorded_crime_summary (MPS Borough Level Crime in CSV format)
#### 2) Geographical coordinates of Boroughs and Wards in London - https://data.london.gov.uk/dataset/excel-mapping-template-for-london-boroughs-and-wards
#### 3) Neighborhood venues - Foursquare API
#### (4) Locations of Metro stations in London - https://www.doogal.co.uk/london_stations.php (station name, geographical coordinates (longitude, latitude))

1) The crime records for London include the crime category, the borough name, and the number of crimes for each month from June 2018 to May 2020.

2) The geographical coordinates of Boroughs and Wards in London require cleaning, as they are not in one file.
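One possible way to bring coordinates from separate files into one table is to rename the columns to a common schema and concatenate. The sketch below is only an illustration: the two small DataFrames stand in for the real files, and the column names (`Borough`, `Ward`, `Lat`, `Lon`, `Latitude`, `Longitude`), the helper `tidy`, and all values are placeholders, not taken from the actual dataset.

```python
import pandas as pd

# Hypothetical stand-ins for the separate source files.
# Names and coordinates are placeholders, not real data.
boroughs = pd.DataFrame({
    "Borough": ["Borough A"],
    "Lat": [51.50],
    "Lon": [-0.10],
})
wards = pd.DataFrame({
    "Ward": ["Ward A1"],
    "Latitude": [51.51],
    "Longitude": [-0.11],
})

def tidy(df, name_col, lat_col, lon_col, level):
    """Rename columns to a shared schema and tag the administrative level."""
    out = df.rename(columns={name_col: "Name",
                             lat_col: "Latitude",
                             lon_col: "Longitude"})
    out = out[["Name", "Latitude", "Longitude"]].copy()
    out["Level"] = level
    return out

# Stack both tables into one tidy coordinate table.
df_coords = pd.concat(
    [tidy(boroughs, "Borough", "Lat", "Lon", "Borough"),
     tidy(wards, "Ward", "Latitude", "Longitude", "Ward")],
    ignore_index=True,
)
print(df_coords)
```

With the real files, the two DataFrames would come from `pd.read_excel` or `pd.read_csv`, and the column names would need to be adjusted to whatever the files actually contain.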

## 3. Data Analysis

Importing libraries

In [2]:
import numpy as np 
import pandas as pd

import json

from geopy.geocoders import Nominatim # convert an address into latitude and longitude values

import requests # library to handle requests
from pandas import json_normalize # transform JSON into a pandas dataframe (top-level import, available since pandas 1.0)

import matplotlib.cm as cm
import matplotlib.colors as colors

from sklearn.cluster import KMeans

import folium # map rendering library


In [7]:
df_london_boroughs = pd.read_csv('MPS Borough Level Crime (most recent 24 months).csv')
df_london_boroughs.head()

| Unnamed: 0 | MajorText | MinorText | LookUp_BoroughName | 201806 | 201807 | 201808 | 201809 | 201810 | 201811 | 201812 | ... | 201908 | 201909 | 201910 | 201911 | 201912 | 202001 | 202002 | 202003 | 202004 | 202005 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Arson and Criminal Damage | Arson | Barking and Dagenham | 12 | 6 | 5 | 3 | 8 | 5 | 1 | ... | 3 | 6 | 9 | 8 | 6 | 4 | 5 | 6 | 2 | 2 |
| 1 | Arson and Criminal Damage | Criminal Damage | Barking and Dagenham | 123 | 127 | 101 | 107 | 132 | 105 | 88 | ... | 118 | 109 | 109 | 97 | 121 | 97 | 103 | 107 | 80 | 88 |
| 2 | Burglary | Burglary - Business and Community | Barking and Dagenham | 33 | 30 | 18 | 33 | 32 | 39 | 33 | ... | 35 | 37 | 30 | 30 | 25 | 31 | 17 | 28 | 29 | 16 |
| 3 | Burglary | Burglary - Residential | Barking and Dagenham | 77 | 94 | 84 | 99 | 94 | 106 | 164 | ... | 67 | 80 | 97 | 114 | 130 | 116 | 123 | 97 | 57 | 41 |
| 4 | Drug Offences | Drug Trafficking | Barking and Dagenham | 6 | 8 | 7 | 10 | 7 | 7 | 4 | ... | 7 | 7 | 10 | 12 | 3 | 13 | 4 | 6 | 11 | 11 |


In [12]:
df_london_boroughs.shape

(1566, 27)
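Because each row holds monthly counts for one crime type in one borough, comparing boroughs requires summing across the month columns and then across rows. A minimal sketch of that aggregation follows, using a three-row, three-month sample copied from the `head()` output above rather than the full file. Note that this gives raw counts; turning them into a rate would need population figures, which are not among the data sources listed here.

```python
import pandas as pd

# Small sample copied from the head() output (first three months only).
sample = pd.DataFrame({
    "MajorText": ["Arson and Criminal Damage",
                  "Arson and Criminal Damage",
                  "Burglary"],
    "MinorText": ["Arson",
                  "Criminal Damage",
                  "Burglary - Business and Community"],
    "LookUp_BoroughName": ["Barking and Dagenham"] * 3,
    "201806": [12, 123, 33],
    "201807": [6, 127, 30],
    "201808": [5, 101, 18],
})

# Month columns are the ones named with six digits (YYYYMM).
month_cols = [c for c in sample.columns if c.isdigit() and len(c) == 6]

# Sum each row across months, then sum the rows per borough.
borough_totals = (
    sample.assign(Total=sample[month_cols].sum(axis=1))
          .groupby("LookUp_BoroughName")["Total"]
          .sum()
          .sort_values()
)
print(borough_totals)
```

The same steps should apply to `df_london_boroughs` unchanged, assuming its month columns keep the `YYYYMM` naming shown in the output.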

Let's see which types of crime are recorded.

In [11]:
print(df_london_boroughs['MinorText'].unique())

['Arson' 'Criminal Damage' 'Burglary - Business and Community'
 'Burglary - Residential' 'Drug Trafficking' 'Possession of Drugs'
 'Bail Offences' 'Bigamy' 'Dangerous Driving'
 'Disclosure, Obstruction, False or Misleading State'
 'Exploitation of Prostitution'
 'Fraud or Forgery Associated with Driver Records'
 'Going Equipped for Stealing' 'Handling Stolen Goods'
 'Making, Supplying or Possessing Articles for use i'
 'Obscene Publications' 'Other Forgery' 'Other Notifiable Offences'
 'Perjury' 'Perverting Course of Justice' 'Possession of False Documents'
 'Profitting From or Concealing Proceeds of Crime'
 'Threat or Possession With Intent to Commit Crimina' 'Wildlife Crime'
 'Other Firearm Offences' 'Possession of Article with Blade or Point'
 'Possession of Firearm with Intent' 'Possession of Firearms Offences'
 'Possession of Other Weapon'
 'Other Offences Against the State, or Public Order'
 'Public Fear Alarm or Distress'
 'Racially or Religiously Aggravated Public Fear, Al' 'Vi