# Relocating to Toronto 

### Data

In order to find the neighborhoods which meet the family's needs and desires, we will use data from two sources:
- City of Toronto Website (https://www.toronto.ca/) via their Open Data Portal (https://open.toronto.ca/)
    - Neighborhood Boundary Data
    - Neighborhood Profile Data
- Foursquare (https://foursquare.com/) via the Foursquare API

##### City of Toronto Open Data Portal

The City of Toronto's Open Data Portal contains extensive information on the neighborhoods of Toronto.  For this analysis we will use the geospatial data providing boundary information of the 140 neighborhoods and the neighborhood profile data containing demographic and socio-economic data for each neighborhood.

###### ... Boundary Data

The neighborhood boundary information (source: https://open.toronto.ca/dataset/neighbourhood-profiles/) is contained in the file *Neighbourhoods.csv* and provides the boundaries of each neighborhood in the form of geospatial coordinates of the vertices of the polygon which encloses the neighborhood.  Here is an example of the data provided for each neighborhood:

In [1]:
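import pandas as pd
# Load the neighborhood boundary file downloaded from the Open Data Portal
nbhd_bound = pd.read_csv('Neighbourhoods.csv')
# Show only the neighborhood name and its boundary polygon
nbhd_bound[['AREA_NAME','geometry']]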

Unnamed: 0,AREA_NAME,geometry
0,Casa Loma (96),"{u'type': u'Polygon', u'coordinates': (((-79.4..."
1,Annex (95),"{u'type': u'Polygon', u'coordinates': (((-79.3..."
2,Caledonia-Fairbank (109),"{u'type': u'Polygon', u'coordinates': (((-79.4..."
3,Woodbine Corridor (64),"{u'type': u'Polygon', u'coordinates': (((-79.3..."
4,Lawrence Park South (103),"{u'type': u'Polygon', u'coordinates': (((-79.4..."
...,...,...
135,Dorset Park (126),"{u'type': u'Polygon', u'coordinates': (((-79.2..."
136,Centennial Scarborough (133),"{u'type': u'Polygon', u'coordinates': (((-79.1..."
137,Humbermede (22),"{u'type': u'Polygon', u'coordinates': (((-79.5..."
138,Willowdale West (37),"{u'type': u'Polygon', u'coordinates': (((-79.4..."


For each neighborhood (**AREA_NAME**), the coordinates of the vertices of the polygon which encloses it are given (**geometry**).  The centroid of this polygon can be calculated and used as the center point of the neighborhood.  This center point will be used in calls to the Foursquare API to get venue information for a given neighborhood.
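The centroid calculation described above can be sketched as follows. This is a minimal illustration, not necessarily the method used in the analysis: it assumes each **geometry** value is a string holding a Python-style dictionary (as the sample output suggests), that the first ring in `coordinates` is the outer boundary with points ordered as (longitude, latitude), and that treating those coordinates as planar is accurate enough at the scale of a neighborhood. The function name `polygon_centroid` and the sample square are hypothetical.

```python
import ast

def polygon_centroid(geometry_str):
    """Return (lat, lon) of the planar centroid of a polygon's outer ring."""
    geom = ast.literal_eval(geometry_str)      # parse the dict-like string safely
    ring = geom['coordinates'][0]              # assumed: first ring is the outer boundary
    pts = [(p[0], p[1]) for p in ring]         # keep (lon, lat) only
    if pts[0] != pts[-1]:                      # close the ring if it is not closed
        pts.append(pts[0])

    # Shift to the first vertex to limit floating-point error, then apply
    # the standard shoelace-based centroid formula.
    x_ref, y_ref = pts[0]
    area2 = cx = cy = 0.0
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        x0, y0, x1, y1 = x0 - x_ref, y0 - y_ref, x1 - x_ref, y1 - y_ref
        cross = x0 * y1 - x1 * y0
        area2 += cross
        cx += (x0 + x1) * cross
        cy += (y0 + y1) * cross
    if area2 == 0:
        raise ValueError('polygon has zero area')
    lon = cx / (3 * area2) + x_ref
    lat = cy / (3 * area2) + y_ref
    return lat, lon

# Made-up square, written in the same style as the geometry column.
sample = ("{u'type': u'Polygon', u'coordinates': (((-79.5, 43.6), (-79.4, 43.6), "
          "(-79.4, 43.7), (-79.5, 43.7), (-79.5, 43.6)),)}")
lat, lon = polygon_centroid(sample)
```

Under these assumptions, the function could be applied to the whole column with `nbhd_bound['geometry'].apply(polygon_centroid)`.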

###### ... Profile Data

The neighborhood profile information (source: https://open.toronto.ca/dataset/neighbourhood-profiles/) is contained in the file *neighbourhood-profiles-2016-csv.csv* and provides a wealth of demographic, social and economic data about the residents of each neighborhood.

For example, we will access neighborhood data such as that indicated below:

In [2]:
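# Load the 2016 neighborhood profile data
nbhd_profile = pd.read_csv('neighbourhood-profiles-2016-csv.csv')
# List a few of the data categories relevant to this analysis
list(nbhd_profile['Category'].unique()[[0,2,9,10]])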

['Neighbourhood Information',
 'Families, households and marital status',
 'Education',
 'Housing']

This data will allow us to find neighborhoods with the characteristics (families with children, highly educated residents, etc.) requested by the family.
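As a rough sketch of how such characteristics might be pulled out, the snippet below filters a profile table by **Category** and reshapes it so that each neighborhood becomes a row. Only the `Category` column is confirmed by the output above; the `Characteristic` column, the one-column-per-neighborhood layout, the neighborhood names and all values in `toy_profile` are assumptions made up for illustration. If the real values are stored as text, they would need converting to numbers first.

```python
import pandas as pd

# Toy stand-in for nbhd_profile; names and values are made up.
toy_profile = pd.DataFrame({
    'Category': ['Education', 'Education', 'Housing'],
    'Characteristic': ['Bachelor degree or higher', 'No certificate',
                       'Owner households'],
    'Nbhd A': [1200, 300, 800],
    'Nbhd B': [400, 900, 350],
})

def characteristics_by_neighbourhood(profile, category):
    """Return one row per neighborhood and one column per characteristic."""
    subset = profile[profile['Category'] == category]
    return subset.drop(columns=['Category']).set_index('Characteristic').T

education = characteristics_by_neighbourhood(toy_profile, 'Education')
```

Here `education` has the neighborhoods as its index, which makes it straightforward to compare or sort neighborhoods on a single characteristic.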

##### Foursquare

Foursquare is a location-based search tool for exploring numerous types of "venues" in a prescribed geographical region.  These venues include businesses, recreational areas and schools, just to name a few.  We will be using data obtained from Foursquare via the Foursquare API.
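A request for venues around a neighborhood center might be assembled along the following lines. This sketch assumes the legacy Foursquare v2 `venues/explore` endpoint and its `client_id`, `client_secret`, `v`, `ll`, `radius` and `limit` parameters; the helper name `build_explore_url`, the default radius, limit and version date, and the credentials are placeholders rather than values taken from this analysis. The code only builds the URL and does not contact the API.

```python
from urllib.parse import urlencode

# Assumed legacy v2 endpoint; check the current Foursquare documentation.
EXPLORE_URL = 'https://api.foursquare.com/v2/venues/explore'

def build_explore_url(lat, lon, client_id, client_secret,
                      radius=1000, limit=100, version='20180605'):
    """Build a venue-search URL centered on (lat, lon)."""
    params = {
        'client_id': client_id,
        'client_secret': client_secret,
        'v': version,              # API version date (placeholder)
        'll': f'{lat},{lon}',      # center point as "lat,lon"
        'radius': radius,          # search radius in meters
        'limit': limit,            # maximum number of venues returned
    }
    return f'{EXPLORE_URL}?{urlencode(params)}'

url = build_explore_url(43.65, -79.45, 'YOUR_CLIENT_ID', 'YOUR_CLIENT_SECRET')
```

The resulting URL could then be passed to an HTTP client such as `requests.get(url).json()` to retrieve the venue list.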

For example, a venue search might give a list of venues such as:

<img src="FoursquareSamp.jpg" align="left" width=600>

Thus, both the presence and the number of venues of a given category in a neighborhood can be determined.  This allows us to narrow our choice of neighborhoods for the family based on whether a certain venue appears in a neighborhood and/or whether a certain type of venue is common there.  We can use that information to find clusters of neighborhoods with desirable characteristics and to build a metric for ranking them.
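To make the idea of counting venues by category concrete, here is a small sketch using made-up venue records. The column names `Neighbourhood` and `Category`, the neighborhood names and the records themselves are hypothetical; real records would come from parsed Foursquare responses.

```python
import pandas as pd

# Toy venue records (made up) standing in for parsed Foursquare results.
venues = pd.DataFrame({
    'Neighbourhood': ['Nbhd A', 'Nbhd A', 'Nbhd A', 'Nbhd B', 'Nbhd B'],
    'Category': ['Park', 'Pizza Place', 'Park', 'Grocery Store', 'Pizza Place'],
})

# Number of venues of each category in each neighborhood.
venue_counts = pd.crosstab(venues['Neighbourhood'], venues['Category'])

# Share of each neighborhood's venues that fall in each category.
venue_freq = venue_counts.div(venue_counts.sum(axis=1), axis=0)
```

A table like `venue_freq` has one row per neighborhood and comparable columns, so it is a natural input for a clustering step, while `venue_counts` answers the simpler question of whether a category is present at all.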

In our analysis, we will use this data to find neighborhoods that have colleges or universities and that also contain parks (and similar outdoor recreational sites), as well as a variety of grocery stores, pizza places, Italian restaurants and comfort-food restaurants, as desired by the family.  We will also look for neighborhoods in general proximity to sports stadiums, as the family has indicated a secondary desire to be near such venues.
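One simple way to apply the family's must-have criteria is to keep only the neighborhoods with at least one venue in every required category. The sketch below assumes a table of venue counts per neighborhood; the category labels, neighborhood names, counts and the helper `has_all` are all hypothetical and may not match the category names Foursquare actually returns.

```python
import pandas as pd

# Toy venue counts (made up); rows are neighborhoods, columns are categories.
venue_counts = pd.DataFrame(
    {'University': [1, 0, 2], 'Park': [3, 4, 0], 'Pizza Place': [2, 1, 5]},
    index=['Nbhd A', 'Nbhd B', 'Nbhd C'],
)

def has_all(counts, required):
    """Keep rows with at least one venue in every required category."""
    mask = (counts[list(required)] > 0).all(axis=1)
    return counts[mask]

shortlist = has_all(venue_counts, ['University', 'Park'])
```

Secondary preferences, such as proximity to sports stadiums, could then be used to order the shortlist rather than to exclude neighborhoods from it.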