# Final Project
Year project was done: 2019  
Name: Marc-Henri Bleu-Laine  
Contact: bleulaine.marchenri@gmail.com

## Problem Statement and Background
<img src="ATL.jpg" title="Atlanta Skyline" />

New businesses all face the same issue before they even start operating: finding the ideal location. In particular, opening a new African restaurant in the city of Atlanta, Georgia is not an easy task. There are several common questions that new restaurant/business owners ask themselves[[1]](http://alltopstartups.com/2018/03/15/the-importance-of-location-in-business)[[2]](https://www.thebalancesmb.com/choosing-restaurant-location-2888543):
<ul>
    <li>Does the success of your restaurant rely heavily on customer traffic?</li>
    <li>Does the location of the restaurant bring visibility?</li>
    <li>Are there already other restaurants offering the same service in the neighborhood?</li>
    <li>What are the owner's and the landlord's responsibilities? Does the rent for the location fit the business budget?</li>
    <li>Are the location and the building safe?</li>
    <li>etc...</li>
</ul>
Atlanta has a strong African community, and a growing number of people are willing to try food from the continent. The potential market is large, and a new restaurant could help promote a culture that remains somewhat unknown.  
  
Although the word "African" is used throughout, it is deliberately general because this work is meant as a proof of concept. For an actual customer, it would be better to focus on the specific type of African restaurant to be opened (e.g., Ivorian or Ethiopian).  
  
This work aims to help find the best location for a new African restaurant in the city of Atlanta, GA. The ideal location has few international competitors (the number of international restaurants in the neighborhood is limited) but is still known for its restaurants and activities (its top venues are restaurants and other venue categories that regularly attract many people). It also has a low crime rate and cheap rent.  
  
Choosing the right neighborhood gets complicated when all these parameters are taken into account. Clustering algorithms can group neighborhoods by their similarities, giving the restaurant owner a list of candidate neighborhoods in which the business is likely to thrive.
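
As a rough illustration of the grouping idea, here is a minimal k-means sketch in plain Python. This section does not say which clustering algorithm the project uses, so k-means is an assumption, as are the neighborhood labels, the three features, and their values, all of which are made up.

```python
import math

# Hypothetical, already-scaled features per neighborhood:
# (share of restaurants among venues, crime level, rent level), each in [0, 1].
features = {
    "A": (0.60, 0.20, 0.30),
    "B": (0.55, 0.25, 0.35),
    "C": (0.10, 0.80, 0.70),
    "D": (0.15, 0.75, 0.65),
}


def kmeans(points, k, iterations=20):
    """Plain k-means; the first k points seed the centroids (deterministic)."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by Euclidean distance.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels


names = list(features)
labels = kmeans([features[n] for n in names], k=2)
clusters = dict(zip(names, labels))
print(clusters)
```

With this toy input, neighborhoods A and B end up in one cluster and C and D in the other.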

## Data Description
This project fuses multiple datasets. The datasets are joined on two common keys: zipcodes and neighborhood names.
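
The join on these keys can be sketched with `pandas.DataFrame.merge`. Everything in the example is hypothetical: the three frames, their column names (`Neighborhood`, `Zipcode`, `CrimeCount`, `MedianRent`), and all values are placeholders, not the project's actual tables.

```python
import pandas as pd

# Hypothetical lookup: which zipcode each neighborhood falls in.
lookup = pd.DataFrame({
    "Neighborhood": ["A", "B", "C"],
    "Zipcode": ["00001", "00001", "00002"],
})
# Hypothetical per-neighborhood crime counts.
crime = pd.DataFrame({
    "Neighborhood": ["A", "B", "C"],
    "CrimeCount": [12, 3, 7],
})
# Hypothetical per-zipcode rent.
rent = pd.DataFrame({
    "Zipcode": ["00001", "00003"],
    "MedianRent": [1500.0, 1200.0],
})

# Normalize the keys before joining: zipcodes as 5-character strings,
# neighborhood names without stray whitespace.
for df in (lookup, rent):
    df["Zipcode"] = df["Zipcode"].astype(str).str.zfill(5)
for df in (lookup, crime):
    df["Neighborhood"] = df["Neighborhood"].str.strip()

# Left joins keep every neighborhood; a missing rent shows up as NaN.
fused = (
    crime.merge(lookup, on="Neighborhood", how="left")
         .merge(rent, on="Zipcode", how="left")
)
print(fused)
```

Left joins are used so that a neighborhood whose zipcode has no rent entry stays in the result with a missing value instead of silently disappearing.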

#### Foursquare [[3]](https://foursquare.com)

The main data source is Foursquare, which provides information about venues: their names, locations, and categories. Using this source, venues are grouped by category so that each neighborhood has a ranking of venue categories from the most common to the least common.
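
The ranking described above can be sketched with a `groupby` and a per-neighborhood rank. The sketch does not call the Foursquare API; the `venues` frame, its column names, and its rows are made up for illustration.

```python
import pandas as pd

# Hypothetical venue rows, one per venue; names and categories are made up.
venues = pd.DataFrame({
    "Neighborhood": ["A", "A", "A", "A", "B", "B", "B"],
    "Category": ["Restaurant", "Restaurant", "Park", "Bar",
                 "Bar", "Bar", "Restaurant"],
})

# Count venues per (neighborhood, category).
counts = (
    venues.groupby(["Neighborhood", "Category"])
          .size()
          .reset_index(name="Count")
)

# Order categories within each neighborhood, most common first;
# ties are broken alphabetically so the result is deterministic.
counts = counts.sort_values(
    ["Neighborhood", "Count", "Category"], ascending=[True, False, True]
)
counts["Rank"] = counts.groupby("Neighborhood").cumcount() + 1

# One row per neighborhood, one column per rank.
ranking = counts.pivot(index="Neighborhood", columns="Rank", values="Category")
print(ranking)
```

A neighborhood with fewer distinct categories than the others simply has missing values in its lower ranks.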

#### Zillow Rent Index [[4]](https://www.zillow.com/corp/About.htm)

Zillow is a real estate and rental marketplace. Its Zillow Rent Index (ZRI) tracks the monthly median rent in a given geographical region.  
The data contains the monthly rent in dollars for single-family homes, condominiums, cooperatives, and multifamily units. The average of the 2019 monthly median values will be used. A snapshot of the dataset is presented below.  
I will focus on the following counties in Georgia:
<ul>
    <li>Gwinnett County</li>
    <li>Fulton County</li>
    <li>Cobb County</li>
    <li>DeKalb County</li>
</ul>

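```python
import pandas as pd

# Load the Zillow Rent Index by zipcode; skipfooter requires the Python engine.
ZRI = pd.read_csv("Zip_Zri_AllHomesPlusMultifamily.csv", skipfooter=1, engine="python")
# Preview the Georgia rows.
ZRI[ZRI.State == "GA"].head()
```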

| Unnamed: 0 | RegionID | RegionName | City | State | Metro | CountyName | SizeRank | 2010-11 | 2010-12 | 2011-01 | ... | 2018-08 | 2018-09 | 2018-10 | 2018-11 | 2018-12 | 2019-01 | 2019-02 | 2019-03 | 2019-04 | 2019-05 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 31 | 71067 | 30349 | Riverdale | GA | Atlanta-Sandy Springs-Roswell | Clayton County | 32 | 1069.0 | 1058.0 | 1051.0 | ... | 1149 | 1149 | 1152 | 1164 | 1179 | 1190 | 1195 | 1201 | 1208 | 1222 |
| 48 | 70829 | 30044 | Lawrenceville | GA | Atlanta-Sandy Springs-Roswell | Gwinnett County | 49 | 1106.0 | 1114.0 | 1120.0 | ... | 1390 | 1390 | 1392 | 1398 | 1410 | 1422 | 1431 | 1436 | 1440 | 1443 |
| 52 | 70828 | 30043 | Lawrenceville | GA | Atlanta-Sandy Springs-Roswell | Gwinnett County | 53 | 1264.0 | 1270.0 | 1267.0 | ... | 1498 | 1498 | 1502 | 1517 | 1534 | 1547 | 1551 | 1556 | 1562 | 1574 |
| 59 | 70874 | 30096 | Duluth | GA | Atlanta-Sandy Springs-Roswell | Gwinnett County | 60 | 1204.0 | 1207.0 | 1214.0 | ... | 1446 | 1446 | 1451 | 1468 | 1490 | 1509 | 1517 | 1522 | 1527 | 1534 |
| 71 | 70810 | 30024 | Suwanee | GA | Atlanta-Sandy Springs-Roswell | Gwinnett County | 72 | 1598.0 | 1596.0 | 1591.0 | ... | 1871 | 1870 | 1873 | 1880 | 1890 | 1900 | 1907 | 1914 | 1923 | 1941 |
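
The 2019 averaging and the county filter described above can be sketched as follows. The example uses three rows copied from the snapshot (2019 columns only) so that it runs without the CSV file. The names `zri_sample`, `counties`, and `Rent2019` are illustrative, and the case-insensitive county match is an assumption about how county names are spelled in the file. Note that the snapshot's columns end at 2019-05, so the average here covers January through May.

```python
import pandas as pd

# Three rows copied from the snapshot, keeping only the 2019 columns.
zri_sample = pd.DataFrame({
    "RegionName": [30349, 30044, 30043],
    "State": ["GA", "GA", "GA"],
    "CountyName": ["Clayton County", "Gwinnett County", "Gwinnett County"],
    "2019-01": [1190, 1422, 1547],
    "2019-02": [1195, 1431, 1551],
    "2019-03": [1201, 1436, 1556],
    "2019-04": [1208, 1440, 1562],
    "2019-05": [1222, 1443, 1574],
})

counties = ["Gwinnett County", "Fulton County", "Cobb County", "DeKalb County"]

# Keep Georgia rows in the four counties of interest (case-insensitive match).
mask = (zri_sample["State"] == "GA") & zri_sample["CountyName"].str.lower().isin(
    [c.lower() for c in counties]
)

# Average every monthly column whose name starts with "2019-".
cols_2019 = [c for c in zri_sample.columns if c.startswith("2019-")]
rent_2019 = zri_sample.loc[mask, ["RegionName", "CountyName"]].assign(
    Rent2019=zri_sample.loc[mask, cols_2019].mean(axis=1)
)
print(rent_2019)
```

The Clayton County row is dropped by the filter, and each remaining zipcode gets a single rent value that can be joined with the other datasets.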


#### COBRA [[5]](http://www.atlantapd.org/i-want-to/crime-data-downloads)

COBRA is a dataset obtained from the Atlanta Police Department. It lists reported crimes, with their types and locations, in the greater Atlanta area.

```python
# Load the 2019 COBRA crime reports from the Atlanta Police Department.
ATLCrime = pd.read_csv("COBRA-2019.csv")
ATLCrime.head()
```

| Unnamed: 0 | Report Number | Report Date | Occur Date | Occur Time | Possible Date | Possible Time | Beat | Apartment Office Prefix | Apartment Number | Location | Shift Occurrence | Location Type | UCR Literal | UCR # | IBR Code | Neighborhood | NPU | Latitude | Longitude |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 190010138 | 2019-01-01 | 2019-01-01 | 20 | 2019-01-01 | 25 | 511.0 | | | 50 UPPER ALABAMA ST SW | Morning Watch | 13.0 | LARCENY-NON VEHICLE | 620 | 2302 | Downtown | M | 33.75194 | -84.38964 |
| 1 | 190010299 | 2019-01-01 | 2019-01-01 | 120 | 2019-01-01 | 130 | 511.0 | | | 20 BROAD ST | Morning Watch | | LARCENY-NON VEHICLE | 620 | 2302 | Downtown | M | 33.75312 | -84.39208 |
| 2 | 190011858 | 2019-01-01 | 2019-01-01 | 1740 | 2019-01-01 | 1750 | 411.0 | | A15 | 3000 CONTINENTAL COLONY PKWY SW | Evening Watch | 26.0 | LARCENY-NON VEHICLE | 620 | 2302 | Greenbriar | R | 33.68077 | -84.4937 |
| 3 | 190010845 | 2019-01-01 | 2019-01-01 | 415 | 2019-01-01 | 420 | 607.0 | | | 1362 BOULEVARD SE | Morning Watch | 23.0 | LARCENY-NON VEHICLE | 630 | 2303 | Benteen Park | W | 33.71744 | -84.36818 |
| 4 | 190011541 | 2019-01-01 | 2019-01-01 | 1400 | 2019-01-01 | 1430 | 210.0 | | | 3393 PEACHTREE RD NE @LENOX MALL | Evening Watch | 8.0 | LARCENY-NON VEHICLE | 630 | 2303 | Lenox | B | 33.84676 | -84.36212 |
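
One way to turn these report-level rows into a per-neighborhood figure is to count reports by `Neighborhood`. The sketch below uses only the `Neighborhood` and `UCR Literal` values from the five rows shown above; `crime_sample` and `CrimeCount` are illustrative names. It yields raw counts, not rates: a rate would need a population figure per neighborhood, which is not among the columns shown here.

```python
import pandas as pd

# Neighborhood and offense columns copied from the five rows shown above.
crime_sample = pd.DataFrame({
    "Neighborhood": ["Downtown", "Downtown", "Greenbriar", "Benteen Park", "Lenox"],
    "UCR Literal": ["LARCENY-NON VEHICLE"] * 5,
})

# Drop rows with no neighborhood, then count reports per neighborhood,
# most reports first.
crime_counts = (
    crime_sample.dropna(subset=["Neighborhood"])
                .groupby("Neighborhood")
                .size()
                .sort_values(ascending=False)
                .rename("CrimeCount")
)
print(crime_counts)
```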
