## The Battle of Neighborhoods
April 2020

### Objective
The project uses Foursquare location data to explore or compare neighborhoods or cities of your choice, or to solve a problem of your own for which Foursquare location data is suited.
### Project Requirements
Introduction: where you discuss the business problem and who would be interested in this project. \
Data: where you describe the data that will be used to solve the problem and the source of the data.\
Methodology: the main component of the report, where you discuss and describe any exploratory data analysis that you did, any inferential statistical testing that you performed, and which machine learning techniques were used and why.\
Results: where you discuss the results.\
Discussion: where you discuss any observations you noted and any recommendations you can make based on the results.\
Conclusion: where you conclude the report.

## Project Title: Choosing a suitable location for opening a Korean restaurant in the city of Singapore
##### This project notebook contains 2 parts 
Part 1: \
A description of the problem and a discussion of the background \
A description of the data and how it will be used to solve the problem \
Part 2: \
Methodology and Exploratory Data Analysis \
Inferences and Discussion

### Part 1
### 1.1 Introduction 
Singapore is a city-state located at the southern tip of the Malay Peninsula, about 137 kilometres (85 miles) north of the Equator. It consists of the diamond-shaped Singapore Island and some 60 small islets; the main island occupies all but about 18 square miles of this combined area.
Singapore is a major financial and shipping hub, consistently ranked the most expensive city to live in since 2013. Singapore is also a popular tourist destination, with well-known landmarks such as the Merlion, Marina Bay Sands, Gardens by the Bay, the Jewel, the Orchard Road shopping belt, the resort island of Sentosa, and the Singapore Botanic Gardens, the only tropical garden in the world to be honoured as a UNESCO World Heritage Site.[1] \
The beautiful "Garden City" of Singapore is full of commercial opportunities, but opening a new business there poses tough challenges for any investor.\
[1] https://en.wikipedia.org/wiki/Singapore

### 1.2 Business Objective and Problem 
A Korean business investor is certain that Korean cuisine will be popular in Singapore and plans to open a high-end Korean restaurant there. The investor wants the restaurant to be located in a busy business and commercial district with heavy foot traffic, including office workers and tourists, and without fierce competition from similar restaurants nearby.\
Singapore has a number of administrative districts with established businesses, and many different types of restaurants catering to various needs in those districts. Before investing in the restaurant, the investor first needs to select a location by taking into account the commercial activity, business establishments, restaurant competition, and cost of operation in the area. The problem addressed in this work is to decide on a location for the planned Korean restaurant that meets the investor's expectations.

### 1.3 Data collection and processing

To find a proper restaurant location, one needs to select a district among many possible choices, by collecting and analyzing the available information about the districts, i.e., the information on the surroundings, businesses and commercial activities, especially restaurants.
#### 1.3.1 Singapore districts data 
Singapore is divided into 28 districts under the old postal district system[2,3].\
[2] https://www.singaporeexpats.com/housing-in-singapore/singapore-district-guide.htm \
[3] https://www.redbrick.sg/blog/singapore-districts-and-regions/

#### Singapore Districts Map:
![image.png](attachment:image.png)

The districts are grouped into 3 regions: Core Central Region (CCR), Rest of Central Region (RCR), and Outside Central Region (OCR). These are the three main regions of Singapore; each consists of several districts, which in turn contain the various towns and streets of the country.
#### District Table:
![image.png](attachment:image.png)


#### 1.3.2 Latitudes and Longitudes of districts in core central region (CCR)

The CCR is regarded as Singapore’s prime metropolitan area; owning private residential property here is generally taken to mark someone as a ‘high net-worth individual’. This is largely due to its location, its close proximity to the central business district (CBD), the quality of the properties, and the high property values compared with the other regions.\
As the core central region has the busiest commercial activity and high-end residential properties, it is the most attractive place to open a high-end restaurant. Let's focus on and explore this core central region of 6 districts.
#### The CCR region with latitude and longitude data table  
![image.png](attachment:image.png)

In [1]:
import numpy as np # library to handle data in a vectorized manner

import pandas as pd # library for data analysis
pd.set_option('display.max_columns', None)
pd.set_option('display.max_rows', None)

import json # library to handle JSON files

!conda install -c conda-forge geopy --yes # installs geopy; comment this line out if geopy is already installed
from geopy.geocoders import Nominatim # convert an address into latitude and longitude values

import requests # library to handle requests
from pandas import json_normalize # transform JSON into a pandas dataframe (pandas >= 1.0; older versions import it from pandas.io.json)

# Matplotlib and associated plotting modules
import matplotlib.cm as cm
import matplotlib.colors as colors

# import k-means from clustering stage
from sklearn.cluster import KMeans

#!conda install -c conda-forge folium=0.5.0 --yes # uncomment this line if you haven't completed the Foursquare API lab
#import folium # map rendering library


In [2]:
!pip -q install folium

In [3]:
import folium # map rendering library

In [3]:
# The code was removed by Watson Studio for sharing.

| Unnamed: 0 | District | Code | Latitude | Longitude |
|---|---|---|---|---|
| 0 | CCR | 1 | 1.28895 | 103.84904 |
| 1 | CCR | 2 | 1.27929 | 103.85055 |
| 2 | CCR | 6 | 1.2925 | 103.85392 |
| 3 | CCR | 9 | 1.30374 | 103.83214 |
| 4 | CCR | 10 | 1.31355 | 103.79071 |
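
Because the loading cell was removed before sharing, the following is only a minimal sketch of how the same table could be rebuilt by hand. The values are copied from the output above; the variable name `ccr_df` and the range checks are assumptions for illustration, not the notebook's original code.

```python
import pandas as pd

# Values copied from the CCR output table; `ccr_df` is a hypothetical name.
ccr_df = pd.DataFrame(
    {
        "District": ["CCR"] * 5,
        "Code": [1, 2, 6, 9, 10],
        "Latitude": [1.28895, 1.27929, 1.2925, 1.30374, 1.31355],
        "Longitude": [103.84904, 103.85055, 103.85392, 103.83214, 103.79071],
    }
)

# Rough sanity check: every point should fall inside a loose bounding box
# around Singapore (the box limits are an assumption for illustration).
in_bounds = ccr_df["Latitude"].between(1.0, 1.6) & ccr_df["Longitude"].between(103.5, 104.1)
print(ccr_df[in_bounds])
```

A check like this catches swapped latitude and longitude columns before any API calls are made.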


Once the district data has been collected and selected from the websites referenced above, the data analysis can proceed. All data on restaurant locations and quality will be obtained from the Foursquare API, accessed through the `requests` library in Python.

### Part 2 (Week 2)
#### 2.1 Methodology and Exploratory Data Analysis


Foursquare ID

#### Let's get the top 100 venues in downtown Singapore within a radius of 500 meters, which covers only one neighborhood
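
The query for this step might be assembled as in the sketch below. It assumes the legacy Foursquare v2 `venues/explore` endpoint; the credentials are placeholders, the `VERSION` date and the helper name `build_explore_url` are assumptions, and the first row of the CCR table is used only as an example point.

```python
from urllib.parse import urlencode

# Placeholder credentials: replace with your own Foursquare developer keys.
CLIENT_ID = "YOUR_CLIENT_ID"
CLIENT_SECRET = "YOUR_CLIENT_SECRET"
VERSION = "20180605"  # API version date; this particular value is an assumption


def build_explore_url(lat, lng, radius=500, limit=100):
    """Hypothetical helper: build a venues/explore URL around one point."""
    params = {
        "client_id": CLIENT_ID,
        "client_secret": CLIENT_SECRET,
        "v": VERSION,
        "ll": f"{lat},{lng}",   # latitude,longitude of the search centre
        "radius": radius,       # search radius in meters
        "limit": limit,         # maximum number of venues to return
    }
    return "https://api.foursquare.com/v2/venues/explore?" + urlencode(params)


# Example point: the first row of the CCR table.
url = build_explore_url(1.28895, 103.84904)
print(url)
```

Building the URL in a helper keeps the radius and limit in one place, so the same call can be repeated for each district's coordinates.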

#### Send the GET request and examine the results
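
As a rough illustration of examining the results, the sketch below flattens a response into a table. The `results` dictionary is a hypothetical, trimmed-down payload in the shape the explore endpoint is assumed to return; the venue names and coordinates are invented. With live credentials it would instead come from `requests.get(url).json()`.

```python
import pandas as pd

# Hypothetical payload; every venue listed here is made up for illustration.
results = {
    "response": {
        "groups": [
            {
                "items": [
                    {"venue": {"name": "Example Coffee Bar",
                               "categories": [{"name": "Coffee Shop"}],
                               "location": {"lat": 1.2890, "lng": 103.8490}}},
                    {"venue": {"name": "Example Korean BBQ",
                               "categories": [{"name": "Korean Restaurant"}],
                               "location": {"lat": 1.2885, "lng": 103.8495}}},
                ]
            }
        ]
    }
}

items = results["response"]["groups"][0]["items"]

# Flatten nested venue records into dotted column names such as "venue.name".
venues = pd.json_normalize(items)

# Keep the first category name of each venue, or None when the list is empty.
venues["category"] = venues["venue.categories"].apply(
    lambda cats: cats[0]["name"] if cats else None
)

nearby = venues[
    ["venue.name", "category", "venue.location.lat", "venue.location.lng"]
].rename(
    columns={
        "venue.name": "name",
        "venue.location.lat": "lat",
        "venue.location.lng": "lng",
    }
)
print(nearby)
```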

### Analyze Each Neighborhood

#### Let's group rows by neighborhood and take the mean of the frequency of occurrence of each category
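
One common way to do this, sketched under the assumption that each venue row carries a neighborhood label and a category, is to one-hot encode the categories and average the indicator columns per neighborhood. The small `venues` table below is hypothetical sample data, not results from the notebook.

```python
import pandas as pd

# Hypothetical venue list; in the notebook this would come from the Foursquare results.
venues = pd.DataFrame(
    {
        "Neighborhood": ["District 1", "District 1", "District 9", "District 9"],
        "Category": ["Korean Restaurant", "Coffee Shop", "Coffee Shop", "Coffee Shop"],
    }
)

# One-hot encode the category column: one 0/1 indicator column per category.
onehot = pd.get_dummies(venues["Category"], dtype=float)
onehot.insert(0, "Neighborhood", venues["Neighborhood"])

# The mean of each indicator column is the share of a neighborhood's venues
# that fall into that category.
grouped = onehot.groupby("Neighborhood").mean().reset_index()
print(grouped)
```

A low share in the "Korean Restaurant" column for a busy district would suggest weaker direct competition there, which is the signal the investor is looking for.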