# How To Use Data Science to Find the Best Chinese Cuisine in NYC

## 1. Introduction:

This project explores Chinese cuisine in New York City (NYC). NYC has two major hubs for Chinese food. Manhattan’s Chinatown spans roughly ten blocks between Canal Street and Worth Street; the second hub is Flushing, Queens. New York is a major metropolitan area with more than 8.4 million people (census.gov, 2019) living within city limits and a population density of 28,000 residents per square mile.


Chinese immigration to the United States was restricted by the Chinese Exclusion Act. After the act was repealed in 1943, the Chinese community slowly started growing. Today, Manhattan’s Chinatown has grown from the original ten blocks to include the once Italian and Jewish Lower East Side. A large influx of Cantonese-speaking Chinese immigrants drove the extension of the neighborhood. In the 1970s and 1980s, immigrants from Guangdong and Hong Kong began to rapidly develop newer portions of Manhattan’s Chinatown, first north of Canal Street and later east of the Bowery.

With almost 60,000 Manhattan inhabitants reporting Chinese ancestry (macaulay.cuny.edu, 2020), there is a clear need to find and enjoy Anhui, Cantonese, Fujian, Hunan, Jiangsu, Shandong, Sichuan (Szechuan), Zhejiang, and other Chinese cuisines.

This report explores which neighborhoods and boroughs of New York City have the most, as well as the best, Chinese restaurants according to Foursquare data. Additionally, I will attempt to answer the questions “Where should one open a Chinese restaurant?” and “Where should one stay to find great Chinese food?”




## 2. Data Section

To answer the above questions, we need to gather data. The data on New York City neighborhoods and boroughs should include boundaries, latitude, longitude, restaurants, and restaurant ratings and tips. A number of APIs and websites provide this data.

New York City data containing the neighborhoods and boroughs, latitudes, and longitudes will be obtained from the data source: https://cocl.us/new_york_dataset

New York City data containing borough boundaries will be obtained from the data source: https://data.cityofnewyork.us/City-Government/Borough-Boundaries/tqmj-j8zm

All data related to the locations and quality of Chinese restaurants will be acquired from Foursquare via the Foursquare API. We will combine the Foursquare API with the NYC neighborhood data to request detailed restaurant data. For this we will use the `requests` library in Python.



## 3. Methodology Section

**Data is collected from https://cocl.us/new_york_dataset, then cleaned and loaded into a dataframe.**

<img src="https://miro.medium.com/max/800/1*6241RrDDYzW8YI3tiqc1tQ.png">
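
As a rough sketch of this step, the snippet below flattens a GeoJSON-style feature collection into one record per neighborhood. It assumes each feature carries `borough` and `name` properties and a `[longitude, latitude]` point. Those field names, the helper `neighborhoods_to_rows`, and the approximate sample coordinates are assumptions for illustration, not taken from the notebook.

```python
import json

# Tiny stand-in for the downloaded file. The structure, field names, and
# approximate coordinates are assumptions for illustration only.
SAMPLE_GEOJSON = """
{
  "features": [
    {"properties": {"borough": "Manhattan", "name": "Chinatown"},
     "geometry": {"coordinates": [-73.9943, 40.7156]}},
    {"properties": {"borough": "Queens", "name": "Flushing"},
     "geometry": {"coordinates": [-73.8297, 40.7645]}}
  ]
}
"""


def neighborhoods_to_rows(geojson):
    """Flatten GeoJSON-style features into one dict per neighborhood."""
    rows = []
    for feature in geojson.get("features", []):
        props = feature.get("properties", {})
        coords = feature.get("geometry", {}).get("coordinates")
        # Cleaning step: skip records with a missing name or coordinates.
        if not props.get("borough") or not props.get("name") or not coords:
            continue
        longitude, latitude = coords[0], coords[1]  # GeoJSON order is lon, lat
        rows.append({
            "Borough": props["borough"],
            "Neighborhood": props["name"],
            "Latitude": latitude,
            "Longitude": longitude,
        })
    return rows


rows = neighborhoods_to_rows(json.loads(SAMPLE_GEOJSON))
for row in rows:
    print(row)
```

A list of dicts such as `rows` can be passed to `pandas.DataFrame(rows)` to obtain a dataframe with these four columns.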

**I used the Python folium library to visualize the geography of NYC and its boroughs, creating a map of NYC with the boroughs superimposed on top from their latitude and longitude values, as shown below:**

<img src="https://miro.medium.com/max/1400/1*hsM3qsqsK-Uog4Xt0rljAA.png">
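
A map like this could be drawn roughly as follows. This is a sketch, not the notebook's code: the helpers `map_center` and `build_map`, the marker styling, and the approximate sample coordinates are all assumptions. folium is imported inside `build_map`, so the centering helper runs even where folium is not installed.

```python
def map_center(rows):
    """Return the mean (latitude, longitude) of the given points."""
    lats = [r["Latitude"] for r in rows]
    lons = [r["Longitude"] for r in rows]
    return sum(lats) / len(lats), sum(lons) / len(lons)


def build_map(rows, zoom_start=10):
    """Draw one circle marker per neighborhood on a folium map."""
    import folium  # deferred import: only needed when a map is actually built

    nyc_map = folium.Map(location=list(map_center(rows)), zoom_start=zoom_start)
    for r in rows:
        folium.CircleMarker(
            location=[r["Latitude"], r["Longitude"]],
            radius=5,
            popup=f'{r["Neighborhood"]}, {r["Borough"]}',
            color="blue",
            fill=True,
            fill_opacity=0.7,
        ).add_to(nyc_map)
    return nyc_map


# Hypothetical sample rows with approximate coordinates.
sample_rows = [
    {"Borough": "Manhattan", "Neighborhood": "Chinatown",
     "Latitude": 40.7156, "Longitude": -73.9943},
    {"Borough": "Queens", "Neighborhood": "Flushing",
     "Latitude": 40.7645, "Longitude": -73.8297},
]
center = map_center(sample_rows)
print(center)
```

With folium installed, `build_map(sample_rows).save("nyc_map.html")` would write an interactive map to an HTML file.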



**Foursquare is used to locate all venues, which are then filtered down to Chinese restaurants. Ratings, tips, and likes by users are counted and added to the dataframe.**

<img src="https://miro.medium.com/max/1400/1*PJkLcIBGVj-1-nZsAnUTWQ.png">
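
The venue lookup and filtering could look roughly like the sketch below. It assumes the legacy Foursquare v2 `venues/explore` endpoint and its nested `response.groups[0].items[].venue` layout, both of which should be checked against the current Foursquare documentation. The helper names, the placeholder credentials, and the hand-written sample response are hypothetical.

```python
from urllib.parse import urlencode

# Assumed legacy Foursquare v2 endpoint; verify against current documentation.
EXPLORE_URL = "https://api.foursquare.com/v2/venues/explore"


def explore_url(lat, lng, client_id, client_secret,
                version="20200609", radius=1000, limit=100):
    """Build the request URL for venues around one neighborhood."""
    params = {
        "client_id": client_id,
        "client_secret": client_secret,
        "v": version,
        "ll": f"{lat},{lng}",
        "radius": radius,
        "limit": limit,
    }
    return f"{EXPLORE_URL}?{urlencode(params)}"


def chinese_restaurants(response_json):
    """Keep only venues whose primary category is 'Chinese Restaurant'."""
    groups = response_json.get("response", {}).get("groups", [])
    items = groups[0].get("items", []) if groups else []
    matches = []
    for item in items:
        venue = item.get("venue", {})
        categories = venue.get("categories", [])
        category = categories[0]["name"] if categories else None
        if category == "Chinese Restaurant":
            matches.append({"ID": venue.get("id"), "Name": venue.get("name")})
    return matches


# Hand-written stand-in for a parsed API response (hypothetical IDs and names).
sample_response = {
    "response": {"groups": [{"items": [
        {"venue": {"id": "a1", "name": "Example Dumpling House",
                   "categories": [{"name": "Chinese Restaurant"}]}},
        {"venue": {"id": "b2", "name": "Example Pizzeria",
                   "categories": [{"name": "Pizza Place"}]}},
    ]}]}
}

url = explore_url(40.7156, -73.9943, "YOUR_ID", "YOUR_SECRET")
print(url)
print(chinese_restaurants(sample_response))
```

With the `requests` library, the live response would be fetched with `requests.get(url).json()` and passed to `chinese_restaurants`.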

**We then sorted the data and calculated the average rating.**

<img src="https://miro.medium.com/max/1050/1*XbEAlAHUuNPA4ItnLFiIeA.png">
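
A minimal sketch of the averaging and sorting step is shown below. The records, their ratings, and the helper `average_rating_by` are made up for illustration and do not reproduce the project's data.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records; the ratings are invented for illustration.
restaurants = [
    {"Borough": "Manhattan", "Neighborhood": "Chinatown", "Rating": 8.5},
    {"Borough": "Manhattan", "Neighborhood": "Chinatown", "Rating": 7.5},
    {"Borough": "Queens", "Neighborhood": "Flushing", "Rating": 9.0},
    {"Borough": "Brooklyn", "Neighborhood": "Sunset Park", "Rating": 7.0},
]


def average_rating_by(records, key):
    """Return (group, mean rating) pairs sorted from best to worst."""
    ratings = defaultdict(list)
    for record in records:
        ratings[record[key]].append(record["Rating"])
    averages = [(group, mean(values)) for group, values in ratings.items()]
    return sorted(averages, key=lambda pair: pair[1], reverse=True)


by_neighborhood = average_rating_by(restaurants, "Neighborhood")
for name, avg in by_neighborhood:
    print(f"{name}: {avg:.2f}")
```

Passing `"Borough"` as the key gives the per-borough averages. On a pandas dataframe, the equivalent is `df.groupby("Neighborhood")["Rating"].mean().sort_values(ascending=False)`.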

**Finally, the data will be assessed visually using bar graphs and maps from Python libraries such as matplotlib and folium. These are used in the Results section.**


## Problem Statement

1. What is the best location for Chinese cuisine in New York City?
2. In what neighborhood and/or borough should I open a Chinese restaurant to have the best chance of success?
3. Where would I go in New York City to have the best Chinese food?


## Analysis

The full code is available in this <a href="https://github.com/hnawaz007/Final---Battle-of-the-neighorhood-NYC/blob/master/Capstone%20Final%20Project%20%E2%80%93%20The%20Battle%20of%20Neighborhoods%20%20-%20Finding%20a%20Best%20Chines%20Cuisine.ipynb">notebook</a>.






## Results

**1. Manhattan has the fewest neighborhoods of any borough.**
<img src="https://miro.medium.com/max/1400/1*QCM8K5GuAD28JDDF9XhHBQ.png">

**2. Queens has the most Chinese restaurants in NYC.**
<img src="https://miro.medium.com/max/1050/1*CDjZLeJIUOasIcNt83OvHw.png">
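
Counts like the ones behind this chart can be produced with a simple tally. The sketch below uses a handful of made-up (borough, restaurant) pairs, so the numbers are illustrative only.

```python
from collections import Counter

# Hypothetical (borough, restaurant) pairs, not the project's data.
venues = [
    ("Queens", "Restaurant A"),
    ("Queens", "Restaurant B"),
    ("Queens", "Restaurant C"),
    ("Brooklyn", "Restaurant D"),
    ("Brooklyn", "Restaurant E"),
    ("Manhattan", "Restaurant F"),
]

# Tally restaurants per borough and rank from most to fewest.
counts = Counter(borough for borough, _ in venues)
ranked = counts.most_common()

for borough, n in ranked:
    print(f"{borough}: {n}")
```

The `ranked` pairs can be fed straight into a bar chart, with boroughs on the x-axis and counts on the y-axis.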

**3. The top six neighborhoods are spread across Queens, Manhattan, and Brooklyn. Chinatown in Manhattan has the most Chinese restaurants, with 10.**
<img src="https://miro.medium.com/max/1050/1*lRZwYhBFLLYyvg3cdyP4sQ.png">

**4. Despite having the fewest neighborhoods, Manhattan has the highest-rated restaurants, with an average rating above 8.**
<img src="https://miro.medium.com/max/1050/1*gGD5CN4-elJihrog0dDIkA.png">

**5. Manhattan clearly has the highest-rated Chinese restaurants of all the boroughs. This makes the competition tough and gives consumers lots of options to choose from.**
<img src="https://miro.medium.com/max/1050/1*dh84q7dTf0bzP5jUeyVhfg.png">

**6. Boroughs like the Bronx and Staten Island offer less competition, as they have fewer Chinese restaurants than the other boroughs.**
<img src="https://miro.medium.com/max/1050/1*7g0wvyaqXIrEID1qlpNcKw.png">

**7. The top five Chinese restaurants in NYC.**
<img src="https://miro.medium.com/max/767/1*DwgGW6gFtI5mjr1X5ikWNw.png">

**8. Chinese restaurants open during the pandemic, per Eater NY.**
<img src="https://miro.medium.com/max/678/1*jkA-KuelpheYD77HKRZ40A.png">




## Conclusion

Manhattan and Queens have the best-rated Chinese restaurants on average. Queens slightly edges out Brooklyn in the ratings. Staten Island and the Bronx have the fewest Chinese restaurants of any borough. It is no surprise that Manhattan’s Chinatown has the highest number of Chinese restaurants in NYC. Surprisingly, Little Neck takes the lead among Queens neighborhoods, where the obvious choice would have been Flushing.

Although Manhattan has the fewest neighborhoods, it ranks third in the number of Chinese restaurants (behind Queens and Brooklyn). Based on this information, I would state that Manhattan is the best location for Chinese cuisine in NYC. Rent and other factors notwithstanding, Manhattan could also be considered the best location to open a new Chinese restaurant: most of its neighborhoods have average ratings exceeding 8.0 on a scale of 1.0 to 10.0, yet it still trails Brooklyn and Queens in the number of Chinese restaurants, which makes competition easier than in those two boroughs.

Finally, for the best Chinese food I would go to Buddakan in Chelsea, Manhattan, based on its 1,488 likes. As a final note, all of the above analysis depends on the accuracy of the Foursquare data. A more comprehensive analysis would need to incorporate data from other external databases; this is left for future work.


#### References
Census.gov (2020/06/09) NYC population
https://www.census.gov/quickfacts/newyorkcitynewyork

Macaulay.cuny.edu (2020/06/09) Manhattan Chinese population
https://eportfolios.macaulay.cuny.edu/beemanneighborhoods/timelinehistory/

Foursquare API
https://foursquare.com/

Eater NY (restaurants open during the pandemic)
https://ny.eater.com/maps/nyc-chinese-restaurant-delivery-takeout


#### Libraries Used to Develop the Project

> Pandas: For creating and manipulating dataframes.
>
> Folium: Python visualization library used to display the distribution of neighborhood clusters on an interactive Leaflet map.
> 
> Scikit Learn: For importing k-means clustering.
>
> JSON: Library to handle JSON files.
>
> XML: To separate data from presentation; XML stores data in plain-text format.
> 
> Geocoder: To retrieve Location Data.
> 
> Beautiful Soup and Requests: To scrape web pages and handle HTTP requests.
> 
> Matplotlib: Python plotting module.

<br/>
Blog Post Link: https://medium.com/@hnawaz100/a-data-scientists-guide-to-the-best-chinese-cuisine-in-nyc-4518cb18fdc8