Clustering and Comparing the Neighborhoods of New York City and Toronto
This is the repository for the capstone project of IBM Data Science Professional Certificate. In this project, the neighborhoods of New York City and Toronto are clustered into groups of similar neighborhoods where the similarity is based on the categories (types) of venues in the neighborhoods.
Foursquare API was used to retrieve data on the venues of the neighborhoods and their categories.
Files in this Repository
This repository contains the following files:
Report.pdffile is the PDF report of the project. It contains detailed explanation of the stages of the project.
Notebook.ipynbfile is the Jupyter notebook that contains all the code used to conduct the project.
newyork_data.jsonfile is needed when running the notebook code.
You can also read my blog post about this project here.