This repository contains an analysis done on the AirBnB listings in 2019 in New York City. Follwing the CRISP-DM approach, I have identified some potential business queries and come up with the corressponding answers. The project gives you an insight about the distribution of properties both in terms of price and in terms of the type of accomodations in different neighbourhood groups in New York City. In my project, I will be answering the following questions.
- Identifying the hosts with the highest number of listings in the City.
- What are the average listing prices and the distribution of these prices in different neighbourhoods.
- Analysing variations or percentage differences in Room types in different neighborhoods.
- What are the potential hotspots if you are looking to list an AirBnB.
The following modules have been used in the work done in this Project.
- Pandas: The module is used for data manipulation.
- Matplotlib and Seaborn: For data visualizations.
- Geopandas and Shapely: For converting and plotting Geopandas Dataframes
The repository consists of a total of 4 Files.
- This Readme file.
- The Ipython Notebook NYC_Airbnb_Udacity. The files contains all the analysis and answers the questions discussed above.
- The dataset used in the analysis AB_NYC_2019.csv
- A CSV nynta.csv file containing the geographical mappings of all the neighborhoods of New York City.
- Udacity: This project is part of the data science nano degree I am doing in collaboration with Udacity.
- Kaggle Dataset: The link to the AirBnB dataset.
- City of Newyork Neighbourhood Mappings: This is where the neighbourhood mappings used in the analysis are taken from.
The results of the anaylsis and conclusions can be found in the following article. Muneeb Aizaz AirBnB NYC report