Skip to content

Real estate insights project, using a dataset from Airbnb, in New York, to help the company's CEO to evaluate the behavior of the prices of properties available in the city.

Notifications You must be signed in to change notification settings

m4theus4ndr4de/insights-real-state-rent

Repository files navigation

logo

Real State Rent

This is a fictional project for studying purposes. The context, questions and people involved are fictional. The dataset used in this project is from Kaggle and it is available there.

1. Description of the Business Problem

The CEO from a company would like to diversify his business and start investing in real estate. He defined that he would buy real estate in New York City, in the United States. As it is one of the most expensive places to live in the country, he believes that he will obtain a satisfactory return on his investments if he rents properties in the city. Since all of his decisions are based on data, he wants someone to help him in this endeavor.

He initially plans to lease the properties acquired and for that reason he defined that he will use the Airbnb platform for this purpose. He found a dataset on the internet containing data on the behavior of hosts and their properties in New York City.

He wants help to find out the regions with the highest rentals and highest prices and which apartments or houses are in favorable regions of New York City. He believes these characteristics will help him to recover the money invested in the acquisition of some properties more quickly. The challenge is to conduct an exploratory analysis and answer the questions posed by the CEO.

2. Dataset Attributes

Information about the atrributes can be found here.

AttributeDescription
idUnique ID for each house/apt
host_idID of the host of the house/apt
host_nameName of the host
neighbourhood_groupName of the borough the property is located in
neighbourhoodName of the neighbourhood the property is located in
latitudeProperty latitude
longitudeProperty longitude
room_typeType of property being rented, may be the entire house/apt, a private room or a shared room
pricePrice to pay for the rent
minimum_nightsMinimum quantity of nights someone can rent a property
number_of_reviewsNumber of reviews that property received
last_reviewThe date that specific property was reviewed for the last time
reviews_per_monthAverage number of times that property is reviewed per month
calculated_host_listings_countNumber of properties the host from a specific property is responsible for in the dataset
availability_365Number of days in the next 365 days that property is available to be rent

New York City Boroughs

logo

Properties Distribution in New York City

logo

3. Business Premises

The premises that were assumed for the development of the business problem solution are:

  • Every property rented was reviewed once the contract has come to an end.
  • Every property has been avilable since the date of the last review.
  • Rows with NA values in the 'last_review' column represents properties that were never rented or reviewed.
  • The foundation of Airbnb were considered to fill the NA values in the 'last_review' column.

4. Solution Strategy

  1. Download the dataset from Kaggle.
  2. Understand the business problem.
  3. Clean, analyse and explore the dataset using data science packages in Python.
  4. Propose a few insights about the dataset and test them.
  5. Answer the questions from the CEO. The questions and answers can be found here.

5. The Insights

I1: Private rooms are more than 20% more expensive than shared rooms.

True: Private rooms are 28% more expensive than shared rooms.

I2: Most neighbourhoods among the 10 with the highest prices and with more than 20 houses/apts are in Manhattan.

True: The first one taht is not in Manhattan is in Brooklyn and it is the 15th.

I3:Properties that were never reviewed are 10% cheaper than those at least once reviewed.

False: The average price of the properties that were never reviewed are greater than the price of those properties that were reviewed at least once.

6. Conclusion

The questions from the CEO that motivated this project were answered. Some of the answers can lead to future profit. Based on data, the CEO can take further steps on the real state market.

7. Future Work

  • Create more insights.
  • Develop a machine learning model to predict the rest price of a property.
  • Develop a machine learning model to predict if a property with certain characteristics should be bought or not.

About

Real estate insights project, using a dataset from Airbnb, in New York, to help the company's CEO to evaluate the behavior of the prices of properties available in the city.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published