GitHub - cghimire/Predicting-Airbnb-Listing-Price: This project aims to use machine learning models to predict the base price for properties, and also to explore Airbnb listing data, in order to help Airbnb hosts maximize their earnings.

Predicting Airbnb listing Price

This project aims to use machine learning models to predict the base price for properties, and also to explore Airbnb listing data, in order to help Airbnb hosts maximize their earnings.

🧐 About

Airbnb is a home-sharing platform that allows home-owners and renters ('hosts') to put their properties ('listings') online, so that guests can pay to stay in them. Hosts are expected to set their own prices for their listings. Although Airbnb and other sites provide some general guidance, there are currently no free services which help hosts price their properties.

Airbnb pricing is important to get right, particularly in big cities where there is lots of competition and even small differences in prices can make the difference between optimum occupancy and high earnings, or being priced out of the market. It is also a difficult thing to do correctly, in order to balance the price with occupancy (which varies inversely with price) in order to maximise revenue.

This project aims to use machine learning models to predict the base price for properties, and also to explore Airbnb listing data, in order to help Airbnb hosts maximize their earnings.

🎈 Data Exploration and Preparation

Metadata of the data which contains 20677 records (rows) and 106 features (columns). There are some missing values and I applied imputation method to handle those missing values. After I cleaned the date, dimension of the data is reduced to 13837 rows and 24 columns

From the above histogram, it can be seen that one column only contain one category and can be dropped

We see the distribution for pricing is strongly skewed right. This makes sense as a majority of the listings on Airbnb are single individual listings and Airbnb does strongly cater to travelers who are looking for cheaper places to stay for short durations of time.

Correlation:

Correlation matrix shows that price column has positive correlation with beds, bedroom, and accommodates but are not highly correlated (less than 0.50).

🚀 Data Modeling and Model Evaluation

I used Random Forest and Neural networks model to predict listing price

The easier metric to understand is the mean absolute error, this means that our predictions were perfect but on average 39.28 away from the true prediction with the random forest model. Our model’s MAE is 39.28, which is fairly small given that our data’s PRICE range from 0 to about 2000.

RMSE is the difference between model predictions and true values. We get the RMSE value about 76: which means this model is better to predict the airbnb price. We can create models with different hyperparameters tuning to try and boost performance.

Feature Selection:

Based on my random forest regtression model, we can see that the amenities and accommodates are top 2 important features to predict price. Which makes sence because these two features are important to determine the listing price.

Neural Network

The score return the coefficient of determination R^2 of the prediction. R square compares the fit of the chosen model with that of a horizontal straight line (the null hypothesis). If the chosen model fits worse than a horizontal line, then R square is negative. So, this is not a best fit model to predict the listing price.

Future Plan

Due to time constraints, I couldn't able to do indepth analysis of all the features. If I get a chance to do further analysis in the future, I will perform some others ML models such as XGBoost to compare the best fit model to predict listing price. I tried to use XGBoost, but I got error while importing XGBoost library in my MacOS.

To further improve our models, I could include more feature engineering, for example time-based features.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
Figures		Figures
Airbnb EDA Project - Price Prediction.ipynb		Airbnb EDA Project - Price Prediction.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting Airbnb listing Price

📝 Table of Contents

🧐 About

🎈 Data Exploration and Preparation

Correlation:

🚀 Data Modeling and Model Evaluation

Feature Selection:

Neural Network

Future Plan

About

Releases

Packages

Languages

cghimire/Predicting-Airbnb-Listing-Price

Folders and files

Latest commit

History

Repository files navigation

Predicting Airbnb listing Price

📝 Table of Contents

🧐 About

🎈 Data Exploration and Preparation

Correlation:

🚀 Data Modeling and Model Evaluation

Feature Selection:

Neural Network

Future Plan

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages