Skip to content

guillemforto/airbnb-price-prediction

Repository files navigation

Airbnb Price Prediction     airbnb_logo.png

In view of the number of housing units in Paris, how to define a fair price according to the location, the equipment or the capacity of a unit?

Context

Since its inception in 2008, the number of Airbnb rentals listed on its website has grown exponentially each year. In France, no city is more popular than Paris. This means that Paris is one of the most popular markets for Airbnb in Europe, with more than 60 000 homes listed in 2018.

Since tree-based models such as XGBoost or CatBoost have proven to be the new best-in-class techniques in recent years, the main challenge of this competition was to find good features to predict the price.
Unsurprisingly, some of these turned up to be the neighborhood, the number of accommodates and the number of bedrooms.
If you want more details of what was done, feel free to check out the keynote presentation of this repository: keynote_pres.pdf.

This project was part of an in-class Kaggle competition, for the Quantitative marketing course at Toulouse School of Economics.

Installation of libraries

To run the code, you'll need Python 3 installed with pip.
Here is a non-exhaustive list of dependencies you might want to install:

pip install xgboost
pip install catboost
pip install tensorflow
pip install scikit-learn

Running the code

Start by downloading the necessary data files:

You're then ready to run the notebook main_notebook.ipynb, which contains all the code to build the engineered features and to train the models. You'll need Jupyter notebook (>= v. 6.0.3) to run it.

About

🏡 Predicting prices of Paris Airbnb accommodations

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published