Skip to content

Medium article identifying trends in Seattle's Airbnb listing prices and trying to predict prices using linear regression.

Notifications You must be signed in to change notification settings

jhmarlow/predicting-airbnb-prices-in-seattle

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Airbnb Seattle Analysis Blog Post

Udacity DataScience NanoDegree Project 1

This repo contains the code used to produce the data analysis and interactive visualistions for a blog post on Medium.

Business Understanding

Airbnb has become the go-to marketplace when trying to find a place to stay. However, it can sometimes be overwhelming when faced with a mass of choices, and numerous factors to filter against. In the article, we take a look at Seattle's Airbnb ecosystem. Looking at how location can affect price, how prices change throughout the year, as well as the type of listings available these neighbourhoods.

Questions:

  1. How does distance to the city centre affect listings?
  2. What is the best time of year to go?
  3. What type of listings are available in Seattle?
  4. How are the listings managed in Seattle?
  5. Can we predict the Airbnb prices using factors assessed?

Data Understanding

The data was provided from the Kaggle project (https://www.kaggle.com/airbnb/seattle/data) and the opensource datasets provided by airbnb (http://insideairbnb.com/get-the-data.html)

The data made available was:

  • Calendar.csv - listing prices across the year
  • Listings.csv - property listings available through Airbnb in Seattle in 2016
  • Reviews.csv - reviews left for listings

Modelling and Preparation

Data cleaning was performed, including:

ML Features:

  • neighbourhood_group_cleansed
  • host_is_superhost
  • room_type
  • bathrooms
  • bedrooms
  • beds
  • review_scores_rating
  • review_scores_cleanliness
  • review_scores_location
  • review_scores_value
  • reviews_per_month

Target:

  • price

Modelling

As part of the evaluation a simple Linear Regression Machine Learning Model was created using the sklearn library to assess the usage of reviews to predict the price associated with the listing.

R^2 score of 54%

Evaluation

To review the output please look at the medium article here.

Deployment

test

Acknowledgement

This was completed as Project 1 of Udacity's Datascience Nanodegree

About

Medium article identifying trends in Seattle's Airbnb listing prices and trying to predict prices using linear regression.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published