# AIRBNB PROJECT

## CRISP-DM process

CRISP-DM stands for Cross-Industry Standard Process for Data Mining.

Phases:
- Business Understanding
- Data Understanding
- Data Preparation
- Model Development
- Evaluation
- Deployment

## Project Details/Instructions & Deliverables

'''
<!-- Key Steps for the Project
Feel free to be creative with your solutions, but do follow the CRISP-DM process in finding your solutions.

Pick a dataset, as mentioned on the previous page.

Pose at least three questions related to business or real-world applications of how the data could be used.

Create a Jupyter Notebook, using any associated packages you'd like, to:

Prepare data:
Gather necessary data to answer your questions
Handle categorical and missing data
Provide insight into the methods you chose and why you chose them
Analyze, Model, and Visualize
Provide a clear connection between your business questions and how the data answers them
Communicate your business insights:
Create a Github repository to share your code and data wrangling/modeling techniques, with a technical audience in mind
Create a blog post to share your questions and insights with a non-technical audience
Project Deliverables
There are two deliverables that are required for project completion.

A Github repository for your code.
A blog post of your findings.
Your Github repository must have the following contents:

A README.md file that communicates the libraries used, the motivation for the project, the files in the repository with a small description of each, a summary of the results of the analysis, and necessary acknowledgments.
Your code in a Jupyter notebook, with appropriate comments, analysis, and documentation.
You may also provide any other necessary documentation you find necessary.
For the blog post, pick a platform of your own choice. For example, it can be on your website, a Medium post (Josh's sample report on How Do YOU Become A Developer?), or a Github blog post. Your blog must provide the following:

A clear and engaging title and image.
Your questions of interest.
Your findings for those questions with a supporting statistic(s), table, or visual.
Note: The post should not dive into technical details or difficulties of the analysis; this should be saved for Github. The post should be understandable for non-technical people from many fields. -->
'''

## Phases/Process

- Business Understanding
    * Motivation

      I plan to rent an apartment in NYC (Manhattan) for $\$3500$ a month. I would prefer to live alone, and I expect to travel at least one weekend every month. Using the public Airbnb data, I want to find out whether I can make at least $\$500$ a month by listing the apartment on Airbnb on those weekends, so that my monthly out-of-pocket rent is $\$3000$.

    * Questions
        * How much do studios in NYC make on weekends?
        * What areas make the most?
        * What factors are most influential for pricing?
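
The target in the motivation can be turned into a nightly rate to compare listings against. This is a minimal sketch, not part of the original analysis. It assumes that one weekend means two bookable nights (Friday and Saturday) and that only one weekend per month is hosted, and it ignores Airbnb fees, cleaning costs, and taxes.

```python
# Figures taken from the motivation above
monthly_rent = 3500
target_out_of_pocket = 3000

# Income the listing has to bring in each month
required_income = monthly_rent - target_out_of_pocket

# Assumptions (not from the data): 2 nights per weekend, 1 weekend hosted per month
nights_per_weekend = 2
weekends_hosted = 1

# Nightly rate needed to reach the target, before fees and taxes
required_nightly_rate = required_income / (nights_per_weekend * weekends_hosted)
print(required_income, required_nightly_rate)  # 500 250.0
```

Hosting more weekends lowers the required rate proportionally, so `weekends_hosted` is the main parameter to vary.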

* Data Understanding

In [54]:
# Import the required modules and read the data files

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import Normalizer
from sklearn.model_selection import train_test_split
from sklearn.metrics import r2_score, mean_squared_error
from sklearn.pipeline import Pipeline
%matplotlib inline

# Listing and review summaries, plus the neighbourhoods file
l_sum = pd.read_csv('../Data/summary_listings.csv')
r_sum = pd.read_csv('../Data/summary_reviews.csv')
n_sum = pd.read_csv('../Data/neighbourhoods.csv')
# Raw listings file
listing = pd.read_csv('../Data/Raw Data/listings.csv')
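
A common first step in data understanding is to check each frame's size, column types, and share of missing values. The sketch below is one possible way to do that. `summarize_frame` is a hypothetical helper that is not in the original notebook, and the toy frame only stands in for `listing`.

```python
import numpy as np
import pandas as pd

def summarize_frame(df: pd.DataFrame) -> pd.DataFrame:
    """Return one row per column with its dtype and share of missing values."""
    summary = pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "missing_share": df.isna().mean(),
    })
    # Columns with the most missing values come first
    return summary.sort_values("missing_share", ascending=False)

# Toy data for illustration only; in the notebook, pass `listing` instead
toy = pd.DataFrame({
    "id": [1, 2, 3, 4],
    "license": [np.nan, np.nan, "X", np.nan],
    "reviews_per_month": [0.27, 1.04, np.nan, 1.37],
})
summary = summarize_frame(toy)
print(toy.shape)
print(summary)
```

Columns that are mostly empty are candidates for dropping during data preparation, while sparsely missing ones may be worth imputing.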

In [56]:
listing.head(10)

Unnamed: 0,id,listing_url,scrape_id,last_scraped,source,name,description,neighborhood_overview,picture_url,host_id,...,review_scores_communication,review_scores_location,review_scores_value,license,instant_bookable,calculated_host_listings_count,calculated_host_listings_count_entire_homes,calculated_host_listings_count_private_rooms,calculated_host_listings_count_shared_rooms,reviews_per_month
0,2595,https://www.airbnb.com/rooms/2595,20241104040953,2024-11-04,city scrape,Skylit Midtown Castle Sanctuary,"Beautiful, spacious skylit studio in the heart...",Centrally located in the heart of Manhattan ju...,https://a0.muscache.com/pictures/miso/Hosting-...,2845,...,4.8,4.81,4.4,,f,3,3,0,0,0.27
1,6848,https://www.airbnb.com/rooms/6848,20241104040953,2024-11-04,city scrape,Only 2 stops to Manhattan studio,Comfortable studio apartment with super comfor...,,https://a0.muscache.com/pictures/e4f031a7-f146...,15991,...,4.8,4.69,4.58,,f,1,1,0,0,1.04
2,6872,https://www.airbnb.com/rooms/6872,20241104040953,2024-11-04,city scrape,Uptown Sanctuary w/ Private Bath (Month to Month),This charming distancing-friendly month-to-mon...,This sweet Harlem sanctuary is a 10-20 minute ...,https://a0.muscache.com/pictures/miso/Hosting-...,16104,...,5.0,5.0,5.0,,f,2,0,2,0,0.03
3,6990,https://www.airbnb.com/rooms/6990,20241104040953,2024-11-04,city scrape,UES Beautiful Blue Room,Beautiful peaceful healthy home,"Location: Five minutes to Central Park, Museum...",https://a0.muscache.com/pictures/be6cd5b3-9295...,16800,...,4.95,4.85,4.85,,f,1,0,1,0,1.37
4,7064,https://www.airbnb.com/rooms/7064,20241104040953,2024-11-04,previous scrape,"Amazing location! Wburg. Large, bright & tranquil","Large, private loft-like room in a spacious 2-...","- One stop from the East Village, Lower East S...",https://a0.muscache.com/pictures/13708959/7e74...,17297,...,5.0,5.0,5.0,,f,2,0,2,0,0.08
5,7097,https://www.airbnb.com/rooms/7097,20241104040953,2024-11-04,city scrape,"Perfect for Your Parents, With Garden & Patio",Parents/grandparents coming to town or are you...,"Residential, village-like atmosphere. Lots of ...",https://a0.muscache.com/pictures/aaac19fc-4b4d...,17571,...,4.93,4.95,4.82,OSE-STRREG-0000008,t,2,0,2,0,2.16
6,7801,https://www.airbnb.com/rooms/7801,20241104040953,2024-11-04,city scrape,Sunny Williamsburg Loft with Sauna,A huge loft in a repurposed factory building i...,We've lived here for over 15 years and love Wi...,https://a0.muscache.com/pictures/miso/Hosting-...,21207,...,4.78,5.0,4.89,,f,1,1,0,0,0.07
7,8490,https://www.airbnb.com/rooms/8490,20241104040953,2024-11-04,city scrape,"Maison des Sirenes1,bohemian, luminous apartment",Soak up the modern and vintage charm<br />of t...,,https://a0.muscache.com/pictures/1d0d9773-c829...,25183,...,4.88,4.67,4.76,,f,2,2,0,0,1.03
8,9357,https://www.airbnb.com/rooms/9357,20241104040953,2024-11-04,city scrape,Midtown Pied-a-terre,PLEASE DO NOT REQUEST TO BOOK UNTIL WE HAVE ME...,Quiet residential block near many restaurants ...,https://a0.muscache.com/pictures/90036/4e60665...,30193,...,5.0,4.95,4.58,,f,1,1,0,0,0.32
9,10452,https://www.airbnb.com/rooms/10452,20241104040953,2024-11-04,city scrape,Radiant Oasis B&B Style room,Great location.,Great neighborhood with lots of restaurants an...,https://a0.muscache.com/pictures/9d79499f-7319...,35935,...,4.85,4.42,4.67,,f,6,0,6,0,0.46
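
The preview elides most columns, so the pricing fields are not visible here. As a rough sketch of a next step toward the pricing questions, the code below assumes (the preview does not confirm this) that the listings data has a `price` column stored as strings such as `"$1,234.00"` and a `neighbourhood_group_cleansed` column. `parse_price` is a hypothetical helper, and the rows are toy values, not real listings.

```python
import pandas as pd

def parse_price(prices: pd.Series) -> pd.Series:
    """Convert strings such as "$1,234.00" to floats; unparseable values become NaN."""
    cleaned = prices.astype(str).str.replace(r"[$,]", "", regex=True)
    return pd.to_numeric(cleaned, errors="coerce")

# Toy rows for illustration only; the column names are assumptions
toy = pd.DataFrame({
    "neighbourhood_group_cleansed": ["Manhattan", "Manhattan", "Brooklyn", "Brooklyn"],
    "price": ["$240.00", "$1,100.00", "$95.00", None],
})
toy["price_num"] = parse_price(toy["price"])

# Median nightly price per area; missing prices are skipped
median_price = toy.groupby("neighbourhood_group_cleansed")["price_num"].median()
print(median_price)
```

The median is used here rather than the mean because a few very expensive listings can pull the mean well above what a typical listing charges.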
