<a href="https://colab.research.google.com/github/piyushbg/AirBnb/blob/main/AirBnb_booking_analysis.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# **Project Name**    - AirBnb Bookings Analysis



##### **Project Type**    - EDA
##### **Contribution**    - Individual
##### **Team Member 1 - Piyush Sanjay Bagul**


# **Project Summary -**

Airbnb has been utilized by both hosts and guests to create personalized and unique travel experiences, leading to its recognition as a one-of-a-kind service worldwide. The vast amount of data generated by millions of listings on the platform is crucial for Airbnb's operations. This data can be analyzed to inform decision-making on security, business strategies, customer and host behavior, marketing initiatives, and the implementation of innovative services, among other things. The dataset used for analysis contains approximately 49,000 observations across 16 columns, with a mix of categorical and numeric values.

I conducted an in-depth exploratory analysis of the dataset to identify ways in which Airbnb could enhance its business. I began by performing data cleaning tasks such as removing duplicate values, handling outliers, and dealing with missing values. I then visualized the data and identified several key insights by examining the relationships among the variables. Based on these insights, I developed potential solutions that could help Airbnb improve its business.

The solutions I have put forward are based on my analysis and understanding of the dataset. To gain a better understanding of customer behavior, additional variables such as ratings and user reviews would be beneficial. In addition, attributes such as amenities could also have a significant impact on customer behavior. Overall, the dataset contains a wealth of attributes that can be analyzed beyond the scope of my findings.

# **GitHub Link -**

https://github.com/piyushbg/AirBnb

# **Problem Statement**


Since its inception in 2008, Airbnb has changed the way guests and hosts travel and experience the world, and it is now used by people all over the world. With millions of listings on its platform, data analysts have become a crucial factor in Airbnb's success. The data generated by these listings can be analyzed to enhance security, inform business decisions, understand customer and provider behavior, introduce new services, guide marketing initiatives, and much more. With competitors such as OYO in the market, Airbnb needs to keep improving to hold its position.

The dataset used in this analysis consists of approximately 49,000 observations (48,895 rows) with 16 columns, containing a mix of categorical and numerical values.





#### **Define Your Business Objective?**

#  To make informed business decisions by gaining insights into the behavior of both customers and providers on the platform, or during their stays.

# **General Guidelines** : -  

1.   Well-structured, formatted, and commented code is required. 
2.   Exception Handling, Production Grade Code & Deployment Ready Code will be a plus. Those students will be awarded some additional credits. 
     
     The additional credits will have advantages over other students during Star Student selection.
       
             [ Note: - Deployment Ready Code is defined as, the whole .ipynb notebook should be executable in one go
                       without a single error logged. ]

3.   Each and every logic should have proper comments.
4. You may add as many charts as you want. Make sure the following format is answered for each and every chart.
        

```
# Chart visualization code
```
            

*   Why did you pick the specific chart?
*   What is/are the insight(s) found from the chart?
* Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

5. You have to create at least 20 logical & meaningful charts having important insights.


[ Hints : - Do the visualization in a structured way while following the "UBM" rule.

U - Univariate Analysis,

B - Bivariate Analysis (Numerical - Categorical, Numerical - Numerical, Categorical - Categorical)

M - Multivariate Analysis
 ]
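
As a rough illustration of the UBM order, the sketch below builds the summary tables that a univariate, a bivariate, and a multivariate chart could be drawn from. The `sample` frame and its values are made up for illustration; only the column names follow the dataset used in this notebook.

```python
import pandas as pd

# Hypothetical rows; values are made up for illustration.
sample = pd.DataFrame({
    "neighbourhood_group": ["Manhattan", "Brooklyn", "Manhattan",
                            "Queens", "Brooklyn", "Manhattan"],
    "room_type": ["Entire home/apt", "Private room", "Private room",
                  "Private room", "Entire home/apt", "Entire home/apt"],
    "price": [200, 80, 120, 60, 150, 250],
})

# U: one variable at a time, e.g. number of listings per area.
univariate = sample["neighbourhood_group"].value_counts()

# B: a numerical variable against a categorical one,
# e.g. median price per room type.
bivariate = sample.groupby("room_type")["price"].median()

# M: three variables together, e.g. mean price by area and room type.
multivariate = sample.pivot_table(
    index="neighbourhood_group",
    columns="room_type",
    values="price",
    aggfunc="mean",
)
```

Each table maps naturally onto a chart type: a bar chart for the counts, a bar or box plot for price by room type, and a heatmap for the pivot table.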





# ***Let's Begin !***

## ***1. Know Your Data***

### Import Libraries

In [None]:
# Import Libraries
import numpy as np
import pandas as pd
import math  # standard-library math; the numpy.math alias is unavailable in recent NumPy releases
from numpy import loadtxt
import seaborn as sns
import matplotlib.pyplot as plt
%matplotlib inline

import warnings
warnings.filterwarnings('ignore') # suppress warnings to keep the notebook output readable


### Dataset Loading

In [None]:
from google.colab import drive
drive.mount('/content/drive')

In [None]:
# Load Dataset
file_path = "/content/drive/MyDrive/Colab Notebooks/AirBnbcap"
data = pd.read_csv(file_path + "/Airbnb NYC 2019.csv")

### Dataset First View

In [None]:
# Dataset First Look
data.head()


In [None]:
data.tail(2)

### Dataset Rows & Columns count

In [None]:
# Dataset Rows & Columns count
data.shape

### Dataset Information

In [None]:
# Dataset Info (info is a method, so it must be called)
data.info()

#### Duplicate Values

In [None]:
# Dataset Duplicate Value Count
data.duplicated().sum()

#### Missing Values/Null Values

In [None]:
# Missing Values/Null Values Count
data.isnull().sum()

In [None]:
# Visualizing the missing values
sns.heatmap(data.isnull(), cbar=False)

### What did you know about your dataset?

 1. The dataset has 48,895 rows and 16 columns, with a mix of categorical and numerical values.

 2. neighbourhood_group, neighbourhood, and room_type are categorical columns.

 3. id, latitude, longitude, price, minimum_nights, number_of_reviews, reviews_per_month, calculated_host_listings_count, and availability_365 are numerical columns; last_review holds dates.

 4. There are no duplicate rows, but the last_review and reviews_per_month columns contain a large number of null values.
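
One thing worth checking before treating these nulls is whether they simply belong to listings that have never been reviewed. The sketch below shows that check and one possible treatment on a made-up `sample` frame; the values are hypothetical, and filling `reviews_per_month` with 0 is an assumption about how to handle the gaps, not a step taken from this notebook.

```python
import numpy as np
import pandas as pd

# Hypothetical miniature of the dataset; values are made up for illustration.
sample = pd.DataFrame({
    "number_of_reviews": [9, 0, 45, 0],
    "last_review": ["2018-10-19", None, "2019-05-21", None],
    "reviews_per_month": [0.21, np.nan, 0.38, np.nan],
})

# Count nulls per column, in the same way as data.isnull().sum().
null_counts = sample.isnull().sum()

# Check whether the nulls coincide with listings that have no reviews.
unreviewed = sample["number_of_reviews"] == 0
nulls_match_unreviewed = bool(
    (sample["reviews_per_month"].isnull() == unreviewed).all()
)

# One possible treatment (an assumption): a listing with no reviews has
# 0 reviews per month, and last_review becomes a datetime column in which
# missing dates are stored as NaT.
cleaned = sample.copy()
cleaned["reviews_per_month"] = cleaned["reviews_per_month"].fillna(0)
cleaned["last_review"] = pd.to_datetime(cleaned["last_review"])
```

If the check holds on the real data, the nulls carry information (no reviews yet) rather than indicating a data-quality problem, which argues for filling rather than dropping those rows.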

## ***2. Understanding Your Variables***

In [None]:
# Dataset Columns
data.columns

In [None]:
# Dataset Describe
data.describe()

In [None]:
len(data["host_name"].unique())

In [None]:
data["neighbourhood_group"].unique()

### Variables Description 

* **id**        - unique ID of the listing
* **name**      - description of the property
* **host_id**   - unique ID of the host
* **host_name** - name of the host. There are 11,452 host names across 48,895 listings, with 21 null values; the gap between the two counts suggests that a single host can own multiple properties.

* **neighbourhood_group** - location. There are 5 unique locations (Manhattan, Brooklyn, Queens, Bronx, Staten Island).

* **neighbourhood** - area within the neighbourhood group
* **longitude**     - longitude of the listing
* **latitude**      - latitude of the listing
* **room_type**     - room type: Private room, Shared room, or Entire home/apt
* **price**         - price of the room
* **minimum_nights** - minimum number of nights that must be paid for
* **number_of_reviews** - total count of reviews for the listing
* **last_review**       - date of the most recent review for the listing
* **reviews_per_month** - number of reviews per month for the listing
* **calculated_host_listings_count** - total number of listings registered under the host
* **availability_365**  - number of days in the year on which the listing is available
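
The suggestion that some hosts own several properties can be checked directly by counting listings per host. The following is a minimal sketch on a made-up `sample` frame (ids and names are hypothetical); it groups on `host_id` rather than `host_name` on the assumption that names can repeat or be missing while ids cannot.

```python
import pandas as pd

# Hypothetical rows; host ids and names are made up for illustration.
sample = pd.DataFrame({
    "id": [1, 2, 3, 4, 5],
    "host_id": [10, 10, 20, 30, 10],
    "host_name": ["Ana", "Ana", "Ben", None, "Ana"],
})

# Listings per host, keyed on host_id because names can repeat or be null.
listings_per_host = (
    sample.groupby("host_id")["id"].count().sort_values(ascending=False)
)

# Hosts that own more than one listing.
multi_listing_hosts = listings_per_host[listings_per_host > 1]

# Distinct names (nunique ignores nulls) versus the number of null names.
distinct_names = sample["host_name"].nunique()
null_names = sample["host_name"].isnull().sum()
```

On the real data, the same counts could be compared with the `calculated_host_listings_count` column as a consistency check.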

### Check Unique Values for each variable.

In [None]:
# Check Unique Values for each variable.
for column_name in data.columns:
    print(f"Unique values in {column_name}: {data[column_name].nunique()}")


## ***3. Data Wrangling***

### Data Wrangling Code

In [None]:
# Write your code to make your dataset analysis ready.

### What all manipulations have you done and insights you found?

Answer Here.
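
As a starting point, the cleaning steps named in the project summary (removing duplicates, handling outliers, dealing with missing values) could look roughly like the sketch below. The function name `clean_listings`, the `"unknown"` placeholder, and the choice of the 1.5 × IQR rule on `price` are all assumptions made for illustration, and the `sample` rows are made up.

```python
import pandas as pd


def clean_listings(df: pd.DataFrame) -> pd.DataFrame:
    """Drop duplicate rows, fill missing text fields, trim price outliers."""
    out = df.drop_duplicates().copy()

    # Fill missing text fields with a placeholder instead of dropping rows.
    for col in ("name", "host_name"):
        if col in out.columns:
            out[col] = out[col].fillna("unknown")

    # Keep prices inside the 1.5 * IQR fences (Tukey's rule).
    q1, q3 = out["price"].quantile([0.25, 0.75])
    iqr = q3 - q1
    in_range = out["price"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)
    return out[in_range].reset_index(drop=True)


# Hypothetical rows: one exact duplicate, two missing names, one extreme price.
sample = pd.DataFrame({
    "name": ["Cozy room", "Cozy room", None, "Loft", "Studio", "Penthouse"],
    "host_name": ["Ana", "Ana", "Ben", None, "Cy", "Dee"],
    "price": [100, 100, 120, 90, 110, 5000],
})
cleaned = clean_listings(sample)
```

The IQR rule is only one way to handle outliers; capping prices at a chosen percentile or analysing the log of the price are alternatives that keep every row.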

## ***4. Data Visualization, Storytelling & Experimenting with charts : Understand the relationships between variables***

#### Chart - 1

In [None]:
# Chart - 1 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 2

In [None]:
# Chart - 2 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 3

In [None]:
# Chart - 3 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 4

In [None]:
# Chart - 4 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 5

In [None]:
# Chart - 5 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 6

In [None]:
# Chart - 6 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 7

In [None]:
# Chart - 7 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 8

In [None]:
# Chart - 8 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 9

In [None]:
# Chart - 9 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 10

In [None]:
# Chart - 10 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 11

In [None]:
# Chart - 11 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 12

In [None]:
# Chart - 12 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 13

In [None]:
# Chart - 13 visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

##### 3. Will the gained insights help creating a positive business impact? 
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 14 - Correlation Heatmap

In [None]:
# Correlation Heatmap visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

#### Chart - 15 - Pair Plot 

In [None]:
# Pair Plot visualization code

##### 1. Why did you pick the specific chart?

Answer Here.

##### 2. What is/are the insight(s) found from the chart?

Answer Here

## **5. Solution to Business Objective**

#### What do you suggest the client do to achieve the Business Objective?
Explain Briefly.

Answer Here.

# **Conclusion**

Write the conclusion here.

### ***Hurrah! You have successfully completed your EDA Capstone Project !!!***