<a href="https://colab.research.google.com/github/fash098/Airbnb-Booking-Analysis/blob/main/EDA_Project_Airbnb_Booking_Analysis_Fathima_Shanavas.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# **Project Name**    -



##### **Project Type**    - EDA (Exploratory Data Analysis)
##### **Contribution**    - Individual/Team


# **Project Summary -**

Since 2008, Airbnb has facilitated a novel and personalized approach to travel for both guests and hosts, which has resulted in its recognition as a one-of-a-kind service worldwide. The company places great importance on data analysis, particularly when it comes to the millions of listings available on its platform. These listings generate an immense amount of data, which can be used to enhance security, make informed business decisions, gain insight into customer and host behavior and performance, and guide marketing efforts, as well as implement innovative additional services. The dataset in question comprises approximately 49,000 observations, with a mix of categorical and numerical values across 16 columns.

Our aim is to explore and analyze the data to discover key understandings (not limited to these) such as :


1) What are the most popular cities for Airbnb bookings?

2) Are there any geographic patterns in Airbnb bookings (e.g. certain neighborhoods or areas that are more popular)?

3) What types of properties (e.g. apartments, houses, private rooms) are most popular among Airbnb guests?

4) Who are some of the best hosts and some insights about their listings?

5) What is the average length of stay for Airbnb guests?

6) Which is the most expensive location? How does price range differ for different locations?


# **GitHub Link -**

Provide your GitHub Link here.

# **Problem Statement**


Pricing Optimization: To understand pricing factors, trends, and demand patterns to optimize listing prices, maximize revenue, and stay competitive.

Revenue Management: Businesses can adjust pricing strategies, optimize inventory allocation, and effectively manage revenue based on booking patterns and demand fluctuations.

Customer Segmentation: To identify customer segments based on demographics, preferences, and booking behavior. This supports targeted marketing, personalized recommendations, and improved customer satisfaction and loyalty.

Property Investment Decisions: Provides valuable insights for property investors regarding profitable opportunities, property acquisition, pricing, and rental management based on factors like location, property type, and amenities.

Market Expansion: To identify untapped markets and areas with high demand by examining booking patterns and customer reviews.

#### **Define Your Business Objective?**

Our objective is to optimize pricing, maximize revenue, and enhance customer satisfaction through data-driven insights from the Airbnb booking dataset.


# **General Guidelines** : -  

1.   Well-structured, formatted, and commented code is required.
2.   Exception Handling, Production Grade Code & Deployment Ready Code will be a plus. Those students will be awarded some additional credits.
     
     The additional credits will have advantages over other students during Star Student selection.
       
             [ Note: - Deployment Ready Code is defined as, the whole .ipynb notebook should be executable in one go
                       without a single error logged. ]

3.   Each and every logic should have proper comments.
4. You may add as many number of charts you want. Make Sure for each and every chart the following format should be answered.
        

```
# Chart visualization code
```
            

*   Why did you pick the specific chart?
*   What is/are the insight(s) found from the chart?
* Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

5. You have to create at least 20 logical & meaningful charts having important insights.


[ Hints : - Do the Visualization in a structured way while following the "UBM" Rule.

U - Univariate Analysis,

B - Bivariate Analysis (Numerical - Categorical, Numerical - Numerical, Categorical - Categorical)

M - Multivariate Analysis
 ]





# ***Let's Begin !***

## ***1. Know Your Data***

### Import Libraries

In [None]:
# Import Libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import plotly.express as px
import plotly.graph_objects as go
%matplotlib inline

### Dataset Loading

In [None]:
# Load Dataset
from google.colab import drive
drive.mount('/content/drive')


In [None]:
path_file =('/content/drive/MyDrive/AlmaBetter/Capstone_Project/Airbnb_NYC_2019.csv')
airbnb_df = pd.read_csv(path_file)

### Dataset First View

In [None]:
# Dataset First Look
airbnb_df.head(6)

### Dataset Rows & Columns count

In [None]:
# Dataset Rows & Columns count
airbnb_df.shape

There are 48895 rows and 16 columns in this dataset

### Dataset Information

In [None]:
# Dataset Info
# info is a method, so it must be called to print the column summary
airbnb_df.info()


#### Duplicate Values

In [None]:
# Dataset Duplicate Value Count
len(airbnb_df[airbnb_df.duplicated()])

There are no duplicate values in this dataset

#### Missing Values/Null Values

In [None]:
# Missing Values/Null Values Count
print(airbnb_df.isnull().sum())

In [None]:
# Visualizing the missing values
# Checking Null Value by plotting Heatmap
sns.heatmap(airbnb_df.isnull(), cbar=False)

last_review and reviews_per_month have a reasonably high number of null values.
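
The heatmap shows where the nulls are, but not how large the gaps are. A minimal sketch of how the share of missing values per column could be computed is given below. It runs on a small made-up frame (`toy_df` is a hypothetical stand-in for `airbnb_df`, and its values are invented for illustration only).

```python
import numpy as np
import pandas as pd

# Toy stand-in for airbnb_df; the values are made up for illustration only
toy_df = pd.DataFrame({
    "name": ["Cozy loft", None, "Sunny room", "Studio"],
    "last_review": ["2019-05-01", None, None, "2019-06-15"],
    "reviews_per_month": [0.5, np.nan, np.nan, 1.2],
    "price": [120, 80, 60, 200],
})

# isnull().mean() is the fraction of missing entries in each column
missing_pct = (toy_df.isnull().mean() * 100).round(1).sort_values(ascending=False)
print(missing_pct)
```

Replacing `toy_df` with `airbnb_df` should give the same kind of summary for the real data.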

### What did you know about your dataset?

The dataset contains Airbnb booking details for the year 2019 and is the foundation for the insights derived in this project. It has 48,895 rows and 16 columns and contains no duplicate rows. However, two columns, 'last_review' and 'reviews_per_month', have a significant number of missing values. These missing values are handled in the data wrangling step so that they do not affect the reliability of the analysis.

## ***2. Understanding Your Variables***

In [None]:
# Dataset Columns
airbnb_df.columns

In [None]:
# Dataset Describe
airbnb_df.describe(include='all')

### Variables Description

The average minimum number of nights for bookings is 7.029.

The lowest recorded minimum nights for bookings is 1.

An unusually high maximum value of 1250 for minimum nights is observed, indicating a potential anomaly.

The minimum price charged for bookings is 0, which is not a reasonable value.

The average price for bookings is $152.72.

The highest recorded price for bookings is $10,000.
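
The anomalies above can also be counted rather than only read off the `describe` output. The sketch below does this on a small made-up frame. The 365-night cut-off is our own assumption for what counts as an unusually long minimum stay, not a rule taken from the dataset.

```python
import pandas as pd

# Toy stand-in for airbnb_df; the values are made up for illustration only
toy_df = pd.DataFrame({
    "price": [0, 150, 10000, 75],
    "minimum_nights": [1, 1250, 3, 30],
})

# Hypothetical rules: a price of 0 is invalid, a minimum stay above one year is suspicious
zero_price = toy_df["price"] <= 0
long_min_stay = toy_df["minimum_nights"] > 365

anomaly_counts = {
    "zero_price": int(zero_price.sum()),
    "min_nights_over_365": int(long_min_stay.sum()),
}
print(anomaly_counts)
```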

### Check Unique Values for each variable.

In [None]:
# Check Unique Values for each variable.
for column in airbnb_df.columns.tolist():
    unique_count = airbnb_df[column].nunique()
    print("The number of unique values in", column, "is", unique_count)

## 3. ***Data Wrangling***

### Data Wrangling Code

In [None]:
# Write your code to make your dataset analysis ready.
#Preparing the dataset for wrangling
# Creating a copy of the dataset
df = airbnb_df.copy()


In [None]:
#Checking for null values
df.isnull().sum()

In [None]:
#Handling missing values by replacing them with 0.
df.fillna({'reviews_per_month':0},inplace=True)
df.fillna({'last_review':0},inplace=True)
df.fillna({'name':0},inplace=True)
df.fillna({'host_name':0},inplace=True)

In [None]:
df.isnull().sum()

#missing values successfully filled.

In [None]:
#Checking for the datatypes of each column
df.dtypes

#Need not change any datatype for now

In [None]:
#It was observed that the minimum price of booking is 0 which must have been an error. So let's clean that value and choose only price above 0.

df = df[df['price'] > 0]
df.describe()


In [None]:
# Performing some aggregate functions for obtaining better insights about the data
#Airbnb booking trend with respect to property type
df_property_types = df['room_type'].value_counts()
print(df_property_types)

In [None]:
# Airbnb booking trend with respect to neighbourhood
df_area = df["neighbourhood_group"].value_counts()
print(df_area)

In [None]:
#Price per neighbourhood_group according to different room types.
price_per_group = df.groupby(['neighbourhood_group','room_type'])['price'].sum().sort_values(ascending=False).reset_index()
print(price_per_group)

In [None]:
#An overall picture about property preferences
property_preference=df.groupby(['latitude','longitude','name','host_id',"neighbourhood_group",'neighbourhood', 'room_type'])['number_of_reviews'].sum().sort_values(ascending=False).nlargest(n=10).reset_index()
print(property_preference)

In [None]:
#Most revenue yielding neighbourhoods and respective neighbourhood groups.
revenue_yeilding_neighbourhoods=df.groupby(["neighbourhood_group",'neighbourhood', 'room_type'])['price'].sum().sort_values(ascending=False).nlargest(n=10).reset_index()
print(revenue_yeilding_neighbourhoods)

In [None]:
# Understanding hosts with maximum listings.
maximum_listings=df['host_id'].value_counts().head(10)
print(maximum_listings)

In [None]:
# Understanding top hosts.
top_hosts=df.groupby(["host_id"])['reviews_per_month'].sum().sort_values(ascending=False).nlargest(n=10).reset_index()
print(top_hosts)

### What all manipulations have you done and insights you found?

All the manipulations done are on the copy of the dataset keeping the original dataset intact.

1) First step was to look out for null values and filling those voids. The approach we adopted here was replacing nulls with 0.

2) When we used the describe method on the data, some anomalies were observed. The most important one was the minimum price, which was shown as 0. In fact, there were several rows with 0 as the per-night booking price. This must have occurred as an error while updating the data, and such rows are not useful for the study, so we removed them.

3) The next few steps were to understand the data more deeply. The data covers Airbnb bookings in New York in the year 2019 and is spread across the 5 boroughs of New York:

    a) Manhattan
    b) Brooklyn
    c) Queens
    d) Bronx
    e) Staten Island

In these 5 neighbourhood groups Airbnb stays are of 3 types

    a) Entire home/apartment
    b) Private room
    c) Shared room

4) Later we performed some group-by operations to get a broader idea of the locations and room types that bring in the most revenue, and by how much. We also delved deeper into property preferences. We found that the room type that brings in maximum revenue is private rooms, among which

    1) Jamaica of Queens,
    2) Harlem of Manhattan and
    3) East Elmhurst of Queens
   
are the most popular neighbourhoods.

5) Finally we got a good picture on the hosts with maximum listings as well as top performing hosts. Host with host_id '219517861' is the best host with the highest number of properties.
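
Steps 1 and 2 above can be wrapped in one small function, which makes it easier to confirm that the original frame stays intact. This is only a sketch: `clean_listings` and `fill_values` are hypothetical names, and the toy frame holds made-up values and only numeric columns.

```python
import numpy as np
import pandas as pd

def clean_listings(raw, fill_values):
    """Hypothetical helper: fill nulls and drop zero-price rows on a copy."""
    cleaned = raw.copy()                      # work on a copy, keep raw intact
    cleaned = cleaned.fillna(fill_values)     # step 1: replace nulls
    cleaned = cleaned[cleaned["price"] > 0]   # step 2: drop rows with price 0
    return cleaned

# Toy stand-in for airbnb_df; the values are made up for illustration only
raw = pd.DataFrame({
    "price": [0, 100, 250],
    "reviews_per_month": [np.nan, 1.5, np.nan],
})

cleaned = clean_listings(raw, {"reviews_per_month": 0})
print(cleaned)
print(raw.isnull().sum())  # the original still has its nulls
```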


## ***4. Data Visualization, Storytelling & Experimenting with charts : Understand the relationships between variables***

#### Chart - 1 | Bar Chart on Property Types

In [None]:
# Chart - 1 visualization code
ax = df_property_types.plot.bar(figsize = (10,5),fontsize = 14)
# Set the title
ax.set_title("Trend in Airbnb bookings with respect to property type", fontsize = 20)
# Set x and y-labels
ax.set_xlabel("Property Type", fontsize = 15)
ax.set_ylabel("Number of bookings ", fontsize = 15)


##### 1. Why did you pick the specific chart?

A bar chart plots numeric values for levels of a categorical feature as bars. Levels are plotted on one chart axis, and values are plotted on the other axis. Each categorical value claims one bar, and the length of each bar corresponds to the bar’s value. Bars are plotted on a common baseline to allow for easy comparison of values. To understand the trend between bookings and property type, a bar chart seemed like the best possible visualization.



##### 2. What is/are the insight(s) found from the chart?

Shared rooms are the least popular type of lodging for Airbnb customers, whereas most guests prefer to rent entire apartments or homes, closely followed by private rooms.


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

It is suggested to increase availability of entire apartments/homes as well as private rooms to boost the profits.

#### Chart - 2 | Pie Chart on Neighbourhood Groups



In [None]:
# Chart - 2 visualization code
sns.set_theme(palette='pastel',font_scale=1)
ax = df_area.plot.pie(figsize = (10,5),fontsize = 7, autopct='%.0f%%')
#Set the title
ax.set_title("Trend in Airbnb bookings with respect to neighbourhood groups", fontsize = 20)


##### 1. Why did you pick the specific chart?

A pie chart illustrates the part-to-whole relationship in the data. The area each colour covers in the circle makes it simple to see how the percentages compare, which is why pie charts are widely used for comparing shares. I therefore used a pie chart to compare the percentage of bookings with respect to the different neighbourhood groups.

##### 2. What is/are the insight(s) found from the chart?

It is observed that 44% of the guests book Airbnbs in Manhattan, closely followed by Brooklyn with 41%. Queens hosts only 12% of guests. Bronx and Staten Island host the fewest guests, with a meagre 2 and 1 percent respectively.

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

- Most guests travel to Manhattan and Brooklyn. Queens also hosts a significant proportion of guests. Listings could be increased in these locations.
- It is assumed that Bronx and Staten Island have geographical disadvantages that result in fewer guests. More research could be done on Bronx and Staten Island.
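
The percentages shown on the pie chart can also be obtained as numbers. The sketch below uses `value_counts(normalize=True)` on a small made-up series (`toy_groups` is a hypothetical stand-in for `df["neighbourhood_group"]`, so the shares here are not the real ones).

```python
import pandas as pd

# Toy stand-in for df["neighbourhood_group"]; the counts are made up
toy_groups = pd.Series(
    ["Manhattan"] * 5 + ["Brooklyn"] * 3 + ["Queens"] * 2,
    name="neighbourhood_group",
)

# normalize=True returns fractions instead of raw counts
share_pct = (toy_groups.value_counts(normalize=True) * 100).round(1)
print(share_pct)
```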

#### Chart - 3 | Stacked Bar Chart on Revenue Per Borough and the Specific Room Types

In [None]:
# Chart - 3 visualization code
# Sum of listed prices per borough (rows) and room type (columns)
revenue_pivot = df.pivot_table(index='neighbourhood_group', columns='room_type', values='price', aggfunc='sum')
revenue_pivot.plot(kind='bar', stacked=True)
#Set Title
plt.title('REVENUE PER NEIGHBOURHOOD GROUPS AND ROOM TYPES')
#Set y label
plt.ylabel('price')
plt.show()

##### 1. Why did you pick the specific chart?

The stacked bar chart extends the standard bar chart from looking at numeric values across one categorical variable to two. Each bar in a standard bar chart is divided into a number of sub-bars stacked end to end, each one corresponding to a level of the second categorical variable. This lets us clearly see how revenue is distributed among the different room types within each neighbourhood group.



##### 2. What is/are the insight(s) found from the chart?

- In Manhattan and Brooklyn almost 75% of guests prefer an entire home/apartment. Almost 20-23 percent prefer a private room and a meagre 1-2% opt for shared rooms.

- In Queens the trend is different. Entire home/apartment and private rooms are preferred almost equally, with a very small percentage opting for shared rooms.

- In Bronx and Staten Island the Airbnb presence itself is very limited. Entire home/apartment and private rooms are preferred over shared rooms.

- In a nutshell, entire homes/apartments are the most preferred and shared rooms are the least preferred, irrespective of borough.
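
The shares quoted above are read off the stacked bars. One way to check them numerically is to divide each row of the pivot table by its row total. The sketch below does this on a small made-up frame, using the summed listed price as the revenue figure, as the chart does. The values are invented, so the resulting percentages are not the real ones.

```python
import pandas as pd

# Toy stand-in for df; the values are made up for illustration only
toy_df = pd.DataFrame({
    "neighbourhood_group": ["Manhattan", "Manhattan", "Manhattan", "Queens", "Queens"],
    "room_type": ["Entire home/apt", "Private room", "Entire home/apt",
                  "Private room", "Entire home/apt"],
    "price": [300, 100, 200, 60, 140],
})

# Same pivot as the chart: summed price per borough and room type
revenue = toy_df.pivot_table(index="neighbourhood_group", columns="room_type",
                             values="price", aggfunc="sum", fill_value=0)

# Divide every row by its own total to get within-borough percentages
share_pct = revenue.div(revenue.sum(axis=1), axis=0).mul(100).round(1)
print(share_pct)
```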



##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Again, entire apartments/homes and private rooms are the most preferred. It is assumed that shared rooms are only preferred when people are travelling on a tight budget.
If we want to boost sales for shared spaces, introducing fun and aesthetic listings under shared rooms could be helpful.

#### Chart - 4 | Bar Chart on Most Revenue Yielding Neighbourhoods



In [None]:
# Chart - 4 visualization code
# Most revenue yielding neighbourhoods
sns.set_theme(palette='pastel',font_scale=1)
print(revenue_yeilding_neighbourhoods)
# Plot directly from the aggregated frame so that df_area (used in Chart 2) is not overwritten
revenue_yeilding_neighbourhoods.plot(kind='bar', x='neighbourhood', y='price')
plt.title('TOP 10 REVENUE YIELDING NEIGHBOURHOODS')
plt.ylabel('price')

##### 1. Why did you pick the specific chart?

Although the bar chart is one of the most basic visualization methods, it is preferred here because it clearly depicts the revenue distribution among the top 10 neighbourhoods.

##### 2. What is/are the insight(s) found from the chart?

Top 3 revenue yielding neighbourhoods are :    
     a) Williamsburg in Brooklyn,
     b) Midtown in Manhattan,
     c) Upper West Side in Manhattan.

It's also observed that of the top 10 entries, 8 are from Manhattan and 2 are from Brooklyn.    


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

- When we looked at booking trends, Manhattan and Brooklyn had comparable booking rates.
- However, there is a considerable disparity in the revenue brought in, as well as in the top performing listings, when we compare these boroughs. This could be because of the overall charm of the listings, pricing, amenities, reviews, etc.

#### Chart - 5 | Bar Chart on Top Hosts

In [None]:
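# Chart - 5 visualization code
# Top 10 hosts by the sum of reviews_per_month across all their listings
top_hosts=df.groupby(["host_id"])['reviews_per_month'].sum().sort_values(ascending=False).nlargest(n=10).reset_index()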
top_hosts.plot(kind='bar', x='host_id', y='reviews_per_month')
#Setting title and y label.
plt.title('TOP 10 HOSTS')
plt.ylabel('reviews_per_month')



##### 1. Why did you pick the specific chart?

A bar chart gives the best representation of host_id versus reviews per month.

##### 2. What is/are the insight(s) found from the chart?

- The host with Host ID '219517861' is the best host, with the highest number of reviews and properties.
- There is a large disparity between host '219517861' and the other hosts. This could be because the quality, service and amenities this host provides stand out from those of other hosts.
- It should also be noted that this host has the maximum number of properties. It is possible that other hosts provide equally good properties but, because they have fewer listings, we have not uncovered them.

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Multiple hosts with a huge number of listings are not getting enough guests. This needs to be checked.
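
Because the ranking above sums `reviews_per_month` over all of a host's listings, hosts with many listings are favoured. One possible way to separate the two effects is to also look at the average per listing. The sketch below shows the idea on a small made-up frame. The column names `total`, `listings` and `mean_per_listing` are our own labels.

```python
import pandas as pd

# Toy stand-in for df; the values are made up for illustration only
toy_df = pd.DataFrame({
    "host_id": [1, 1, 1, 1, 2, 3, 3],
    "reviews_per_month": [2.0, 2.0, 2.0, 2.0, 5.0, 1.0, 3.0],
})

# Per host: summed reviews per month, number of listings, and average per listing
host_stats = toy_df.groupby("host_id")["reviews_per_month"].agg(
    total="sum", listings="count", mean_per_listing="mean"
)

# Ranking by the average removes the advantage of simply having more listings
ranked = host_stats.sort_values("mean_per_listing", ascending=False)
print(ranked)
```

In this toy example host 1 leads on the total, while host 2 leads on the per-listing average.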

#### Chart - 6 | Joint Plot on Host ID And Reviews

In [None]:
# Chart - 6 visualization code
#Jointplot to evaluate correlation between host_id and reviews.
sns.jointplot(x='host_id', y='reviews_per_month', data=df)


##### 1. Why did you pick the specific chart?

A joint plot shows the relationship between two variables together with the distribution of each one. It consists of three plots: the central plot shows the relationship between x and y, and the two marginal plots show the individual distributions. Here we use it to examine the relationship between host_id and reviews_per_month.

##### 2. What is/are the insight(s) found from the chart?

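The plot suggests a positive correlation between host_id and reviews_per_month. Because host_id is an identifier rather than a measured quantity, this relationship should be interpreted with caution.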

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Addressing the performance of hosts with numerous listings but low guest engagement could potentially improve overall host performance.

#### Chart - 7 | Box Plot on Price Per Neighbourhood Groups

In [None]:
# Chart - 7 visualization code
sns.set_style("whitegrid")
price_distribution = df[df.price <700]
plt.title("PRICE PER NEIGHBOURHOOD GROUP")
sns.boxplot(y= 'price',x= 'neighbourhood_group',data=price_distribution)
plt.show()

##### 1. Why did you pick the specific chart?

A box plot uses boxes and lines to depict the distributions of one or more groups of numeric data. Box limits indicate the range of the central 50% of the data, with a central line marking the median value. Lines extend from each box to capture the range of the remaining data, with dots placed past the line edges to indicate outliers.

##### 2. What is/are the insight(s) found from the chart?

- Manhattan seems to have the highest price range for listings, followed by Brooklyn.
- Queens and Staten Island have very similar distributions, with Queens leading by a noticeable margin.
- Listings in Bronx are evidently the cheapest accommodation options.
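
The quantities a box plot draws (quartiles and the interquartile range) can also be tabulated per borough, which makes the comparison above easier to verify. This is a minimal sketch on a small made-up frame. The column labels `q1`, `median`, `q3` and `iqr` are our own.

```python
import pandas as pd

# Toy stand-in for price_distribution; the values are made up for illustration only
toy_df = pd.DataFrame({
    "neighbourhood_group": ["Manhattan"] * 5 + ["Bronx"] * 5,
    "price": [100, 150, 200, 250, 300, 40, 60, 80, 100, 120],
})

# Quartiles per borough; unstack turns the quantile level into columns
quartiles = (
    toy_df.groupby("neighbourhood_group")["price"]
    .quantile([0.25, 0.5, 0.75])
    .unstack()
)
quartiles.columns = ["q1", "median", "q3"]

# The box height in a box plot is the interquartile range
quartiles["iqr"] = quartiles["q3"] - quartiles["q1"]
print(quartiles)
```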

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 8 - Correlation Heatmap

In [None]:
# Correlation between different variables
# numeric_only=True skips text columns, which would raise an error in recent pandas versions
corr = df.corr(numeric_only=True)
# Set up the matplotlib plot configuration
f, ax = plt.subplots(figsize=(12, 10))
# Generate a mask for the upper triangle
mask = np.triu(np.ones_like(corr, dtype=bool))
# Configure a custom diverging colormap
cmap = sns.diverging_palette(230, 20, as_cmap=True)
sns.heatmap(corr, annot=True, mask = mask, cmap=cmap)


##### 1. Why did you pick the specific chart?

Correlation heatmaps are a type of plot that visualize the strength of relationships between numerical variables. Correlation plots are used to understand which variables are related to each other and the strength of this relationship.
We used correlation heatmap here to check if some variables have any direct correlation with other variables.


##### 2. What is/are the insight(s) found from the chart?

- Reviews per month and number of reviews are highly correlated, as expected.
- Host ID has some correlation with 3 other variables:

    1) Availability
    2) Listings
    3) Availability through 365 days.
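
Reading individual cells off a heatmap can be error-prone. A small sketch of how the correlations of one column with all the others could be listed in order of strength is given below. It uses a made-up frame, so the coefficients shown are not those of the real dataset.

```python
import pandas as pd

# Toy stand-in for df; the values are made up for illustration only
toy_df = pd.DataFrame({
    "host_id": [1, 2, 3, 4],
    "reviews_per_month": [2.0, 4.0, 6.0, 8.0],
    "number_of_reviews": [4, 3, 2, 1],
    "price": [100, 50, 50, 100],
})

corr = toy_df.corr(numeric_only=True)

# Correlations of host_id with every other column, strongest first by absolute value
ranked = corr["host_id"].drop("host_id").sort_values(
    key=lambda s: s.abs(), ascending=False
)
print(ranked)
```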

## **5. Solution to Business Objective**

#### What do you suggest the client to achieve Business Objective ?
Explain Briefly.

Based on the observations made during the exploratory data analysis of the Airbnb booking dataset, here are some suggestions to achieve the business objectives of optimizing pricing, maximizing revenue, and enhancing customer satisfaction:

- **Room Type Optimization:**

 Private rooms are the most revenue-generating room type, especially in popular neighborhoods like Jamaica (Queens), Harlem (Manhattan), and East Elmhurst (Queens).
 To optimize pricing and maximize revenue, consider offering more private rooms in these popular neighborhoods.
 However, don't neglect the demand for entire apartments/homes as they also contribute significantly to revenue.


- **Geographical Focus:**

 Manhattan and Brooklyn are the most popular boroughs among guests, with Queens also hosting a significant number of guests.
 Consider increasing the number of listings in Manhattan and Brooklyn to further cater to guest demand.
 While Bronx and Staten Island have lower guest numbers, it might be worth conducting further research to understand any geographical disadvantages that could be impacting demand in these areas.

- **Room Type Preference by Borough:**

 Guests overwhelmingly prefer entire home/apartment rentals in Manhattan and Brooklyn, with private rooms also being popular.
 In Queens, there's a more balanced preference between entire home/apartment and private rooms.
 In Bronx and Staten Island, prioritize offering entire home/apartment and private room options over shared rooms.
 Consider adjusting the listing distribution in each borough to align with these preferences.

- **Top Revenue-Generating Neighborhoods:**

 Focus on neighborhoods like Williamsburg (Brooklyn), Midtown (Manhattan), and Upper West Side (Manhattan) for higher revenue.
 Since most of the top revenue-yielding neighborhoods are in Manhattan, ensure that listings in these neighborhoods are well-maintained, competitively priced, and offer exceptional amenities.

- **Host Performance and Listings:**

 Investigate why the host with Host ID '219517861' is outperforming others significantly. Analyze their properties, services, and amenities to identify best practices.
 Address the imbalance between hosts with a large number of listings and their actual booking rates. Some hosts might need assistance to improve their listings' visibility and attractiveness.

- **Pricing Strategy:**

 Review the pricing strategy for listings in different boroughs. For instance, since Manhattan and Brooklyn have higher price ranges, ensure that the prices are justified by the quality of accommodations and services.
 Consider introducing competitive pricing for listings in Bronx and Staten Island to attract more budget-conscious travelers.

- **Correlation Insights:**

 High correlation between reviews per month and the number of reviews suggests that actively reviewed properties tend to have more bookings.
 Host ID's correlation with availability, listings, and availability through 365 days could indicate that certain hosts are more consistent in managing and maintaining their properties.

- **Negative Growth Considerations:**

 Shared rooms are less popular and generate less revenue compared to entire apartments/homes or private rooms. While you might want to continue offering shared rooms for budget travelers, the focus should be on increasing the availability and quality of entire apartments/homes and private rooms.

- **Enhancing Guest Experience:**

 Collect guest feedback and reviews to identify areas of improvement. Address any recurring issues or concerns raised by guests to enhance overall satisfaction.
 Consider providing unique and aesthetically pleasing shared room options to attract more guests to this category.

- **Marketing and Promotion:**

 Use data insights to inform marketing strategies. Promote listings in popular neighborhoods and boroughs to attract more bookings.
 Highlight unique features and amenities of listings to stand out from competitors.

# **Conclusion**

In conclusion, our exploratory data analysis of the Airbnb booking dataset for New York in the year 2019 has provided valuable insights into the booking trends and preferences of Airbnb guests. Through a comprehensive examination of various factors, we aimed to uncover key understandings that can contribute to informed decision-making and business strategies. Here are some of the significant findings from our analysis:

- ***Popular Cities and Geographic Patterns:***

Manhattan and Brooklyn emerged as the most popular boroughs for Airbnb bookings, with a majority of guests preferring these locations.
Queens also hosted a considerable number of guests, while Bronx and Staten Island had fewer bookings, potentially due to geographical factors.
To maximize business impact, focusing on Manhattan, Brooklyn, and Queens for expanding listings and marketing efforts could lead to positive growth.

- ***Property Types and Preferences:***

Private rooms were the top revenue-generating property type, particularly in neighborhoods like Jamaica (Queens), Harlem (Manhattan), and East Elmhurst (Queens).
Our analysis suggests that offering more private rooms and entire home/apartments could enhance profitability and customer satisfaction.


- ***Host Performance and Insights:***

Host with Host ID '219517861' stood out as the best host with the highest number of properties and reviews.
This host's exceptional service quality, amenities, and property management practices likely contributed to their success.
Addressing the performance of hosts with numerous listings but low guest engagement could potentially improve overall host performance.


- ***Pricing and Revenue Distribution:***

Revenue distribution varied across different room types and neighborhoods. Private rooms and entire home/apartments were preferred over shared rooms.
Our data-driven recommendations include adjusting pricing strategies for shared rooms and emphasizing unique features to attract guests.


- ***Revenue-Yielding Neighborhoods:***

Williamsburg (Brooklyn), Midtown, and Upper West Side (both in Manhattan) emerged as the top revenue-generating neighborhoods.
Capitalizing on the charm, amenities, and positive reviews of listings in these neighborhoods could drive further revenue growth.

- ***Price Variation and Correlation Insights:***

Price distribution varied across boroughs, with Manhattan and Brooklyn having higher price ranges.
The correlation heatmap provided insights into the relationships between variables, such as reviews per month and number of reviews, which could guide strategies for maintaining and improving guest satisfaction.


In summary, our exploratory data analysis has illuminated several avenues for optimizing pricing, maximizing revenue, and enhancing customer satisfaction within the Airbnb booking ecosystem. By strategically focusing on specific property types, neighborhoods, and pricing strategies, businesses can create positive impacts and cultivate an enhanced guest experience. However, it is important to continuously monitor trends, gather additional insights, and refine strategies based on evolving market dynamics to achieve sustained success in the competitive landscape of the hospitality industry.




# **Thank you for viewing!**