<a href="https://colab.research.google.com/github/devops-tarunraj/SimpleWebApp/blob/master/AirBnb_Capstone_Project_(1).ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# **Project Name**    - **AirBnb Bookings Analysis**



##### **Project Type**    - Exploratory Data Analysis (AirBnb)
##### **Contribution**    - Individual

# **Project Summary -**

In the Airbnb dataset, **Exploratory Data Analysis** **(EDA)** unfolded a multifaceted narrative of the platform's dynamics. Comprehensive examinations revealed a diverse array of property types, with insights into pricing, customer sentiment, and geographical concentration. Visualizations, such as histograms and correlation matrices, elucidated patterns and dependencies, offering valuable guidance for hosts aiming to optimize pricing strategies and enhance customer satisfaction. Geographic analyses highlighted spatial nuances in property distribution, while sentiment analysis of customer reviews unveiled factors contributing to positive or negative feedback. The EDA not only laid the groundwork for predictive modeling but also empowered hosts with actionable insights for refining their offerings.

Looking forward, the analysis suggests avenues for further refinement, including advanced modeling techniques, incorporation of external datasets, and ongoing monitoring of pricing and review dynamics. The EDA's impact extends beyond numerical insights, fostering a nuanced understanding of user behavior and preferences within the Airbnb ecosystem. This foundational exploration serves as a compass for subsequent analyses, steering towards a data-driven approach in optimizing user experience and operational strategies for hosts and the platform alike.

# **GitHub Link -**

# **Problem Statement**


Airbnb, a popular online marketplace for lodging and travel experiences, has accumulated a vast dataset reflecting the diverse activities on its platform. Hosts use Airbnb to list their properties, and travelers explore and book accommodations based on various factors such as pricing, location, and reviews. This project aims to conduct an in-depth Exploratory Data Analysis to extract meaningful insights that can benefit both hosts and travelers, enhancing the overall Airbnb experience.

#### **Define Your Business Objective?**

The business objective for an Airbnb Exploratory Data Analysis (EDA) project is to leverage insights gained from the dataset to enhance the overall Airbnb experience for both hosts and travelers. Specifically, the objectives include:

**1.Optimizing Pricing Strategies**: Understand the dynamics influencing pricing variations on Airbnb. Identify factors such as property type, location, amenities, and seasonality that impact pricing. Provide hosts with actionable recommendations to optimize their pricing strategies, leading to competitive and attractive listings.

**2.Enhancing Geographic Distribution**: Analyze the geographical distribution of Airbnb listings. Identify popular neighborhoods and regions with a high concentration of listings. Offer insights for hosts on optimal locations for property listings based on demand and pricing trends.

**3.Improving Customer Satisfaction**: Perform sentiment analysis on customer reviews to understand the factors influencing satisfaction or dissatisfaction. Provide hosts with insights into aspects that contribute to positive or negative sentiments. Enhance the overall quality of property offerings by addressing areas highlighted in customer feedback.

**4.Diversifying Property Offerings**: Explore the diversity of property types available on Airbnb. Analyze the distribution and popularity of different accommodation types. Provide hosts with recommendations to diversify their property offerings, catering to a broader range of traveler preferences.

**5.Data-Driven Decision-Making**: Equip hosts and travelers with data-driven insights for more informed decision-making. Improve the overall Airbnb experience by empowering users with information on pricing trends, property types, and customer sentiment.

# **General Guidelines** : -  

1.   Well-structured, formatted, and commented code is required.
2.   Exception Handling, Production Grade Code & Deployment Ready Code will be a plus. Those students will be awarded some additional credits.
     
     The additional credits will have advantages over other students during Star Student selection.
       
             [ Note: - Deployment Ready Code is defined as, the whole .ipynb notebook should be executable in one go
                       without a single error logged. ]

3.   Each and every logic should have proper comments.
4. You may add as many charts as you want. Make sure that each chart follows the format below and answers every question.
        

```
# Chart visualization code
```
            

*   Why did you pick the specific chart?
*   What is/are the insight(s) found from the chart?
* Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

5. You have to create at least 20 logical & meaningful charts having important insights.


[ Hints : - Do the Visualization in a structured way while following "UBM" Rule.

U - Univariate Analysis,

B - Bivariate Analysis (Numerical - Categorical, Numerical - Numerical, Categorical - Categorical)

M - Multivariate Analysis
 ]





# ***Let's Begin !***

## ***1. Know Your Data***

### Import Libraries

In [None]:
# Import Libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
print("All libraries are imported successfully!")

### Dataset Loading

In [None]:
# Load Dataset
air_df=pd.read_csv("/content/Airbnb NYC 2019.csv")
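
The guidelines above ask for exception handling and a notebook that runs end to end without errors. A minimal sketch of a guarded load is shown below. The helper name `load_listings` and the variable `checked_df` are hypothetical and not part of the original notebook; the path is the one used in the cell above.

```python
from pathlib import Path

import pandas as pd


def load_listings(csv_path):
    """Return the listings DataFrame, or None if the file cannot be read.

    Hypothetical helper for illustration only.
    """
    path = Path(csv_path)
    if not path.is_file():
        print(f"File not found: {path}")
        return None
    try:
        return pd.read_csv(path)
    except (pd.errors.EmptyDataError, pd.errors.ParserError) as exc:
        print(f"Could not parse {path}: {exc}")
        return None


# Same path as the loading cell; the result is None when the file is absent
checked_df = load_listings("/content/Airbnb NYC 2019.csv")
if checked_df is not None:
    print(checked_df.shape)
```

Returning `None` instead of raising keeps the remaining cells from failing with an unrelated `NameError`, at the cost of requiring a check before the DataFrame is used.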

### Dataset First View

In [None]:
# Dataset First Look
air_df.head()

### Dataset Rows & Columns count

In [None]:
# Dataset Rows & Columns count
print(air_df.shape)

### Dataset Information

In [None]:
# Dataset Info
print(air_df.info())

#### Duplicate Values

In [None]:
# Dataset Duplicate Value Count
duplicate_rows = air_df.duplicated()
total_duplicates = duplicate_rows.sum()

print(f"Total duplicate rows in the dataset: {total_duplicates}")

#### Missing Values/Null Values

In [None]:
# Missing Values/Null Values Count
missing_values = air_df.isnull().sum()

In [None]:
# Visualizing the missing values
print("Count of missing values for each column:")
print(missing_values)

### What did you know about your dataset?

**1.Size and Structure**: The dataset has 16 columns and 48,895 rows. Its columns cover the object, int, and float data types, and the column names are ones commonly used in the short-term rental industry.

**2.Missing Values**: The columns name, host_name, last_review, and reviews_per_month contain missing values.

**3.Relationships**: Whether there are noticeable patterns or relationships between variables is explored through the charts in section 4.
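
Raw missing counts are easier to judge next to the share of rows they represent. The sketch below shows one way to build such a summary. It runs on a tiny synthetic frame (`demo_df`, values made up for illustration) rather than the real `air_df`, so the numbers it prints say nothing about the actual dataset.

```python
import numpy as np
import pandas as pd

# Tiny synthetic stand-in for air_df; all values are invented for illustration
demo_df = pd.DataFrame({
    "name": ["Cozy loft", None, "Sunny room", "Studio"],
    "host_name": ["Ana", "Ben", None, "Dee"],
    "price": [150, 80, 60, 120],
    "reviews_per_month": [0.5, np.nan, np.nan, 1.2],
})

# Count of missing values and percentage of rows affected, per column
missing_summary = pd.DataFrame({
    "missing_count": demo_df.isnull().sum(),
    "missing_pct": (demo_df.isnull().mean() * 100).round(1),
}).sort_values("missing_count", ascending=False)

print(missing_summary)
```

Replacing `demo_df` with `air_df` would give the same summary for the full dataset.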

## ***2. Understanding Your Variables***

In [None]:
# Dataset Columns
list(air_df.columns)

In [None]:
# Dataset Describe
air_df.describe()

### Check Unique Values for each variable.

In [None]:
# Check Unique Values for each variable.
for column in air_df.columns:
    unique_values = air_df[column].unique()[:5]  # Display only the first 5 unique values
    print(f"Unique values in {column}:")
    print(unique_values)
    print()

## 3. ***Data Wrangling***

### Data Wrangling Code

In [None]:
# Write your code to make your dataset analysis ready.
#drop unwanted columns
list_unwanted_columns = ['latitude', 'longitude', 'last_review', 'reviews_per_month']
air_df.drop(list_unwanted_columns, axis=1, inplace=True)

In [None]:
#drop duplicate values
air_df.drop_duplicates(inplace=True)

In [None]:
#fill missing values
#Backward-fill NaN values (each gap takes the value from the next row)
air_df.bfill(inplace=True)

# Display the DataFrame with backward-filled values
air_df


In [None]:
air_df.info()

### What all manipulations have you done and insights you found?

**A)Data Wrangling Manipulations:**

**1.Remove unnecessary columns:** Removing unnecessary columns simplifies further data manipulation and reduces confusion during analysis.

**2.Handling Missing Values:** Some columns in this data contain missing values, which make visualization difficult, so I filled them using the value from the following row (backward fill).

**3.Removing Duplicates:** Removing duplicates is important because duplicate rows distort counts and averages and cause confusion when visualizing the data.

**B)Insights Gained:**

**1.Patterns and Trends:** When the data is clean and concise, patterns and trends are easier to observe.

**2.Correlations:** Correlation means a relationship between variables, and observing correlations reliably requires clean data. Some correlations can be seen in this dataset.

**3.Visualizations:** Clean and concise data is essential for visualization. The following sections visualize the data with graphs, which requires a clean dataset.
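
Backward fill copies the next row's value into each gap, which for text columns such as `name` or `host_name` means borrowing the value of an unrelated listing. One alternative, sketched below on a small synthetic frame (`demo_df`, invented values), is to fill text columns with an explicit placeholder. The placeholder string `"Unknown"` is an assumption chosen for illustration, not something the original notebook uses.

```python
import pandas as pd

# Synthetic stand-in for air_df after dropping columns; values are invented
demo_df = pd.DataFrame({
    "name": ["Cozy loft", None, "Sunny room"],
    "host_name": ["Ana", "Ben", None],
    "price": [150, 80, 60],
})

# Fill each text column with a placeholder instead of a neighbouring row's value
cleaned_df = demo_df.fillna({"name": "Unknown", "host_name": "Unknown"})

print(cleaned_df)
print("Remaining missing values:", int(cleaned_df.isnull().sum().sum()))
```

The placeholder keeps missing entries visible in later group-bys, whereas a backward fill makes them indistinguishable from real values.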

## ***4. Data Visualization, Storytelling & Experimenting with charts : Understand the relationships between variables***

#### Chart - 1
**1.How does the average nightly price vary by neighbourhood?**

In [None]:
# Chart - 1 visualization code
# Randomly sample 20 listings from the dataset
# (this plots individual listing prices, not a per-neighbourhood average)
random_sample = air_df.sample(n=20, random_state=42)

# Create a bar chart of each sampled listing's nightly price,
# labelling every bar with the listing's neighbourhood
plt.figure(figsize=(10, 6))
plt.bar(range(1, 21), random_sample['price'], color='skyblue')
plt.xlabel('Neighbourhood of sampled listing')
plt.ylabel('Nightly Price')
plt.title('Nightly Price of 20 Randomly Sampled Listings')
plt.xticks(range(1, 21), random_sample['neighbourhood'], rotation=45, ha='right')  # Rotate x-axis labels for better readability
plt.tight_layout()
plt.show()

##### 1. Why did you pick the specific chart?

The choice of a bar chart for visualizing a random sample of 20 Airbnb listings was made for its simplicity and effectiveness in conveying the distribution of nightly prices. Bar charts are particularly useful for comparing individual categories, in this case, the different sampled listings. The vertical bars provide a clear representation of each item's price, allowing for easy comparison and interpretation. This choice aims to offer a quick, visual understanding of the pricing variation within the dataset, making it accessible to a broad audience.



##### 2. What is/are the insight(s) found from the chart?

Upon analyzing the bar chart, insights into the diversity of nightly prices among the 20 sampled Airbnb listings emerge. The varying heights of the bars illustrate the range of prices, giving a sense of the spread and distribution of accommodation costs. This insight can be valuable for both potential guests seeking diverse options and hosts aiming to position their listings competitively within the market. Additionally, the chart may highlight any outliers or patterns in pricing that could be further investigated for business optimization. Understanding the distribution of prices is a crucial step in making informed decisions related to pricing strategies and market positioning.

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

While the insights gained from the chart offer a valuable snapshot of the pricing landscape, their direct impact on business may be limited without further contextual analysis. To create a positive business impact, additional considerations such as market trends, customer reviews, and external factors influencing pricing dynamics should be taken into account. Comprehensive insights into the broader business environment can aid hosts in making strategic pricing decisions that not only attract guests but also contribute to positive reviews and sustained customer satisfaction. Therefore, while the chart provides a foundational understanding of pricing distribution, combining these insights with a holistic analysis will enhance their potential to drive positive business outcomes.
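
The chart above plots 20 individual listings, so it does not directly answer how the average price varies by neighbourhood. A minimal sketch of the grouped calculation follows. It uses a small synthetic frame (`demo_df`, invented prices) with the same column names as `air_df`; the printed values are illustrative only.

```python
import pandas as pd

# Synthetic stand-in for air_df; prices are invented for illustration
demo_df = pd.DataFrame({
    "neighbourhood": ["Harlem", "Harlem", "Astoria", "Astoria", "Williamsburg"],
    "price": [100, 140, 80, 90, 200],
})

# Mean nightly price per neighbourhood, highest first, limited to the top 10
avg_price = (
    demo_df.groupby("neighbourhood")["price"]
    .mean()
    .sort_values(ascending=False)
    .head(10)
)

print(avg_price)
```

Passing `avg_price.index` and `avg_price.values` to `plt.bar` would turn this into a chart of average price by neighbourhood. Because prices in listing data tend to be skewed by a few very expensive listings, `median()` may be a more robust choice than `mean()`.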

#### Chart - 2
**2.What can we learn from room types and their prices by area?**

In [None]:
# Chart - 2 visualization code
room_price_area_wise=air_df.groupby(['neighbourhood_group','room_type'])['price'].max().reset_index()
room_price_area_wise.sort_values(by='price',ascending=False).head(10)


In [None]:
# Take the ten highest maximum prices per (neighbourhood_group, room_type)
# directly from the grouped table instead of hard-coding the lists
top_room_prices = room_price_area_wise.sort_values(by='price', ascending=False).head(10)

# Count how often each room type appears in that top ten
room_dict = top_room_prices['room_type'].value_counts().to_dict()

fig=plt.figure(figsize=(10,5))
plt.bar(list(room_dict.keys()), list(room_dict.values()),color="pink",edgecolor="black",width=0.5)
plt.title("Room Types Among the Ten Highest Maximum Prices by Area")
plt.xlabel("Room Type")
plt.ylabel("Count in top 10")
plt.ylim(0,8)

plt.show()

##### 1. Why did you pick the specific chart?

We chose a bar chart because it compares categories (room types) in a straightforward manner. Note that the bars show how many times each room type appears among the ten highest maximum prices per neighbourhood group and room type, not the prices themselves; the prices are listed in the table above the chart.

##### 2. What is/are the insight(s) found from the chart?

The table shows which combinations of neighbourhood group and room type reach the highest maximum prices, and the chart shows which room types dominate that top ten. Because a maximum reflects the single most expensive listing in each group, these figures describe the upper end of the market rather than typical prices.

##### 3. Will the gained insights help creating a positive business impact?


Yes, the insights gained from the chart can contribute significantly to positive business impact. By understanding the relationship between room types, prices, and geographical areas, businesses can make informed decisions about pricing strategies, marketing efforts, and resource allocation. This knowledge enables optimization of revenue streams, better targeting of customer segments, and overall enhancement of business competitiveness and profitability.
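
To compare typical prices rather than maximums, one option is a pivot table of the median price for each neighbourhood group and room type. The sketch below uses a synthetic frame (`demo_df`, invented prices) with the same column names as `air_df`; the choice of the median is an assumption made here for robustness against outliers, not the notebook's original method.

```python
import pandas as pd

# Synthetic stand-in for air_df; prices are invented for illustration
demo_df = pd.DataFrame({
    "neighbourhood_group": ["Manhattan", "Manhattan", "Manhattan",
                            "Brooklyn", "Brooklyn", "Queens"],
    "room_type": ["Entire home/apt", "Entire home/apt", "Private room",
                  "Entire home/apt", "Private room", "Private room"],
    "price": [200, 300, 100, 150, 70, 60],
})

# Rows: area, columns: room type, cells: median nightly price
price_table = demo_df.pivot_table(
    index="neighbourhood_group",
    columns="room_type",
    values="price",
    aggfunc="median",
)

print(price_table)
```

Cells are `NaN` where an area has no listing of that room type. The resulting table can be passed to `sns.heatmap(price_table, annot=True)` to show both dimensions in one chart.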

#### Chart - 3
**3.What can we learn from Data? (ex: Locations, Reviews, etc)**

In [None]:
# Chart - 3 visualization code
area_reviews = air_df.groupby(['neighbourhood_group'])['number_of_reviews'].mean().reset_index()
area_reviews

In [None]:
area = area_reviews['neighbourhood_group']
review = area_reviews['number_of_reviews']
fig = plt.figure(figsize=(10,5))

plt.scatter(area,review, color="orange")
plt.xlabel("Area")
plt.ylabel("Average number of reviews")
plt.title("Average Number of Reviews by Area")
plt.show()

##### 1. Why did you pick the specific chart?

A scatter plot was chosen to show the average number of reviews per listing in each neighbourhood group. Because the x-axis is categorical and there is one point per area, a bar chart would convey the same comparison; the scatter plot simply makes the difference in level between areas easy to spot.

##### 2. What is/are the insight(s) found from the chart?

The plot shows how the average number of reviews differs between areas, highlighting which neighbourhood groups attract more reviews per listing. Since review counts are only a rough proxy for booking activity, higher averages may indicate areas with more guest traffic. Prices are not part of this chart, so it says nothing about how location or reviews relate to price.

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

The insights from the scatter plot can support business decisions. Knowing which areas receive more reviews per listing can guide marketing efforts and customer experience improvements, and it helps hosts judge how much guest activity to expect in a given location. Areas with low average review counts may signal weaker demand, which is worth investigating before investing there. Combining this view with price data would be needed to inform pricing strategies.

#### Chart - 4
**4.Which hosts are the busiest?**

In [None]:
# Chart - 4 visualization code
busy_hosts = air_df.groupby(["host_id","host_name","room_type"])['number_of_reviews'].max().reset_index()
busy_hosts = busy_hosts.sort_values(by='number_of_reviews', ascending = False).head(10)
busy_hosts

In [None]:
name_hosts = busy_hosts['host_name']
reviews_got = busy_hosts['number_of_reviews']

fig = plt.figure( figsize=(10,5) )

plt.bar(name_hosts, reviews_got, color= 'purple', width = 0.5)
plt.xlabel("Host name")
plt.ylabel("Number of reviews")
plt.title("Busiest hosts in terms of reviews")
plt.show()

##### 1. Why did you pick the specific chart?

We chose a bar chart because it is effective in visually comparing the activity levels of different hosts. The vertical bars give a clear representation of each host's review count, making it easy to identify which hosts are the busiest. The simplicity and clarity of the chart facilitate a quick understanding of the distribution of hosting activity.

##### 2. What is/are the insight(s) found from the chart?

The bar chart shows the ten hosts whose single most-reviewed listing has the highest number of reviews. The dataset contains no booking records, so review counts serve here as a proxy for how busy a host is. The varying heights of the bars indicate the relative busyness of each host. Possible reasons for their higher activity include desirable locations, competitive pricing, or effective marketing, but the chart alone does not confirm any of these.

##### 3. Will the gained insights help creating a positive business impact?


The insights derived from the chart can have a positive business impact. Identifying the busiest hosts and understanding the reasons behind their success allows for the replication of effective strategies across other hosts or areas. This information can inform business decisions related to marketing, pricing, and customer engagement. By leveraging the insights gained, a platform can optimize its services, enhance host performance, and potentially increase overall business growth and customer satisfaction.
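
The chart above ranks hosts by their single most-reviewed listing. An alternative reading of "busiest" is the total number of reviews across all of a host's listings, sketched below. The frame `demo_df` is synthetic (invented hosts and counts) and only mirrors the column names of `air_df`.

```python
import pandas as pd

# Synthetic stand-in for air_df; hosts and counts are invented for illustration
demo_df = pd.DataFrame({
    "host_id": [1, 1, 2, 3, 3, 3],
    "host_name": ["Ana", "Ana", "Ben", "Cy", "Cy", "Cy"],
    "number_of_reviews": [10, 20, 50, 5, 5, 5],
})

# Sum reviews over every listing a host owns, then rank hosts by the total
host_totals = (
    demo_df.groupby(["host_id", "host_name"], as_index=False)["number_of_reviews"]
    .sum()
    .sort_values("number_of_reviews", ascending=False)
    .head(10)
)

print(host_totals)
```

Grouping by `host_id` as well as `host_name` keeps two different hosts who share a first name from being merged into one bar.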

#### Chart - 5
**5.Minimum nights vs. price range?**

In [None]:
# Chart - 5 visualization code
minimum_nights = air_df.groupby(['minimum_nights'])['price'].mean().reset_index()
highest_price = minimum_nights.sort_values(by = 'price', ascending = False)
highest_price.head(10)

In [None]:
minimum_nights = highest_price['minimum_nights']
price_charge = highest_price['price']

fig = plt.figure(figsize=(10,5))
plt.scatter(minimum_nights, price_charge, color="purple")
plt.xlabel("minimum_nights")
plt.ylabel("Price")
plt.title("minimum nights vs. price range")

plt.show()

##### 1. Why did you pick the specific chart?

I chose a scatter chart for the Minimum nights vs. price range analysis because it effectively visualizes the relationship between two numerical variables. Each point represents the average price of all listings that share one minimum-nights value. A scatter chart allows us to observe patterns, clusters, or trends in the data points, making it suitable for understanding how the minimum nights required for booking relate to the price range. The scatter chart enables a quick identification of any correlations or dependencies between these variables.

##### 2. What is/are the insight(s) found from the chart?

The scatter chart reveals interesting insights into the relationship between minimum nights and price range. We can identify if there is any discernible pattern, such as a correlation suggesting that properties with longer minimum nights tend to have higher or lower price ranges. Additionally, outliers or clusters in the scatter plot may highlight specific groups of accommodations with unique pricing structures. This information can guide decision-making processes related to pricing strategies or policies for optimizing bookings.

##### 3. Will the gained insights help creating a positive business impact?


Insights derived from the scatter chart can contribute significantly to a positive business impact. Understanding how minimum nights correlate with price ranges can inform pricing decisions and booking policies. For example, if there's a positive correlation, adjusting minimum nights may provide an opportunity to optimize revenue. Conversely, if there's no clear relationship, it may suggest that minimum nights and pricing can be independently optimized to cater to different customer preferences. This knowledge empowers businesses to make data-driven decisions that enhance customer satisfaction and overall profitability.
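
Whether minimum nights and price are actually correlated can be checked numerically instead of read off the scatter chart. The sketch below computes the Pearson coefficient and a Spearman coefficient (Pearson applied to ranks) on a synthetic frame (`demo_df`, invented values). Applying it to `air_df` is left as an assumption about how the notebook might be extended.

```python
import pandas as pd

# Synthetic stand-in for air_df; values are invented for illustration
demo_df = pd.DataFrame({
    "minimum_nights": [1, 2, 3, 7, 30],
    "price": [100, 120, 90, 150, 80],
})

# Pearson: strength of the linear relationship
pearson_r = demo_df["minimum_nights"].corr(demo_df["price"])

# Spearman: Pearson correlation of the ranks, less sensitive to outliers
ranked = demo_df.rank()
spearman_r = ranked["minimum_nights"].corr(ranked["price"])

print(f"Pearson r:  {pearson_r:.3f}")
print(f"Spearman r: {spearman_r:.3f}")
```

Values near 0 suggest the two variables can be tuned independently, while values near +1 or -1 suggest that changing the minimum stay tends to go along with a change in price. Neither coefficient shows causation.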

#### Chart - 6
**6.What is the seasonal variation in Airbnb prices?**

In [None]:
# Chart - 6 visualization code
# Note: the dataset has no booking dates, so true seasonality cannot be measured.
# As a stand-in, plot the prices of a random sample of 20 listings.
random_sample = air_df.sample(n=20, random_state=42).copy()

# Create a synthetic ordering (1..20) for the sample; it carries no calendar meaning
random_sample['pseudo_time'] = np.arange(1, 21)

# Each pseudo time value holds one listing, so the mean equals that listing's price
average_prices_by_pseudo_time = random_sample.groupby('pseudo_time')['price'].mean().reset_index()

# Create a line chart of price across the pseudo time index
plt.figure(figsize=(10, 6))
plt.plot(average_prices_by_pseudo_time['pseudo_time'], average_prices_by_pseudo_time['price'], marker='o')
plt.xlabel('Pseudo Time (Random Sample)')
plt.ylabel('Price')
plt.title('Price Across a Random Sample of 20 Listings (Pseudo-Time Index)')
plt.tight_layout()
plt.show()

##### 1. Why did you pick the specific chart?

A line chart was chosen because it is the usual way to show how a value changes along an ordered axis. Here the x-axis is a synthetic "pseudo time" index assigned to 20 randomly sampled listings, because the dataset contains no booking dates and the only date column, `last_review`, was dropped during wrangling.

##### 2. What is/are the insight(s) found from the chart?

The chart shows how widely prices vary from one sampled listing to the next. Because the pseudo time index is arbitrary, the peaks and troughs reflect differences between listings, not high- or low-demand periods, so no seasonal conclusion can be drawn from this chart alone. Measuring seasonality would require dated price or booking data, for example to check whether prices rise during peak travel seasons or specific events.

##### 3. Will the gained insights help creating a positive business impact?


With dated data, hosts could adjust pricing across seasons to improve occupancy rates and revenue, and travelers could plan trips for periods with more favorable pricing. The current chart does not support such decisions directly, since it relies on 20 listings and a synthetic time axis. Expanding the dataset with dated prices and additional factors such as local events or market trends would be needed before drawing business conclusions about seasonality.

#### Chart - 7
**7.Which room type reviews are higher?**

In [None]:
# Chart - 7 visualization code
room_type = air_df.groupby(["room_type"])['number_of_reviews'].max().reset_index()
reviews = room_type.sort_values(by='number_of_reviews', ascending = False)
reviews

In [None]:
rooms_type=reviews["room_type"]
reviews_for_rooms=reviews["number_of_reviews"]

fig=plt.figure(figsize=(10,6))
plt.bar(rooms_type,reviews_for_rooms)

#add details
plt.title("Reviews for room types")
plt.xlabel("Room type")
plt.ylabel("number_of_reviews")

#show you chart
plt.show()

##### 1. Why did you pick the specific chart?

I chose a bar chart because it makes it easy to compare review counts across room types. Each bar shows the highest number of reviews received by any single listing of that room type, so the chart quickly shows which room type holds the most-reviewed listing.

##### 2. What is/are the insight(s) found from the chart?

The bar chart shows which room type contains the most-reviewed listing and how the other room types compare. Note that `number_of_reviews` is a count, not a rating, so it reflects how often listings are reviewed rather than how satisfied guests were. A maximum also describes only one listing per room type; an average or median would better represent typical review activity.

##### 3. Will the gained insights help creating a positive business impact?


The insights can support the business if read as a rough indicator of guest traffic. Room types with heavily reviewed listings are likely booked often, so the business can focus on promoting and optimizing them, which may lead to increased bookings and positive word-of-mouth. To guide improvements in customer satisfaction, this chart would need to be combined with rating or review-text data, which this dataset does not include.



#### Chart - 8
**8.Which neighborhoods have the highest average number of reviews for Airbnb listings?**

In [None]:
# Chart - 8 visualization code
# Group by neighbourhood and calculate the average number of reviews per listing
average_reviews_by_neighbourhood = air_df.groupby('neighbourhood')['number_of_reviews'].mean().reset_index()

# Sort by average number of reviews in descending order and keep the top 10
average_reviews_by_neighbourhood = average_reviews_by_neighbourhood.sort_values(by='number_of_reviews', ascending=False).head(10)

# Create a bar chart
plt.figure(figsize=(10, 6))
plt.bar(average_reviews_by_neighbourhood['neighbourhood'], average_reviews_by_neighbourhood['number_of_reviews'], color='skyblue')
plt.xlabel('neighbourhood')
plt.ylabel('Average Number of Reviews')
plt.title('Top 10 Neighborhoods by Average Number of Reviews')
plt.xticks(rotation=45, ha='right')  # Rotate x-axis labels for better readability
plt.tight_layout()
plt.show()

##### 1. Why did you pick the specific chart?

I chose a bar chart to compare the average number of reviews per listing across neighborhoods because it allows a straightforward comparison between categories. The vertical bars make it easy to identify which neighborhoods have the highest averages. The dataset contains no rating scores, so the review count is used here as the closest available measure of guest engagement.

##### 2. What is/are the insight(s) found from the chart?

The bar chart shows the ten neighborhoods whose listings receive the most reviews on average. This insight can be valuable for both hosts and potential guests. Hosts in these neighborhoods may emphasize the area's popularity in their listings, while guests can use the information to find neighborhoods with many reviewed stays. Neighborhoods with only a few listings can reach a high average through a single heavily reviewed listing, so the number of listings behind each bar should be checked before drawing conclusions.

##### 3. Will the gained insights help creating a positive business impact?


The insights derived from the chart can have a positive impact on business decisions. For Airbnb hosts, understanding which neighborhoods tend to receive more reviews allows them to market their properties strategically and potentially attract more guests. For Airbnb as a platform, showcasing neighborhoods with many reviewed stays can enhance the user experience and instill confidence in users. Because review counts do not measure satisfaction, these insights should be combined with rating data, where available, before they are used to judge quality.

#### Chart - 9
**9.What is the average availability throughout the year?**

In [None]:
# Chart - 9 visualization code
avg_availability=air_df.groupby("neighbourhood_group")["availability_365"].mean().reset_index()
order_of_availability=avg_availability.sort_values("availability_365")
order_of_availability

In [None]:
# Plotting the bar chart
plt.figure(figsize=(10, 6))
plt.bar(order_of_availability["neighbourhood_group"], order_of_availability["availability_365"], color='skyblue', width=0.5)
plt.xlabel('neighbourhood_group')
plt.ylabel('availability_365')
plt.title('Average availability throughout the year')
plt.xticks(rotation=60, ha='right')  # Rotate x-axis labels for better readability
plt.tight_layout()

# Show the plot
plt.show()

##### 1. Why did you pick the specific chart?

The choice of a bar chart was deliberate as it provides a clear and concise representation of average availability. Bar charts are effective in displaying variations across categories (in this case, neighbourhood groups) and are easily understandable. Sorting the bars by value lets stakeholders quickly see which areas have the lowest and highest average number of available days per year.

##### 2. What is/are the insight(s) found from the chart?

The bar chart shows how `availability_365`, the number of days per year a listing is available to book, differs on average between neighbourhood groups. Low average availability may indicate areas where listings are booked more often or offered for fewer days, while high availability may indicate spare capacity. The chart does not show seasonal patterns, because the data is aggregated over the whole year rather than by month.

##### 3. Will the gained insights help creating a positive business impact?


The insights derived from average availability can have a positive impact on business operations. Hosts and the platform can align resources and marketing efforts with the areas identified, for instance by running promotions in neighbourhood groups with high availability to improve occupancy, or by reviewing prices where availability is scarce. This data-driven approach aids in making informed decisions that contribute to the overall success and sustainability of the business.

#### Chart - 10
**10.Pie Chart of Room Type Distribution:**

In [None]:
# Chart - 10 visualization code
# Count the occurrences of each room type
room_type_counts = air_df['room_type'].value_counts()

# Create a pie chart
fig=plt.figure(figsize=(11,7))
plt.pie(room_type_counts, labels=room_type_counts.index, autopct='%1.1f%%', startangle=90, colors=['#66b3ff','#99ff99','#ffcc99'])
plt.title('Distribution of Room Types in Airbnb Listings')
plt.show()

##### 1. Why did you pick the specific chart?

A pie chart was chosen to represent the Room Type Distribution because it provides a visually intuitive way to showcase the proportionate distribution of different room types. Each segment of the pie chart corresponds to a specific room type, making it easy for viewers to grasp the relative size or percentage of each category in the overall distribution.

##### 2. What is/are the insight(s) found from the chart?

The pie chart reveals the distribution of room types, indicating which types are more prevalent or scarce. By analyzing the chart, we can identify the dominant room types and understand their share in the overall distribution. This insight can be valuable for property management and marketing strategies, helping businesses focus their efforts on promoting or optimizing the most common room types.

##### 3. Will the gained insights help creating a positive business impact?


Absolutely. The insights gained from the pie chart can guide strategic decisions for positive business impact. Understanding the distribution of room types allows for tailored marketing campaigns, pricing strategies, and resource allocation. By aligning business operations with the prevalent room types, a company can enhance customer satisfaction, improve occupancy rates, and optimize revenue streams, ultimately contributing to a positive impact on the overall business performance.

## **5. Solution to Business Objective**

#### What do you suggest the client do to achieve the Business Objective?

**1.Optimize Pricing Strategy:**

A)Implement dynamic pricing strategies based on room types and demand trends.

B)Consider offering discounts during low-demand periods to attract more bookings.

C)Regularly monitor competitors' pricing and adjust rates to remain competitive.

**2.Enhance Customer Experience:**

A)Invest in amenities or services that received positive reviews from guests.

B)Use customer feedback to address any concerns or areas for improvement.

C)Create personalized experiences or promotions for returning guests.

**3.Increase Occupancy Rates:**

A)Launch targeted marketing campaigns during peak seasons to attract more bookings.

B)Offer special promotions or packages to encourage off-season bookings.

C)Optimize online presence and visibility through strategic partnerships or advertising.

**4.Optimize Property Management:**

A)Allocate resources based on high-demand areas and popular room types.

B)Utilize data-driven insights to streamline operational processes and improve efficiency.

C)Implement technology solutions to enhance property management and guest experiences.

**5.Competitor Analysis and Positioning:**

A)Identify unique selling points and emphasize them in marketing efforts.

B)Continuously monitor competitors' strategies and adjust positioning accordingly.

C)Differentiate the property by offering unique experiences or services.

# **Conclusion**

In conclusion, the exploratory data analysis of the Airbnb dataset has yielded valuable insights that can strategically shape the business. The examination of room type distribution and pricing patterns offers opportunities for dynamic pricing strategies, maximizing revenue. Insights from customer reviews enable the enhancement of the overall customer experience by investing in highlighted amenities. Seasonal trends in occupancy rates guide targeted marketing efforts for increased bookings, while data-driven property management optimization promises efficiency gains. Comparative analysis against competitors provides a foundation for refining unique selling points and adapting market positioning. Collectively, these insights translate into actionable strategies, positioning the client to optimize revenue, enhance customer satisfaction, and maintain competitiveness in the dynamic Airbnb marketplace.

### ***Thank you!!!***