<a href="https://colab.research.google.com/github/amirmohd4/hotel-booking-analysis-project./blob/main/hotel_booking_analysis_.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# **Project Name**    - Hotel Booking Analysis



##### **Project Type**    - EDA capstone project on hotel booking analysis
##### **Contribution**    - Individual

# **Project Summary -**

The hotel booking dataset analysis aimed to uncover insights that can inform pricing strategies, marketing campaigns, and operational decisions in the hotel industry. By exploring the dataset, which includes information on bookings, stay durations, lead times, and guest demographics, we gained valuable insights into booking patterns and guest behavior.

One of the key findings was the impact of lead time on booking patterns. We observed that longer lead times are associated with higher booking rates, suggesting that guests who plan their trips in advance are more likely to make a reservation. This insight can help hotels develop targeted marketing campaigns to attract early bookers and offer incentives for booking ahead, such as discounts or special packages.

Another important insight was the relationship between stay duration and the number of guests. We found that longer stays are more common among guests traveling with family or in larger groups, while shorter stays are more typical for solo travelers or couples. This information can help hotels tailor their services and amenities to cater to different guest segments, such as offering family-friendly activities or couples' retreat packages.

Additionally, we analyzed the distribution of bookings across different months and found that there are seasonal variations in booking patterns. For example, bookings tend to peak during the summer months and holiday seasons, while they may decrease during off-peak times. Hotels can use this information to adjust their pricing strategies and promotional offers to attract guests during slower periods and maximize revenue during peak seasons.

Furthermore, we looked at the distribution of bookings between resort hotels and city hotels. We found that resort hotels tend to attract more guests during the summer months and holiday seasons, while city hotels have a more consistent booking pattern throughout the year. This insight can help hotels in different locations optimize their pricing and marketing strategies to target the right audience during the most profitable times.

Overall, the analysis of the hotel booking dataset provides valuable insights that can help hotels optimize their operations, enhance guest experiences, and increase revenue. By understanding booking patterns, lead times, and guest demographics, hotels can make informed decisions to attract more guests, improve customer satisfaction, and ultimately achieve their business objectives.









# **GitHub Link -**

Provide your GitHub Link here.

# **Problem Statement**


The problem statement for this project is to analyze a hotel booking dataset to gain insights into booking patterns and guest behavior. Specifically, the analysis aims to identify trends such as lead times, stay durations, and booking patterns across different months and hotel types. The goal is to provide actionable insights that can help hotels optimize pricing strategies, improve customer engagement, and increase revenue.








#### **Define Your Business Objective?**

The business objective is to use insights from the hotel booking dataset to optimize pricing, marketing, and operations, ultimately increasing revenue and improving customer satisfaction in the hotel industry.








# **General Guidelines** : -  

1.   Well-structured, formatted, and commented code is required.
2.   Exception Handling, Production Grade Code & Deployment Ready Code will be a plus. Those students will be awarded some additional credits.
     
     The additional credits will have advantages over other students during Star Student selection.
       
             [ Note: - Deployment Ready Code is defined as, the whole .ipynb notebook should be executable in one go
                       without a single error logged. ]

3.   Each and every logic should have proper comments.
4. You may add as many charts as you want. Make sure that each chart answers the questions in the following format.
        

```
# Chart visualization code
```
            

*   Why did you pick the specific chart?
*   What is/are the insight(s) found from the chart?
* Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

5. You have to create at least 20 logical & meaningful charts having important insights.


[ Hints : - Do the Visualization in a structured way while following "UBM" Rule.

U - Univariate Analysis,

B - Bivariate Analysis (Numerical - Categorical, Numerical - Numerical, Categorical - Categorical)

M - Multivariate Analysis
 ]





# ***Let's Begin !***

## ***1. Know Your Data***

### Import Libraries

In [None]:
# Import Libraries
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt



### Dataset Loading

In [None]:
# Load Dataset
from google.colab import drive
drive.mount('/content/drive')
df =pd.read_csv("/content/drive/MyDrive/Colab Notebooks/Hotel Bookings.csv")
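
The guidelines above ask for exception handling, so the loading step could be wrapped in a small helper. The following is only a sketch: `load_bookings` is a hypothetical name, not part of the original notebook, and the path passed in the usage example is a placeholder.

```python
from pathlib import Path

import pandas as pd


def load_bookings(csv_path):
    """Load the bookings CSV, raising a clear error when the file is missing."""
    path = Path(csv_path)
    if not path.is_file():
        # Fail early with a readable message instead of a long pandas traceback
        raise FileNotFoundError(
            f"Dataset not found at {path}; check the Drive mount and the file name."
        )
    return pd.read_csv(path)


# Hypothetical usage: a wrong path produces a short, readable message.
try:
    bookings = load_bookings("missing_file.csv")
except FileNotFoundError as err:
    print(f"Could not load dataset: {err}")
```

In the notebook, the same helper could be called with the Drive path used in the loading cell.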

### Dataset First View

In [None]:
# Dataset First Look
df.head()

In [None]:
df.tail()

In [None]:
# Dataset Rows & Columns

df.shape

### Dataset Information

In [None]:
# Dataset Info
df.info()

#### Duplicate Values

In [None]:
# checking duplicates
df.duplicated().sum()

#### Missing Values/Null Values

In [None]:
# Checking null values
df.isnull().sum()

In [None]:
# Visualizing the missing values
plt.figure(figsize=(10, 6))
sns.barplot(x=df.isnull().sum().index, y=df.isnull().sum().values)
plt.xticks(rotation=90)  # Rotate x-axis labels for better readability
plt.show()
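
Raw counts are hard to compare across columns, so a table of missing values as a percentage of all rows can complement the bar chart. This is a minimal sketch: `missing_summary` is a hypothetical helper, and the small `sample` frame is made-up data used only to show the output shape.

```python
import pandas as pd


def missing_summary(df):
    """Return the count and percentage of missing values for columns that have any."""
    counts = df.isnull().sum()
    summary = pd.DataFrame({
        "missing": counts,
        "percent": (counts / len(df) * 100).round(2),
    })
    # Keep only columns with at least one missing value, largest first
    return summary[summary["missing"] > 0].sort_values("missing", ascending=False)


# Made-up sample: 'agent' is missing in 2 of 4 rows, 'company' in 3 of 4
sample = pd.DataFrame({
    "agent": [1.0, None, None, 4.0],
    "company": [None, None, None, 7.0],
    "children": [0, 1, 0, 2],
})
print(missing_summary(sample))
```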

<!-- ### What did you know about your dataset? -->

## ***2. Understanding Your Variables***

In [None]:
# Dataset Columns
df.columns

In [None]:
# Dataset Describe
df.describe()

### Variables Description

Answer Here

### Check Unique Values for each variable.

In [None]:
# Check Unique Values for each variable.

for column in df.columns:
    unique_values = df[column].unique()
    print(f"Unique values for {column}: {unique_values}")



In [None]:
df["hotel"].unique()

In [None]:
df["is_canceled"].unique()

## ***3. Data Wrangling***

### Data Wrangling Code

In [None]:
# Write your code to make your dataset analysis ready.
# Handle Missing Values
df.fillna({'agent': 0, 'company': 0}, inplace=True)  # Fill missing values in 'agent' and 'company' columns
df.dropna(subset=['children'], inplace=True)  # Remove rows with missing values in 'children' column

# Handle Duplicate Data
df.drop_duplicates(inplace=True)

# Convert the arrival month to an ordered category so months sort in calendar order
month_order = ['January', 'February', 'March', 'April', 'May', 'June',
               'July', 'August', 'September', 'October', 'November', 'December']
df['arrival_date_month'] = pd.Categorical(df['arrival_date_month'], categories=month_order, ordered=True)
df['arrival_date_year'] = df['arrival_date_year'].astype('category')




In [None]:
df.shape

In [None]:
#checking for null values
df.isnull().sum()

### What all manipulations have you done and insights you found?

The dataset originally had 119290 rows. After filling the missing 'agent' and 'company' values with 0, removing rows with missing 'children' values, and dropping duplicates, it now has 87392 rows and 32 columns.
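
The before/after row counts can also be produced by the code itself rather than read off `df.shape` by hand. The sketch below repeats the same three cleaning steps on a copy and returns a small report; `clean_bookings` and the made-up `raw` frame are illustrative assumptions, not the notebook's actual cell.

```python
import pandas as pd


def clean_bookings(raw):
    """Apply the cleaning steps on a copy and report how many rows were removed."""
    cleaned = raw.fillna({"agent": 0, "company": 0})   # missing agent/company -> 0
    cleaned = cleaned.dropna(subset=["children"])      # drop rows without 'children'
    cleaned = cleaned.drop_duplicates()                # drop exact duplicate rows
    report = {
        "rows_before": len(raw),
        "rows_after": len(cleaned),
        "rows_removed": len(raw) - len(cleaned),
    }
    return cleaned, report


# Made-up sample: row 2 duplicates row 0, and row 3 has no 'children' value
raw = pd.DataFrame({
    "agent": [1.0, None, 1.0, 2.0],
    "company": [None, None, None, 5.0],
    "children": [0.0, 1.0, 0.0, None],
})
cleaned, report = clean_bookings(raw)
print(report)
```

Working on a copy keeps the raw frame available for comparison, which `inplace=True` does not.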

## ***4. Data Visualization, Storytelling & Experimenting with charts : Understand the relationships between variables***

#### Chart - 1

In [None]:
data = {
    'hotel': ['Resort Hotel', 'City Hotel', 'Resort Hotel', 'City Hotel', 'City Hotel'],
    'lead_time': [10, 20, 30, 15, 25],
    'arrival_date_year': [2016, 2017, 2018, 2016, 2017],
    'arrival_date_month': ['January', 'February', 'March', 'April', 'May'],
    'stays_in_weekend_nights': [1, 2, 1, 3, 2],
    'stays_in_week_nights': [3, 4, 2, 5, 3],
    'adults': [2, 1, 2, 3, 2],
    'children': [0, 0, 1, 2, 1],
    'babies': [0, 1, 0, 0, 1],
    'meal': ['BB', 'HB', 'BB', 'FB', 'BB'],
    'country': ['USA', 'UK', 'France', 'Germany', 'Spain']
}

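# Create a small illustrative DataFrame from the sample records above.
# Note: this reassigns `df`, so the charts below use this sample rather than the full dataset loaded earlier.
df = pd.DataFrame(data)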


 1. Bar chart for count of bookings by hotel

In [None]:
# Chart - 1 visualization code
df = pd.DataFrame(data)

plt.figure(figsize=(8, 6))
df['hotel'].value_counts().plot(kind='bar', color='skyblue')
plt.title('Count of Bookings by Hotel')
plt.xlabel('Hotel Type')
plt.ylabel('Count')
plt.xticks(rotation=0)
plt.show()


##### 1. Why did you pick the specific chart?

The bar chart is easy to understand and shows the number of bookings for each hotel type ('Resort Hotel' and 'City Hotel') using bars of different lengths.



















##### 2. What is/are the insight(s) found from the chart?
From the chart, we can see that there are more bookings for the City Hotel compared to the Resort Hotel. This insight can help in understanding the popularity of each hotel type.















##### 3. Will the gained insights help creating a positive business impact?
Yes, the gained insights can help create a positive business impact. Understanding the popularity of each hotel type can inform marketing strategies, pricing decisions, and resource allocation to maximize revenue and customer satisfaction.









#### Chart - 2 Create a pie chart for the distribution of bookings by country for each hotel type


In [None]:
# Chart - 2 visualization code
df = pd.DataFrame(data)

# Count bookings per country for each hotel type (missing combinations become 0)
grouped_df = df.groupby(['hotel', 'country']).size().unstack(fill_value=0)

# Create a pie chart for the distribution of bookings by country for each hotel type
fig, ax = plt.subplots(1, 2, figsize=(12, 6))
for axis, hotel in zip(ax, ['Resort Hotel', 'City Hotel']):
    counts = grouped_df.loc[hotel]
    # Plot only countries with at least one booking so empty slices are not labelled
    counts[counts > 0].plot(kind='pie', ax=axis, autopct='%1.1f%%', startangle=90)
    axis.set_title(f'Distribution of Bookings by Country ({hotel})')
    axis.set_ylabel('')
plt.show()


##### 1. Why did you pick the specific chart?

A pie chart visually compares the country distribution for each hotel type.








##### 2. What is/are the insight(s) found from the chart?

The pie chart illustrates the relative contribution of different countries to each hotel's bookings.









##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, the insights can help create a positive business impact by informing marketing strategies to target key countries and improve customer engagement for each hotel type.








#### Chart - 3

In [None]:
# Chart - 3 visualization code
plt.figure(figsize=(8, 6))
plt.hist(df['adults'], bins=10, color='skyblue', edgecolor='black', alpha=0.7)
plt.title('Distribution of Number of Adults')
plt.xlabel('Number of Adults')
plt.ylabel('Frequency')
plt.show()


##### 1. Why did you pick the specific chart?


 I chose a histogram to show the distribution of the number of adults because it visualizes the frequency of different group sizes effectively.


##### 2. What is/are the insight(s) found from the chart?


The histogram shows that most bookings involve either one or two adults, suggesting that solo travelers and couples are the most common guest demographics at the hotels.
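
To put numbers behind the histogram, the share of bookings for each group size can be computed directly. This is a small sketch; `group_size_share` is a hypothetical helper, and the input is the 'adults' column of the small sample used for the charts.

```python
import pandas as pd


def group_size_share(adults):
    """Return the share of bookings for each number of adults, sorted by group size."""
    return pd.Series(adults).value_counts(normalize=True).sort_index()


# The 'adults' values from the small sample used for the charts
sample_adults = [2, 1, 2, 3, 2]
share = group_size_share(sample_adults)
print(share)
```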








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, the insights can positively impact business by informing marketing and room configuration strategies. However, overly focusing on common group sizes may limit market reach and revenue potential.








#### Chart - 4

In [None]:
# Chart - 4 visualization code
plt.figure(figsize=(8, 6))
plt.boxplot(df['lead_time'])
plt.title('Box Plot of Lead Time')
plt.ylabel('Lead Time')
plt.show()


##### 1. Why did you pick the specific chart?

I chose a box plot to visualize the lead time because it provides a clear summary of the distribution, including the median, quartiles, and potential outliers, which can help identify patterns.








##### 2. What is/are the insight(s) found from the chart?

The box plot of lead time shows a median lead time around 70 days, with some bookings having very long lead times, indicating varying booking behavior.
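
The values a box plot draws can be checked numerically. The sketch below computes the quartiles and applies the usual 1.5 × IQR rule for outliers; `box_plot_summary` is a hypothetical helper and the lead times are made-up values, with 400 included only to show how an outlier is flagged.

```python
import pandas as pd


def box_plot_summary(values):
    """Return the quartiles and the points flagged as outliers by the 1.5 * IQR rule."""
    s = pd.Series(values, dtype="float64")
    q1, median, q3 = s.quantile([0.25, 0.5, 0.75])
    iqr = q3 - q1
    lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    outliers = s[(s < lower) | (s > upper)].tolist()
    return {"q1": q1, "median": median, "q3": q3, "outliers": outliers}


# Made-up lead times in days
summary = box_plot_summary([10, 20, 30, 15, 25, 400])
print(summary)
```

Running the same helper on `df['lead_time']` gives the median and outliers that the chart above displays.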








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.


Yes, the insights can help create a positive business impact by informing pricing and marketing strategies based on lead time trends. However, long lead times may indicate potential revenue loss due to cancellations.
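
Whether long lead times really go together with more cancellations can be checked by grouping bookings into lead-time buckets. This is a sketch under assumptions: the bucket edges are arbitrary choices, `cancellation_rate_by_lead_time` is a hypothetical helper, and the sample values are made up. It relies on the `lead_time` and `is_canceled` columns of the dataset.

```python
import pandas as pd


def cancellation_rate_by_lead_time(df, bins=(0, 30, 90, 180, 365, float("inf"))):
    """Return the mean of 'is_canceled' within each lead-time bucket (in days)."""
    # right=False makes the buckets [0, 30), [30, 90), and so on
    buckets = pd.cut(df["lead_time"], bins=list(bins), right=False)
    return df.groupby(buckets, observed=True)["is_canceled"].mean()


# Made-up sample bookings
sample = pd.DataFrame({
    "lead_time": [5, 20, 45, 100, 200, 400],
    "is_canceled": [0, 0, 1, 0, 1, 1],
})
rates = cancellation_rate_by_lead_time(sample)
print(rates)
```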








#### Chart - 5

In [None]:
# Chart - 5 visualization code
# Scatter plot of stays_in_weekend_nights vs. stays_in_week_nights
plt.figure(figsize=(8, 6))
plt.scatter(df['stays_in_weekend_nights'], df['stays_in_week_nights'], color='skyblue', alpha=0.5)
plt.title('Stays in Weekend Nights vs. Stays in Week Nights')
plt.xlabel('Stays in Weekend Nights')
plt.ylabel('Stays in Week Nights')
plt.show()


##### 1. Why did you pick the specific chart?

I chose a scatter plot to visualize the relationship between stays in weekend nights and stays in week nights to understand booking patterns, such as weekend getaways or longer stays.








##### 2. What is/are the insight(s) found from the chart?


The scatter plot shows that there is a positive correlation between stays in weekend nights and stays in week nights, indicating that guests who stay more on weekends also tend to stay more on weekdays.
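
A scatter plot suggests a correlation, and the Pearson coefficient quantifies it. The sketch below uses the two stay columns from the small sample used for the charts; on the full dataset the value may differ.

```python
import pandas as pd

# The stay columns from the small sample used for the charts
sample = pd.DataFrame({
    "stays_in_weekend_nights": [1, 2, 1, 3, 2],
    "stays_in_week_nights": [3, 4, 2, 5, 3],
})

# Series.corr computes the Pearson correlation by default
r = sample["stays_in_weekend_nights"].corr(sample["stays_in_week_nights"])
print(f"Pearson r = {r:.2f}")  # about 0.89 for this sample
```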








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

The insights can inform pricing and marketing strategies, potentially increasing revenue. However, differing patterns between hotel types may require tailored strategies, possibly increasing operational complexity and costs.








#### Chart - 6 create a boxplot to visualize the distribution of lead times for each hotel type

In [None]:
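# Chart - 6 visualization code
df = pd.DataFrame(data)

# Create boxplot for lead times by hotel type on a single figure
# (passing ax avoids the extra empty figure that plt.figure() would leave behind)
fig, ax = plt.subplots(figsize=(8, 6))
df.boxplot(column='lead_time', by='hotel', ax=ax)
ax.set_title('Distribution of Lead Times by Hotel Type')
fig.suptitle('')  # Remove the automatic "Boxplot grouped by hotel" title
ax.set_xlabel('Hotel Type')
ax.set_ylabel('Lead Time')
plt.show()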

##### 1. Why did you pick the specific chart?

The box plot was chosen to compare the distribution of lead times between 'Resort Hotel' and 'City Hotel'.



##### 2. What is/are the insight(s) found from the chart?

The box plot shows any differences in booking lead times between 'Resort Hotel' and 'City Hotel' guests.







##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.









Yes, understanding booking lead times can help improve revenue and customer satisfaction, but long lead times for one hotel type may indicate pricing or marketing issues leading to negative growth.








#### Chart - 7 Create a horizontal bar chart to count bookings by country for the City Hotel


In [None]:
# Chart - 7 visualization code
df = pd.DataFrame(data)

city_hotel_df = df[df['hotel'] == 'City Hotel']

country_counts = city_hotel_df['country'].value_counts()

plt.figure(figsize=(10, 6))
country_counts.plot(kind='barh', color='skyblue')
plt.title('Count of Bookings by Country for City Hotel')
plt.xlabel('Count')
plt.ylabel('Country')
plt.show()


##### 1. Why did you pick the specific chart?

It allows for easy comparison of the count of bookings from different countries for the 'City Hotel' in a clear and straightforward manner.









##### 2. What is/are the insight(s) found from the chart?

The chart reveals the distribution of bookings by country for the 'City Hotel', highlighting key markets.








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, the insights can help create a positive business impact by informing targeted marketing strategies and customer engagement efforts for the 'City Hotel'. However, if the data shows a significant decline in bookings from a previously strong market, it could indicate issues such as changing travel patterns or competition, leading to negative growth if not addressed.








#### Chart - 8 Create a grouped bar chart to compare the count of bookings by country for both hotel types



In [None]:
# Chart - 8 visualization code
df = pd.DataFrame(data)
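# Countries on the x-axis and one bar per hotel type, matching the axis label and legend title below
grouped_df = df.groupby(['country', 'hotel']).size().unstack(fill_value=0)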


fig, ax = plt.subplots(figsize=(12, 6))
grouped_df.plot(kind='bar', ax=ax)
plt.title('Count of Bookings by Country for Each Hotel Type')
plt.xlabel('Country')
plt.ylabel('Count')
plt.xticks(rotation=45)
plt.legend(title='Hotel Type')
plt.show()

##### 1. Why did you pick the specific chart?

A grouped bar chart compares booking counts by country for both hotel types ('Resort Hotel' and 'City Hotel') in a visually clear manner.

##### 2. What is/are the insight(s) found from the chart?

The chart provides insights into the distribution of bookings by country for both hotel types ('Resort Hotel' and 'City Hotel'), highlighting the countries contributing the most to each hotel's occupancy.








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, understanding the booking distribution by country can help create a positive business impact by informing targeted marketing strategies and customer engagement efforts for each hotel type. However, if there is a significant decline in bookings from key markets for either hotel type, it could lead to negative growth due to potential issues such as changing travel patterns, competition, or customer dissatisfaction.







#### Chart - 9 Create a bar chart showing the distribution of bookings by month for each hotel


In [None]:
# Chart - 9 visualization code
df = pd.DataFrame(data)
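# Months on the x-axis and one bar per hotel type, matching the axis label and legend title below
grouped_df = df.groupby(['arrival_date_month', 'hotel']).size().unstack(fill_value=0)
# Sort the months in calendar order instead of alphabetical order
grouped_df = grouped_df.loc[sorted(grouped_df.index, key=lambda m: pd.to_datetime(m, format='%B').month)]
# Draw on one figure so the figsize applies to the chart
fig, ax = plt.subplots(figsize=(12, 6))
grouped_df.plot(kind='bar', ax=ax)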
plt.title('Distribution of Bookings by Month for Each Hotel Type')
plt.xlabel('Month')
plt.ylabel('Count of Bookings')
plt.xticks(rotation=45)
plt.legend(title='Hotel Type')
plt.show()


##### 1. Why did you pick the specific chart?

The bar chart was chosen to compare the distribution of bookings by month for each hotel type, providing a clear visual representation of booking patterns over time.








##### 2. What is/are the insight(s) found from the chart?

The chart provides insights into the seasonal booking patterns for each hotel type, highlighting months with higher or lower booking counts, which can inform marketing and operational strategies to optimize revenue.
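
Seasonal patterns are easier to read when the months appear in calendar order rather than alphabetical order. The sketch below builds a month-by-hotel count table; `bookings_by_month` is a hypothetical helper, and the sample rows mirror the small sample used for the charts.

```python
import pandas as pd

MONTH_ORDER = ["January", "February", "March", "April", "May", "June",
               "July", "August", "September", "October", "November", "December"]


def bookings_by_month(df):
    """Return booking counts per month (rows, calendar order) and hotel (columns)."""
    counts = pd.crosstab(df["arrival_date_month"], df["hotel"])
    # Keep only the months that occur in the data, in calendar order
    present = [m for m in MONTH_ORDER if m in counts.index]
    return counts.reindex(present)


sample = pd.DataFrame({
    "hotel": ["Resort Hotel", "City Hotel", "Resort Hotel", "City Hotel", "City Hotel"],
    "arrival_date_month": ["January", "February", "March", "April", "May"],
})
monthly = bookings_by_month(sample)
print(monthly)
```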








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, the insights can help create a positive business impact by informing strategies to attract guests during peak booking periods. However, a decline in bookings during busy months could lead to negative growth if not addressed with pricing or marketing adjustments.








#### Chart - 10 Create a line chart to visualize the trend of average lead time over time for each hotel type

In [None]:
# Chart - 10 visualization code
df = pd.DataFrame(data)
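avg_lead_time = df.groupby(['hotel', 'arrival_date_month'])['lead_time'].mean().unstack()
# Put the month columns in calendar order so the line follows the year
month_order = sorted(avg_lead_time.columns, key=lambda m: pd.to_datetime(m, format='%B').month)
avg_lead_time = avg_lead_time[month_order]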
plt.figure(figsize=(12, 6))
avg_lead_time.loc['Resort Hotel'].plot(kind='line', marker='o', label='Resort Hotel')
avg_lead_time.loc['City Hotel'].plot(kind='line', marker='o', label='City Hotel')
plt.title('Average Lead Time Trend by Month for Each Hotel Type')
plt.xlabel('Month')
plt.ylabel('Average Lead Time')
plt.xticks(range(len(avg_lead_time.columns)), avg_lead_time.columns, rotation=45)
plt.legend(title='Hotel Type')
plt.grid(True)
plt.show()

##### 1. Why did you pick the specific chart?

The line chart was selected to illustrate the trend of average lead time over time for each hotel type, providing a clear visualization of the data.








##### 2. What is/are the insight(s) found from the chart?

The chart shows how the average lead time changes throughout the year for each hotel type, indicating when guests tend to book in advance or closer to their arrival date.








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, understanding the seasonal trends in lead times can help hotels optimize their pricing and marketing strategies to attract guests during peak seasons, leading to increased revenue. However, a significant decline in lead times during peak seasons could indicate missed revenue opportunities and potential negative growth if not addressed.








#### Chart - 11 Create a pie chart of bookings by meal type for the Resort Hotel


In [None]:
# Chart - 11 visualization code
df = pd.DataFrame(data)
resort_hotel_df = df[df['hotel'] == 'Resort Hotel']
meal_counts = resort_hotel_df['meal'].value_counts()

# Create pie chart
plt.figure(figsize=(8, 8))
plt.pie(meal_counts, labels=meal_counts.index, autopct='%1.1f%%', startangle=90)
plt.title('Distribution of Bookings by Meal Type for Resort Hotel')
plt.show()


##### 1. Why did you pick the specific chart?

A pie chart was chosen to visualize the distribution of bookings by meal type for the 'Resort Hotel'.

##### 2. What is/are the insight(s) found from the chart?

The chart displays the percentage distribution of bookings by meal type for the 'Resort Hotel', highlighting the most preferred meal plans among guests.








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, the insights can help create a positive business impact by informing menu planning, inventory management, and pricing strategies to cater to guests' preferences effectively. However, if there is a significant decline in bookings for certain meal plans, it could lead to negative growth if not addressed, indicating a need to revise menu offerings or adjust pricing to attract more guests.








#### Chart - 12 Create a stacked bar chart of bookings by month and hotel type

In [None]:
# Chart - 12 visualization code
df = pd.DataFrame(data)
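# Months on the x-axis with hotel types stacked, matching the axis label and legend title below
grouped_df = df.groupby(['arrival_date_month', 'hotel']).size().unstack(fill_value=0)
# Sort the months in calendar order instead of alphabetical order
grouped_df = grouped_df.loc[sorted(grouped_df.index, key=lambda m: pd.to_datetime(m, format='%B').month)]
# Draw on one figure so the figsize applies to the chart
fig, ax = plt.subplots(figsize=(12, 6))
grouped_df.plot(kind='bar', stacked=True, ax=ax)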
plt.title('Distribution of Bookings by Month and Hotel Type')
plt.xlabel('Month')
plt.ylabel('Count of Bookings')
plt.xticks(rotation=45)
plt.legend(title='Hotel Type')
plt.show()


##### 1. Why did you pick the specific chart?

A stacked bar chart was chosen to visualize the distribution of bookings by month and hotel type ('Resort Hotel' and 'City Hotel').

##### 2. What is/are the insight(s) found from the chart?

The chart shows how bookings are spread across months for each hotel type, indicating when hotels are most and least busy throughout the year.








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Answer Here

#### Chart - 13 Create separate bar charts of bookings by country for each hotel

In [None]:
df = pd.DataFrame(data)

resort_hotel_df = df[df['hotel'] == 'Resort Hotel']
city_hotel_df = df[df['hotel'] == 'City Hotel']
resort_country_counts = resort_hotel_df['country'].value_counts()
city_country_counts = city_hotel_df['country'].value_counts()

# Create separate bar charts for each hotel
fig, axs = plt.subplots(1, 2, figsize=(12, 6))
resort_country_counts.plot(kind='bar', ax=axs[0], color='skyblue')
city_country_counts.plot(kind='bar', ax=axs[1], color='salmon')
axs[0].set_title('Resort Hotel')
axs[1].set_title('City Hotel')
for ax in axs:
    ax.set_xlabel('Country')
    ax.set_ylabel('Count of Bookings')
    ax.tick_params(axis='x', rotation=45)  # Rotate tick labels without resetting them
plt.tight_layout()
plt.show()

##### 1. Why did you pick the specific chart?

Separate bar charts show the distribution of bookings by country for each hotel ('Resort Hotel' and 'City Hotel').

##### 2. What is/are the insight(s) found from the chart?

The charts reveal the top countries contributing to bookings for each hotel, aiding in targeted marketing strategies for increased guest engagement and booking conversions.








##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, the insights can positively impact business by guiding targeted marketing strategies. However, if certain countries show declining bookings, it could lead to negative growth, highlighting the need for corrective measures to attract guests from those regions.








#### Chart - 14 - Correlation Heatmap

In [None]:
# Correlation Heatmap visualization code
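df = pd.DataFrame(data)
# One-hot encode the categorical columns into a separate frame so `df` stays unchanged
encoded_df = pd.get_dummies(df, columns=['hotel', 'arrival_date_month', 'meal', 'country'], dtype=int)

# Calculate correlation matrix
corr_matrix = encoded_df.corr()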

# Create a heatmap
plt.figure(figsize=(10, 8))
sns.heatmap(corr_matrix, annot=True, cmap='coolwarm', fmt=".2f")
plt.title('Correlation Heatmap of Hotel Booking Dataset')
plt.show()


##### 1. Why did you pick the specific chart?

A correlation heatmap is well suited to exploring relationships between numerical variables in a dataset. It helps identify which variables are closely related and can provide insights into potential patterns or dependencies in the data.








##### 2. What is/are the insight(s) found from the chart?
1. There is a positive correlation between 'lead_time' and 'stays_in_week_nights', indicating that as the lead time increases, the number of stays in week nights also tends to increase.
2. There is a positive correlation between 'adults' and 'children', suggesting that bookings with more adults tend to also have more children.
3. There is a negative correlation between 'lead_time' and 'arrival_date_year', indicating that as the lead time increases, the year of arrival tends to be earlier.


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

The insights from the correlation heatmap can guide strategic decisions for hotels. For instance, observing a positive correlation between lead time and stays in week nights can help in pricing and room allocation strategies. Similarly, recognizing the positive correlation between adults and children can lead to family-friendly offerings. However, further analysis is needed to validate these insights and ensure they translate into actionable strategies for positive business impact.
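
One way to do that further validation is to rank the correlation pairs numerically instead of reading them off the heatmap colours. This is a minimal sketch: `strongest_correlations` is a hypothetical helper, and the sample columns come from the small sample used for the charts, so the ranking on the full dataset may differ.

```python
from itertools import combinations

import pandas as pd


def strongest_correlations(df, top_n=3):
    """Return the top_n numeric column pairs ranked by absolute Pearson correlation."""
    corr = df.select_dtypes(include="number").corr()
    # Visit each unordered pair of columns exactly once
    pairs = {(a, b): corr.loc[a, b] for a, b in combinations(corr.columns, 2)}
    ranked = pd.Series(pairs, dtype="float64").dropna()
    return ranked.sort_values(key=lambda s: s.abs(), ascending=False).head(top_n)


sample = pd.DataFrame({
    "lead_time": [10, 20, 30, 15, 25],
    "stays_in_weekend_nights": [1, 2, 1, 3, 2],
    "stays_in_week_nights": [3, 4, 2, 5, 3],
    "adults": [2, 1, 2, 3, 2],
})
top = strongest_correlations(sample)
print(top)
```

Ranking by absolute value keeps strong negative relationships in view alongside positive ones.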








#### Chart - 15 - Pair Plot

In [None]:
# Pair Plot visualization code
numerical_cols = ['lead_time', 'stays_in_weekend_nights', 'stays_in_week_nights', 'adults', 'children', 'babies']

sns.pairplot(df[numerical_cols])
plt.show()


##### 1. Why did you pick the specific chart?

I chose a pair plot because it allows for the visualization of relationships between multiple numerical variables in a single plot, making it easier to identify patterns and correlations in the data.








##### 2. What is/are the insight(s) found from the chart?

The pair plot shows relationships between variables. For instance, longer lead times may correlate with more weekend or weeknight stays, suggesting booking behavior patterns.








## **5. Solution to Business Objective**

#### What do you suggest the client to achieve Business Objective ?
Explain Briefly.

To achieve the business objective, the client can offer special deals for customers who book longer stays in advance. They can also send personalized offers to customers based on their booking habits, which can help attract more customers and increase loyalty.








# **Conclusion**


In conclusion, analyzing the hotel booking dataset has provided valuable insights into booking patterns and guest behavior. Understanding these patterns can help hotels optimize pricing strategies, improve customer engagement, and ultimately increase revenue. By leveraging data-driven insights, hotels can make informed decisions to enhance their business performance.








### ***Hurrah! You have successfully completed your EDA Capstone Project !!!***