text = "My Data Analysis Portfolio!"
print(text)
Hi all! I'm currently working on creating a porfolio to showcase my works related to BI & Data Analytics using a mixed bag of technologies such as Python, SQL, Excel and PowerBI. This project is a sample of my hotel bookings of Dacha Resort, a small resort hotel in Phuket, Thailand, that I operated between 2011 - 2013. Some personal indentifiable data have been omitted to protect privacy.
The data set contains 40060 rows of record and 32 feature sets.
Jupyter notebook file may take up to 10 sec. to load
-
Import libraries
-
Extract and Read Data
-
Quick Overview of Data
-
Data preparation
-
Descriptive Statistics
-
Exploratory Data Analysis
-
Bookings by Market Segment
Shows that 44% of reservations were done online, ~19% was through offline travel agency and tour operators and another 16% were walk-in customers -
Bookings by Distribution Channel
Regardless of online or offline, Travel Agents and Tour Operators account for more than 72% of Distribution Channel -
Change Column Name - Converting Values Of The "meal" column
-
Bookings by Meals
3 Quarters of reservations include breakfast -
Bookings by Customers type
-
Bookings by type of Guest
A bit over 4% are returning guests. -
Bookings by Deposit Types
The majority of guests made no deposits. -
Bookings by Reservation Status
Around one quarter of reservations were canceled. -
Bookings by Total Nights stayed by Guests
Most common durations are between 1 to 4 nights, either that or one whole week! -
Total Nights Stayed by Guests in Hotel
-
Visitors from Top 20 Countries
Vast majority of guests are locals (Thais) with UK and Malaysia as distant second and third largest groups -
Bookings by Lead Time
Most bookings were done within 30 days before checking-in to the hotel. -
Number of Bookings per Year
Since the start of operation in 2011, the number of bookings have doubled by next year and likely to grow even further in the following years. (data set only until August 2013) -
Number of Bookings per Month
July and August are the peak months.
-
Remove columns
-
Correlations
- Correlation Matrix
- Heat Map