This repository contains a Jupyter Notebook showcasing an in-depth Exploratory Data Analysis and Fuzzy Segmentation analysis for TravelTide, a fictional company specializing in the travel industry.
This Jupyter Notebook aims to support TravelTide's new marketing initiative. The analysis focuses on customer segmentation to identify the best perks that can be offered to various customer groups. The notebook contains detailed EDA, correlation analyses, feature engineering, and fuzzy segmentation.
The raw data for this project can be accessed here.
The data used in this notebook is the result of several SQL queries that extracted information from the database, joining the four tables. These SQL queries can be found in the following GitHub Gist:
Please refer to this Gist to understand how the data was gathered and pre-processed. The final data that I have used in the notebook is the result of SQL queries, and if you want to rerun the notebook, you can download it from here.
To run this notebook, you will need:
- Python 3.x
- Jupyter Notebook
- Required Python libraries: pandas, numpy, matplotlib, seaborn, scikit-learn
- Clone this repository to your local machine.
- Navigate to the project directory.
- Install the required packages.
- Open Jupyter Notebook.
- Navigate to the notebook file and open it.
- Run the cells to see the analysis.