I completed this project as part of the Machine Learning class at UC Berkeley alongside my peers Chloe Nguyen and Catherine Liao. I want to express my gratitude for their valuable contributions to the project!
This project focuses on conducting a comprehensive customer personality analysis for optimizing marketing strategies. Through extensive data preprocessing and clustering techniques such as KMeans and Agglomerative Clustering, we aim to identify four distinctive customer segments to empower the company with actionable insights to tailor products and targeted campaigns, thereby maximizing customer engagement and conversion rates.
Source: https://www.kaggle.com/datasets/imakash3011/customer-personality-analysis
- Mini Batch K-Means Clustering
- Agglomerative Clustering
- DBSCAN
- GMM
All models showed similar results.
Common personas among the models:
- Customer persona 1: Lowest income families (1-2 kids) with low spending habits across all products and all purchasing methods
- Customer persona 2: Moderate income families (1-2 kids) with moderate spending habits and high deals, stores and web purchases
- Customer persona 3: High income singles/ small families (0-1 kid) with high spending habits across all product categories- especially wine sales