Customer segmentation is the process of dividing customers into groups based on common characteristics so companies can market to each group effectively and appropriately. In order to demonstrate customer segmentation I will be doing "Cluster Analysis" to classify Starbucks customers into 3 distinct clusters.
Target Starbucks Customers for marketing campaign based on the offers received/viewed/completed by the customers along with the transaction details such as amount spent & amount rewarded back.
The customer transaction data is provided by Starbucks and is a part of Udacity Data Science Nanodegree project. The dataset contains transaction data for 30 days.
The dataset contains three files:
- portfolio.json: contains information about each offer & various attributes related to the offer
- profile.json: contains metadata about customers
- transcript.json: contains data related to event such as offer received/viewed/completed by a customer
Some highlights of the cleaning and transformation tasks include:
- Identifying and labeling repeated offers uniquely.
- Removing the misattribute offers Ex: offers viewed before even receiving.
- Imputing null values.
- Calculating Response Rate & Conversion Rate
Further details about about cleansing & data wrangling tasks can be found in Cleansing_and_Wrangling notebook.
The following tasks were carried out for modeling:
- Feature Scaling using Standard Scaler from Scikit-learn library.
- Dimensionality reduction using Principle Component Analysis (PCA).
- Clustering using K-Means algorithm.
- Majority of the customers in this group are Females followed by the Other Category.
- As the name of the group suggests, this group mainly receives/views/completes the BOGO offers. As a result, they have higher bogo rewards awarded to them.
- In addtion, this group also has the highest income of all three segments, as well as total amount spent by them in transactions is also the highest.
- Finally, majority of the customers from this group became members from the year 2015 to 2017.
- Majority of the customers in this group are Male and recevie highest number of informational offers.
- Altough, they have a decent view rate for both BOGO & DISCOUNT offers, the hardly ever opt in and complete the transactions using these offers.
- They also have the lowest income of all three groups, and most of them became members in the past couple of years i.e 2017 & 2018.
- Majority of the customers in this group belong to the Other Category, followed by Female customers.
- This group receives/views/completes highest number of DISCOUNT offers. As a result, they have higher discount rewards awarded to them. In addition, this group also receives highest number of informational offers.
- The transactions completed by this group is the highest as compared to rest of the groups.
- Finally, these people have been the customers of starbucks consistently since 2013.
The analysis & output can be found in Modeling notebook.