Skip to content
My work for the KPMG (open to public) challenge for bank customer segmentation based on its annual banking industry survey. Dimension of dataset 40,000rows x 150 columns
Jupyter Notebook
Branch: Submission-work
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
Data Science Bootcamp Data_2.0.xlsx
Exploration.ipynb
README.md
Submission.pdf
Updated_Submission.pdf

README.md

Customer-Clustering-Segmentation-Challenge

My work for the KPMG challenge for bank customer segmentation based on its annual banking industry survey. Dimension of dataset 40,000rows x 150 columns

After data cleanup, I created and selected some specific features of interest. Then I ran K-means to generate 7 clusters and used principal component analysis to run some visual checks.

My methodology is explained in the "Updated_Submission.pdf" file. This file also explains the results obtained from the cluster analysis (the customer personas), and profers ways that the analysis could be improved.

The raw data set and the encoding can be found in the "Data Science Bootcamp Data_2.0.xlsx"

You can’t perform that action at this time.