This project identifies customer segments. I used the K-means unsupervised learning algorithm for clustering and Principal Component Analysis (PCA) for dimensionality reduction.
The project uses two datasets: the first is a general dataset on the population of Germany, and the second covers customers of a mail-order sales company.
The steps of the project are as follows:
- Load the Data
- Preprocessing (Data Cleaning & Feature Engineering)
- Feature Transformation (Feature Scaling and PCA)
- Interpret Principal Components
- Clustering
- Compare Customer Data to Demographics Data
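
The steps above can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the project's actual code: the synthetic arrays stand in for the two datasets (which are not in this repository), and the number of features, principal components, and clusters, as well as the helper `cluster_shares`, are assumptions chosen only for the example.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-ins for the two datasets (assumption: already cleaned and numeric).
general = rng.normal(size=(1000, 10))             # general population
customers = rng.normal(loc=0.5, size=(300, 10))   # mail-order customers

# Fit scaling, PCA, and K-means on the general population only.
scaler = StandardScaler().fit(general)
pca = PCA(n_components=5).fit(scaler.transform(general))
kmeans = KMeans(n_clusters=4, n_init=10, random_state=0).fit(
    pca.transform(scaler.transform(general))
)


def cluster_shares(data):
    """Return the fraction of rows assigned to each cluster (hypothetical helper)."""
    labels = kmeans.predict(pca.transform(scaler.transform(data)))
    counts = np.bincount(labels, minlength=kmeans.n_clusters)
    return counts / counts.sum()


# Apply the same fitted transformations to both datasets and compare shares.
general_shares = cluster_shares(general)
customer_shares = cluster_shares(customers)

for k, (g, c) in enumerate(zip(general_shares, customer_shares)):
    print(f"cluster {k}: general={g:.2%} customers={c:.2%} diff={c - g:+.2%}")
```

Clusters where the customer share is clearly higher than the general share point to over-represented segments. Fitting the scaler, PCA, and K-means on the general population and only applying them to the customer data keeps the two sets of cluster assignments comparable.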
Note: The two datasets are not included in this repository. They were provided by Bertelsmann partners AZ Direct and Arvato Financial Solutions.