- Filipa Alves
- Helena Oliveira
- J. Daniel Conde
Master in Data Science and Advanced Analytics (Nova IMS, Lisbon) - Autumn 2021
The objective of the project was to segment the clients of an insurance company.
-
Performed coherence checks and detected outliers using DBSCAN and manual filtering. Missing values were handled carefully, imputed mainly with KNN and Logistic Regression.
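As a rough illustration of the two numeric preprocessing steps mentioned above, the sketch below flags outliers with DBSCAN and fills missing values with KNN imputation using scikit-learn. The data is synthetic, and the feature scales, `eps`, `min_samples`, and `n_neighbors` values are assumptions chosen for the example, not the settings used in the project. The Logistic Regression imputation is not covered here.

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.impute import KNNImputer
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-in for two numeric client features (hypothetical data).
X = rng.normal(loc=[50.0, 300.0], scale=[5.0, 30.0], size=(200, 2))
X[:3] = [[150.0, 2000.0], [-40.0, 5.0], [120.0, -900.0]]  # planted extreme rows

# DBSCAN labels points that belong to no dense region as -1 (noise).
# Scaling first keeps one feature from dominating the distance.
X_scaled = StandardScaler().fit_transform(X)
labels = DBSCAN(eps=0.8, min_samples=5).fit_predict(X_scaled)
is_outlier = labels == -1
X_clean = X[~is_outlier].copy()

# Blank out a few values, then fill each gap from the 5 nearest rows.
missing_rows = rng.choice(len(X_clean), size=10, replace=False)
X_clean[missing_rows, 1] = np.nan
X_imputed = KNNImputer(n_neighbors=5).fit_transform(X_clean)

print("rows flagged as outliers:", int(is_outlier.sum()))
print("remaining missing values:", int(np.isnan(X_imputed).sum()))
```

In practice the rows flagged as noise would be inspected before removal, which is where the manual filtering comes in.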
-
To segment clients, K-Prototypes (on categorical variables) and Hierarchical Clustering on top of K-Medoids (on numeric variables) were applied. Marketing campaigns were then designed according to the characteristics of each cluster.
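The idea of running Hierarchical Clustering on top of K-Medoids can be sketched as follows: first compress the numeric data into many small K-Medoids clusters, then merge the medoids hierarchically into a few final segments. This is a minimal sketch on synthetic data. The `k_medoids` helper is a simplified alternating variant written for this example (not a library function and not the full PAM algorithm), and the choices of 12 medoids, Ward linkage, and 3 final segments are assumptions. The K-Prototypes step on categorical variables is not shown.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import cdist


def k_medoids(X, k, n_iter=50, seed=0):
    """Simplified alternating k-medoids (hypothetical helper, not full PAM)."""
    rng = np.random.default_rng(seed)
    medoid_idx = rng.choice(len(X), size=k, replace=False)
    for _ in range(n_iter):
        # Assign every point to its nearest medoid.
        assign = cdist(X, X[medoid_idx]).argmin(axis=1)
        new_idx = medoid_idx.copy()
        for j in range(k):
            members = np.flatnonzero(assign == j)
            if members.size == 0:
                continue
            # New medoid: the member with the smallest total distance to the others.
            within = cdist(X[members], X[members]).sum(axis=1)
            new_idx[j] = members[within.argmin()]
        if np.array_equal(new_idx, medoid_idx):
            break
        medoid_idx = new_idx
    assign = cdist(X, X[medoid_idx]).argmin(axis=1)
    return medoid_idx, assign


# Synthetic numeric data: three blobs standing in for client groups.
rng = np.random.default_rng(1)
centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
X = np.vstack([c + rng.normal(scale=1.0, size=(100, 2)) for c in centers])

# Step 1: many small clusters, each represented by a real observation.
medoid_idx, fine_labels = k_medoids(X, k=12)

# Step 2: hierarchical clustering on the medoids only, cut into at most 3 groups.
Z = linkage(X[medoid_idx], method="ward")
medoid_to_segment = fcluster(Z, t=3, criterion="maxclust")

# Step 3: each client inherits the segment of its medoid.
segments = medoid_to_segment[fine_labels]
print("segment sizes:", np.bincount(segments)[1:])
```

Clustering only the medoids keeps the hierarchical step cheap, and the dendrogram built from `Z` can help decide how many final segments to keep.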