This project explores several methods for handling imbalanced data, using the Bank Marketing Data Set. It covers the following data sampling methods:
- Synthetic Minority Oversampling Technique (SMOTE)
- Adaptive Synthetic (ADASYN)
- Neighbourhood Cleaning Rule
- One-Sided Selection
- SMOTEENN
- SMOTE + Tomek Links
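
To give a feel for what the oversampling methods in the list above do, here is a minimal sketch of the core SMOTE idea: pick a minority sample, pick one of its nearest minority neighbours, and create a new point somewhere on the segment between them. The function name `smote_sketch`, its parameters, and the toy points are assumptions made for illustration; this is not the project's code, and in practice a library such as imbalanced-learn would normally be used instead.

```python
import math
import random


def smote_sketch(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbours.

    Hypothetical helper for illustration only; it uses plain Python and a
    brute-force neighbour search, so it is only suitable for tiny examples.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base_idx = rng.randrange(len(minority))
        base = minority[base_idx]
        # k nearest minority neighbours of the base point (excluding itself)
        others = [p for i, p in enumerate(minority) if i != base_idx]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy minority-class points (made up for this example)
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
new_points = smote_sketch(minority, n_new=6)
```

Because every synthetic point is an interpolation between two existing minority samples, the new points always stay inside the region already covered by the minority class. The undersampling and combined methods in the list differ mainly in how they remove or clean majority samples before or after this step.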
In addition, we will see how cost-sensitive learning affects the performance of the classifier. The full article, with a description of each technique applied, can be found here.
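
As a rough illustration of cost-sensitive learning, the sketch below weights each class inversely to its frequency and applies those weights to a log loss, so that errors on the minority class cost more. The helper names, the 9:1 toy labels, and the predicted probabilities are assumptions made for this example, not results from the project. The weighting formula `n_samples / (n_classes * count)` is the "balanced" heuristic documented by scikit-learn for its `class_weight` option.

```python
import math
from collections import Counter


def balanced_class_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * c) for cls, c in counts.items()}


def weighted_log_loss(labels, probs, weights):
    """Binary log loss where each sample is scaled by its class weight."""
    total = 0.0
    weight_sum = 0.0
    for y, p in zip(labels, probs):
        w = weights[y]
        total += -w * (y * math.log(p) + (1 - y) * math.log(1 - p))
        weight_sum += w
    return total / weight_sum


# Toy data: 9 majority samples (0) and 1 minority sample (1)
labels = [0] * 9 + [1]
weights = balanced_class_weights(labels)

# A classifier that always leans towards the majority class
probs = [0.1] * len(labels)
plain_loss = weighted_log_loss(labels, probs, {0: 1.0, 1: 1.0})
cost_sensitive_loss = weighted_log_loss(labels, probs, weights)
```

With these toy numbers the minority class receives a much larger weight than the majority class, so a model that ignores the minority class is penalised more heavily under the weighted loss than under the plain one.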