Skip to content

liceyoung/Data_Mining_Project_Default_of_Credit_Cards

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Data Mining Project: Default of Credit Cards

Team of three: Alice, Yaksh, Joe
Language, Tools, Technologies, Library: Python, Jupyter Notebook, Pandas, Numpy, Matplotlib

Data Source (From Reference Paper) :

The study took payment data in October, 2005, from an important bank (a cash and credit card issuer) in Taiwan and the targets were credit card holders of the bank. Among the total 25,000 observations, 5529 observations (22.12%) are the cardholders with default payment.

Problem definition:

  • Investigate and understand customers' default payment behavior in Taiwan.
  • Aims to compare the predictive accuracy of data mining methods in determining the probability of default among these customers.

Training & Testing Dataset

  • Data was randomly divided into two groups, one for model training and the other to validate the model.
  • Total records: 30000 records
  • Testing: 9000 records
  • Training: 21000 records

Algorithms

  1. Decision Tree
  2. Support Vector Machine (SVM)
    • SVM Linear
    • SVM Radial Basis Function (RBF)
    • SVM Sigmoid
    • SVM Polynomial (degree 4)
  3. Artificial Neural Network (ANN)

Final Results

  • SVM RBF looks to be the best classifier throughout.
  • There are few points where ANN and SVM linear might slightly outperform SVM RBF.

About

Applying Data Mining Techniques to Credit Cards Dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published