Credit-card fraud detection

I performed exploratory data analysis and visualization on the data. Because the dataset is highly imbalanced, I used under-sampling to build a balanced dataset for model training. I tried several algorithms and found Random Forest to be the most effective, so I used it as the model for the final testing. Results of the Random Forest model on the test set:

Accuracy - 93.93%
Precision - 95.23%
Recall - 86.96%
F1-score - 90.90%
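
The under-sampling step can be sketched roughly as follows. This is a minimal standard-library illustration, not the code from the notebook: the function name `undersample`, the toy rows, and the seed are all assumptions made for the example. It randomly drops majority-class rows until every class has as many rows as the rarest one.

```python
import random
from collections import Counter


def undersample(rows, labels, seed=0):
    """Randomly keep only as many rows per class as the rarest class has.

    Hypothetical helper for illustration; not taken from the notebook.
    """
    rng = random.Random(seed)

    # Group the rows by their class label.
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)

    # Size of the rarest class decides how many rows each class keeps.
    n_min = min(len(group) for group in by_class.values())

    balanced = []
    for label, group in by_class.items():
        for row in rng.sample(group, n_min):
            balanced.append((row, label))

    # Shuffle so the classes are not stored in contiguous blocks.
    rng.shuffle(balanced)
    out_rows = [row for row, _ in balanced]
    out_labels = [label for _, label in balanced]
    return out_rows, out_labels


# Toy data (made up): 995 legitimate rows (label 0) and 5 fraud rows (label 1).
rows = [[float(i)] for i in range(1000)]
labels = [1 if i % 200 == 0 else 0 for i in range(1000)]

bal_rows, bal_labels = undersample(rows, labels)
print(Counter(bal_labels))
```

The balanced rows would then be passed to a classifier such as a random forest. Under-sampling discards most of the majority class, which keeps training fast but throws away information.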

In the second notebook, I used a deep learning model that is five layers deep. Results of the DNN model on the test set:

Accuracy - 99.93%
Precision - 84.67%
Recall - 78.91%
F1-score - 81.69%
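
A gap between very high accuracy and a lower F1-score is what one would expect when the test set is dominated by legitimate transactions, since accuracy is then driven mostly by the majority class. The sketch below shows how the four metrics are derived from confusion-matrix counts. The counts are invented for illustration and are not the notebook's results; `classification_metrics` is a hypothetical helper name.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall and F1 from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    # Guard against division by zero when a model never predicts fraud.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Made-up example: 10,000 transactions, 20 of them fraudulent.
# The model catches 15 frauds, misses 5, and raises 3 false alarms.
model = classification_metrics(tp=15, fp=3, fn=5, tn=9977)

# A trivial baseline that labels every transaction as legitimate.
baseline = classification_metrics(tp=0, fp=0, fn=20, tn=9980)

for name, metrics in (("model", model), ("baseline", baseline)):
    print(name, {key: round(value, 4) for key, value in metrics.items()})
```

In this made-up case the baseline still reaches 99.8% accuracy while catching no fraud at all, which is why precision, recall, and F1-score are the more informative figures here.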

After that, I used SMOTE, an oversampling method, to address the class imbalance. Results of the DNN model with SMOTE on the test set:

Accuracy - 99.68%
Precision - 99.76%
Recall - 99.61%
F1-score - 99.68%
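
The core idea of SMOTE can be sketched as below: each synthetic fraud sample is placed at a random point on the line segment between a real minority sample and one of its nearest minority neighbours. This is a simplified standard-library illustration, not the notebook's code. The function name `smote_sketch`, the toy points, and the parameter values are assumptions, and it needs Python 3.8+ for `math.dist`.

```python
import math
import random


def smote_sketch(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbours.

    Simplified illustration of the SMOTE idea; hypothetical helper, and it
    expects at least two minority points.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # The k nearest minority points to `base`, excluding `base` itself.
        others = [point for point in minority if point is not base]
        neighbours = sorted(others, key=lambda point: math.dist(base, point))[:k]
        neighbour = rng.choice(neighbours)
        # Pick a random position on the segment from base to neighbour.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy minority class (made up): four fraud samples with two features each.
minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_points = smote_sketch(minority, n_new=6, k=2)
print(len(new_points))
```

In practice a library implementation is the usual choice, for example the `SMOTE` class in imbalanced-learn with its `fit_resample` method; whether the notebook uses it is an assumption. A common practice is to oversample only the training split, so that the test set keeps the original class ratio.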

Thus the deep learning model with SMOTE gives the best overall performance, with the highest precision, recall, and F1-score of the three setups.
