Assignment-10---Supervised-Machine-Learning-Model

Mini Project: Supervised Machine Learning – Fraud Detection

Project Overview

This project applies supervised machine learning techniques to detect fraudulent financial transactions using the fraud.csv dataset. It involves exploring, pre-processing, modelling, and evaluating the dataset to build a system capable of identifying fraudulent behaviour with high accuracy and reliability.

🧾 Project Requirements & Description

This project follows the key requirements of the assignment brief:

Data Exploration
- Analysed structure and distribution of fields in fraud.csv
- Identified zero and missing values
- Detected the need for scaling and encoding due to imbalance and categorical data
Data Preparation
- Filled zero values with medians where appropriate
- Scaled data using RobustScaler to manage outliers
- Encoded categorical features using LabelEncoder
- Sampled data to address class imbalance for model fairness
Model Development
- Trained multiple supervised ML models:
  - Logistic Regression (regular)
  - Support Vector Machines (Linear, RBF, Sigmoid)(undersampled)
  - Random Forest (regular and undersampled)
  - Neural Network (undersampled)
Model Evaluation
- Used accuracy, precision, recall, and F1-score
- Plotted ROC curves for all models
- Analysed trade-offs between false positives and false negatives

Dataset Used

Name: fraud.csv
Description: A simulation of mobile money transactions labelled as fraud or non-fraud.
Source: Kaggle – PaySim Fraud Detection Dataset

Models & Results Summary

Model	Precision	Recall	F1-Score	Accuracy
Logistic Regression	Low	High	Low	97%
SVM (Linear/RBF)	~72%	~90%	Strong	98%
Random Forest	13%	98%	Moderate	99%
RF (Undersampled)	99%	97%	Best	100%
Neural Network	94%	88%	High	98%

✅ Best Model: Random Forest (Undersampled)
🚫 Worst Kernel: SVM Sigmoid — struggled with precision and overall accuracy
📉 Accuracy alone was misleading due to class imbalance; precision/recall were prioritized

Next Steps

Fine-tune top models using GridSearchCV
Add ‘is_off_hours’ feature (e.g., 0–6 AM) to allow model to flag time-based risk
Deploy fraud detection system as an API
Monitor performance on live or real-world data

How to Run

Clone this repository
Install required packages:
```
pip install -r requirements.txt
```

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Assignment 10.ipynb		Assignment 10.ipynb
Fraud Detection.pptx		Fraud Detection.pptx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Assignment-10---Supervised-Machine-Learning-Model

Mini Project: Supervised Machine Learning – Fraud Detection

Project Overview

🧾 Project Requirements & Description

Dataset Used

Models & Results Summary

Next Steps

How to Run

About

Uh oh!

Releases

Packages

Languages

lawabidingcoder/Assignment-10---Supervised-Machine-Learning-Model

Folders and files

Latest commit

History

Repository files navigation

Assignment-10---Supervised-Machine-Learning-Model

Mini Project: Supervised Machine Learning – Fraud Detection

Project Overview

🧾 Project Requirements & Description

Dataset Used

Models & Results Summary

Next Steps

How to Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages