This project detects fraudulent credit card transactions. It is a machine learning web app that takes transaction details from the user and predicts whether the transaction is fraudulent.
Fraud detection is a set of activities undertaken to prevent money or property from being obtained through false pretenses. It is applied in many industries, such as banking and insurance. In banking, fraud may include forging checks or using stolen credit cards. As digitalization increases, fraudulent activity is also increasing across domains, particularly in retail, and it harms the ecosystem of online transactions.
The goal is to use machine learning to recognize fraudulent credit card transactions so that customers are not charged for items they did not purchase.
- Manual inspection when fraud is identified
- Detection of upcoming fraud
- Gives better insight into the customer base
- Prevents customers from falling into fraud traps
The dataset is taken from a Kaggle problem statement. You can download the dataset from here
- Data Exploration: Explored the dataset using pandas, numpy, matplotlib and seaborn.
- Data Visualization: Plotted graphs to get insights about the dependent and independent variables.
- Feature Engineering: Removed missing values and created new features based on those insights.
- Feature Selection: Removed unwanted features using SelectKBest.
- Model Selection I: Tested all base models to check baseline accuracy. Also plotted and calculated performance metrics to check whether each model is a good fit.
- Pickle File: Selected the model with the best accuracy and saved it to a pickle file using the pickle library.
- Webpage & Deployment: Created a web form that takes all the necessary inputs from the user and shows the output.
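The feature selection and model evaluation steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the data is synthetic (generated with `make_classification`), and the choice of `k=8`, the `f_classif` scoring function, the scaling step, and `class_weight="balanced"` are all assumptions made for the example. Precision and recall are printed alongside the fit because, when fraud is rare, a model that never flags anything can still score high accuracy.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the Kaggle data: roughly 5% of rows are labeled fraud.
X, y = make_classification(
    n_samples=2000, n_features=20, n_informative=5,
    weights=[0.95, 0.05], random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Scale, keep the 8 highest-scoring features (k=8 is an arbitrary
# illustrative choice), then fit a logistic regression classifier.
model = make_pipeline(
    StandardScaler(),
    SelectKBest(score_func=f_classif, k=8),
    LogisticRegression(max_iter=1000, class_weight="balanced"),
)
model.fit(X_train, y_train)

# Evaluate on held-out data with metrics suited to imbalanced classes.
y_pred = model.predict(X_test)
precision = precision_score(y_test, y_pred, zero_division=0)
recall = recall_score(y_test, y_pred, zero_division=0)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Placing `SelectKBest` inside the pipeline means the feature scores are computed from the training split only, so the test split does not influence which features are kept.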
- PyCharm is used as the IDE.
- Matplotlib and Seaborn are used for visualizing plots.
- Heroku is used for model deployment.
- MongoDB is used as the database.
- The front end is built with HTML and CSS.
- Flask is used for creating the application server and pages.
- GitHub is used for version control.
- os is used for creating and deleting folders.
- csv is used for creating .csv files.
- numpy is used for array computations and mathematical operations.
- pandas is used for manipulating and wrangling structured data.
- scikit-learn is used as the machine learning toolkit.
- Logistic Regression is used for training the model.
- pickle is used for saving the model.
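As a rough illustration of how the last two items fit together, the sketch below trains a logistic regression model and round-trips it through a pickle file. The training data is made up for the example, and the file name `model.pkl` is hypothetical; the project's real file name and features may differ.

```python
import os
import pickle
import tempfile

import numpy as np
from sklearn.linear_model import LogisticRegression

# Toy training data (made up): one feature per row, label 1 marks fraud.
X = np.array([[10.0], [25.0], [40.0], [900.0], [1200.0], [1500.0]])
y = np.array([0, 0, 0, 1, 1, 1])

clf = LogisticRegression(max_iter=1000).fit(X, y)

# Save the fitted model to disk ("model.pkl" is a hypothetical name).
path = os.path.join(tempfile.mkdtemp(), "model.pkl")
with open(path, "wb") as f:
    pickle.dump(clf, f)

# Later, for example when the web app starts, load it back and predict.
with open(path, "rb") as f:
    restored = pickle.load(f)

print(restored.predict(X))
```

Only unpickle files from a source you trust, since loading a pickle can execute arbitrary code.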
Project.Demo.Video.mp4
https://fraudtrasactiondetection.herokuapp.com
- Clone the project
git clone https://github.com/sandy252/FraudDetection.git
- Go to the project directory
cd FraudDetection
- Install dependencies
pip install -r requirements.txt
- Run app.py
python app.py
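For orientation, here is a minimal sketch of the kind of Flask form flow such an app might expose. It is not the repository's `app.py`: the route names, the single `amount` field, and the `ThresholdModel` class are all hypothetical, and the stub stands in for the pickled model so the example runs on its own.

```python
from flask import Flask, render_template_string, request

app = Flask(__name__)


class ThresholdModel:
    """Hypothetical stand-in for the trained model; flags large amounts."""

    def predict(self, rows):
        return [1 if row[0] > 1000 else 0 for row in rows]


model = ThresholdModel()

# Inline template with one input field (a real app would likely use more).
FORM = """
<form method="post" action="/predict">
  <input name="amount" placeholder="Amount">
  <button type="submit">Check</button>
</form>
{% if result %}<p>{{ result }}</p>{% endif %}
"""


@app.route("/")
def index():
    # Show the empty form.
    return render_template_string(FORM)


@app.route("/predict", methods=["POST"])
def predict():
    # Validate the submitted value before handing it to the model.
    try:
        amount = float(request.form["amount"])
    except (KeyError, ValueError):
        return render_template_string(FORM, result="Please enter a numeric amount."), 400
    label = model.predict([[amount]])[0]
    result = "Fraudulent" if label == 1 else "Not fraudulent"
    return render_template_string(FORM, result=result)
```

Adding `app.run()` under an `if __name__ == "__main__":` guard would let a file like this be started with `python app.py`.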
For support, email kashyapsandeep252@gmail.com.