Skip to content

Fraud Detection Research - Data Science Capstone Project at Penn State University, University Park Campus

License

Notifications You must be signed in to change notification settings

saplanyanki/Blacklist_Fraud_Detection_App

Repository files navigation

Credit Card Fraud Detection Web Application - XGBoosted Neural Networks

Contributors Forks Stargazers Issues MIT License


Data Science Capstone Project @ Penn State

@yankisaplan - @elifreedman
Explore the App Code »
Explore the ML Code »
View Demo - Not Available · Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. License
  6. Acknowledgments

About The Project

Project:

  • The Blacklist web application is an innovative tool that uses the power of machine learning to detect and prevent credit card fraud. Our team has developed a custom model that employs various techniques to accurately classify fraudulent activity, providing users with a high level of security and peace of mind. The Blacklist architecture consists of three main components: the model, the backend, and the frontend.
  • The model is a supervised neural network that is enhanced by an XGBoost algorithm, which provides the most relevant features for accurate predictions. The Blacklist also uses the Python Flask module to connect the pre-trained model with the backend, enabling fast and efficient prediction of credit card data.
  • The frontend of the Blacklist is designed to be user-friendly and easy to navigate, making it accessible to users of all technical levels. The seamless integration of the model, backend, and frontend ensures that users receive predictions in just a couple of seconds and have a great experience using the application.

(back to top)

Built With

This section should list any major frameworks/libraries used to bootstrap your project. Leave any add-ons/plugins for the acknowledgements section. Here are a few examples.

  • Python
  • Learn
  • Tensorflow
  • Torch
  • Flask

(back to top)

Getting Started

Clone the repository and follow Prerequisites.

Prerequisites

  • Download Python Libraries Listed Below:
    bcrypt==4.0.1
    bootstrap-py==1.0.2
    Flask==2.2.3
    Flask-Bcrypt==1.0.1
    Flask-Login==0.6.2
    Flask-SQLAlchemy==3.0.3
    Flask-WTF==1.1.1
    joblib==1.2.0
    numpy==1.22.0
    pandas==1.5.3
    SQLAlchemy==2.0.6
    tensorflow==2.11.0
    WTForms==3.0.1
    XBNet==1.4.6
    xgboost==1.7.3
    torch
    sklearn
    os
    pickle
    tqdm
    matplotlib

Installation

  1. Clone the repo
    git clone https://github.com/saplanyanki/DS340-440.git
  2. Install packages
    pip install everything in the requirements.txt

(back to top)

Usage

  • The process begins when the user navigates to our blacklist website. From there, they are able to either log into their previously created account, or to register for an account if they are a new user. When logging into a previously created account, the sign-in page will refresh if the user credentials are incorrect. If the credentials are correct, the user will be redirected to the prediction page. From the registration page, the user is prompted to fill out their name, email, and password. There are restrictions to the passwords a user may create. The password must be at least eight characters and must contain numbers and special characters for password security.Once on the prediction page, the user is able to upload a comma separated value (csv) file of their credit card statement.

  • Once uploaded, the user is then redirected to the prediction dashboard page. Finally, when on the dashboard, the user is able to see whether or not there were fraudulent transactions on their credit card statement. They are also able to view various graphs pertaining to the level of fraud found on that statement. Lastly, the user is also able to select various models to process their data.

  • The user-friendly interface of our credit card fraud detection software allows users to easily navigate through various tabs to access information about the machine learning models in use. In particular, the My Documents tab provides users with the ability to select or view various Blacklist machine learning models, which have been specifically designed to enhance the accuracy and efficiency of our fraud detection system.

  • In this tab, users can explore different models that are currently available or choose new ones to add to their repertoire. By selecting from a range of Blacklist models, users can be confident that their system is detecting and flagging any suspicious credit card activity, while minimizing the risk of false positives or negatives.

  • Moreover, the My Documents tab also enables users to keep track of the models they are currently using for credit card fraud detection. This feature provides an easy-to-read overview of the specific models that have been chosen, along with details about their performance and accuracy.

  • Overall, the My Documents tab is an essential tool for users of our credit card fraud detection software. With its intuitive interface and comprehensive range of Blacklist machine learning models, it provides users with the tools they need to stay ahead of the curve when it comes to preventing fraudulent activity.

  • In addition to being able to select and view Blacklist machine learning models, our credit card fraud detection software also allows users to keep track of the files they have uploaded for analysis. This feature is available within the My Documents tab, providing users with a comprehensive overview of their system's performance and the files that are currently being used to make predictions on fraudulent activity.

  • To further enhance their understanding of their system's performance, users of our credit card fraud detection software can access the Data Analysis tab. This tab provides users with a detailed overview of their fraud data story, enabling them to observe trends and patterns in their data and gain deeper insights into the accuracy and effectiveness of their fraud detection system.

  • Within the Data Analysis tab, the users can see the Blacklist Opinion info boxes, to see the percentages of changes of their risk contributions, consists of Category of purhcase, location effect, vendor blacklist and past vs present consisting of fraud risk change, # of purhcases changes, and amount change and lastly user fraud health info box to see likelihood of future fraud, purchase patterns, and location safety. Within the Data Analysis tab, users can explore a range of data visualization tools, such as graphs, charts, and tables, to gain a comprehensive understanding of their data. They can filter and sort their data based on various criteria, such as date, transaction type, or amount, to identify any unusual activity or patterns.

  • In addition to exploring their own data, users of our credit card fraud detection software can also gain insights by comparing their data to the general population data. This feature is available within the Data Analysis tab, where users can view how their data compares to the data used to train the machine learning models.

  • By comparing their data to the general population data, users can identify any patterns or trends that may be unique to their data, and make any necessary adjustments to their system's settings or analysis methods. They can also identify any areas where their data may be underrepresented or overrepresented, which can help them to improve the accuracy and effectiveness of their fraud detection system.

(back to top)

Project Roadmap

  • Research
  • Dashboard
  • Modeling
  • Testing

See the open issues for a full list of proposed features (and known issues).

(back to top)

License

Distributed under the MIT License. See LICENSE.txt for more information.

(back to top)

Acknowledgments

@misc{sarkar2021xbnet, title={XBNet : An Extremely Boosted Neural Network}, author={Tushar Sarkar}, year={2021}, eprint={2106.05239}, archivePrefix={arXiv}, primaryClass={cs.LG} }

(back to top)

About

Fraud Detection Research - Data Science Capstone Project at Penn State University, University Park Campus

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published