Skip to content

Abhinandan-Khurana/PhishSleuth

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GIF example

-: AI-enabled Phishing link Detection and Alert System :-

Table of content

  • Introduction.
  • Insallation.
  • Directory Tree.
  • Terminology used
  • Result.
  • Conclusion.

INTRODUCTION

With the increasing number of crime the crime detection tools has also increased at an alarming rate.On the 3rd of March, the economic times of India mentioned that:- About 30 crore people are vulnerable to phishing attacks in India, of which 5 lakh potentially fall prey to scamsters.That's why we came up with the idea of PhishSleuth.It is a Phishing Links Detection and Alert system is to help individuals and organizations protect themselves from phising attacks. Phishing attacks are a common and growing threat to personal and business security, and they can result in the loss of sensitive information, financial loss, and other consequences. BY developing a solution that can detect and alert users of potential phishing attacks across vsrious digital channels, we can provide an additional layer of security and protection attacks across various digital channels, we can provide an additional layer of security and protection. Ultimately, the goal of developing a solution like this is to empower individuals and organixations to protect themselves from phishing attacks and minimize the risk of data breaches, financial loss, and other negative consequences.

Description of image 2 Description of image 3 Description of image 3
Description of image 2

Installation

  1. Download the zip file and unzip the the file in a particular folder.
  2. Go the chrome seetings manage extensions.
  3. Now turn on developer mode.
  4. Select load unpacked and import mainfiest content from the folder.

Directory tree

├── frontend
│   ├── icons
│   │   ├── 128x128.png
│   │   ├── 16x16.png
│   │   ├── 32x32.png
│   │   ├── 48x48.png
│   │   └── url_list.pdf
│   ├── js
│   │   ├── jquery.js
│   │   └── plugin_ui.js
│   ├── main.js
│   ├── manifest.json
│   ├── plugin_ui.css
│   ├── plugin_ui.html
│   ├── README.md
│   ├── style.css
│   ├── tempstorage.json
│   └── test.html
├── LICENSE
├── phish-api
│   ├── app.py
│   ├── app.yaml
│   ├── Procfile
│   ├── __pycache__
│   │   ├── app.cpython-310.pyc
│   │   └── app.cpython-38.pyc
│   ├── README.md
│   ├── requirements.txt
│   ├── SVM_Model.pkl
│   └── Web_Scrapped_websites.csv
├── README.md
└── sms-email-spam-classifier-main
  ├── app.py
  ├── model.pkl
  ├── nltk.txt
  ├── Procfile
  ├── requirements.txt
  ├── setup.sh
  ├── sms-spam-detection.ipynb
  ├── spam.csv
  └── vectorizer.pkl

Terminology Used

  • Support Vector Machine (SVM)-
    Support vector machines (SVMs) is used for theclassi¿cation of both linear and nonlinear data.
  • Decision Tree-
    A decision tree is a directed, acyclic graph with two types of nodes, namely; internal nodes that represents a test and terminal node holds a class label.
  • K-Nearest Neighbour-
    These are distance-based comparisons that intrinsically assign equal weights to each attribute.
  • Rotation Forest (RoF)-
    Rotation Forest (RoF) is an ensemble classifier in which the training data is created by randomly splitting the feature set into K subsets and Principal.
  • Component Analysis (PCA) is applied to each subset for any base classifier.
  • Random Forests (RF)-
    Random Forests can be built in tandem with random attribute selection using bagging. Random Forests follow an ensemble approach to learning, that is a divide and conquer approach for improving performance.

Result

Accuracy of various model used for Phishing detection

Algorithm Accuracy Precision Accuracy_max_ft_3000
0 KN 0.905222 1.000000 0.905222
1 NB 0.978723 1.000000 0.971954
2 ETC 0.979691 0.991453 0.979691
3 RF 0.975822 0.990826 0.975822
4 SVC 0.971954 0.974138 0.974855
5 AdaBoost 0.961315 0.954128 0.961315
6 xgb 0.968085 0.950413 0.968085
7 LR 0.967118 0.940000 0.956480
8 GBDT 0.946809 0.931373 0.959381
9 BgC 0.959381 0.861538 0.959381
10 DT 0.932302 0.838095 0.931335

Conclusion

The final take away form this project is to explore various machine learning models, perform Exploratory Data Analysis on phishing dataset and understanding their features. Gradient Boosting Classifier currectly classify URL upto 95.6% respective classes and hence reduces the chance of malicious attachments.