khuynh22/SecureSense-A-Data-Driven-Framework-for-Phishing-Attack-Prevention

SecureSense: A Data-Driven Framework for Phishing Attack Prevention

UIC Engineering Expo 2023 Best in Show

1. Introduction:

This is my Bachelor of Science capstone project, in which I apply the theory from my Computer Science - Machine Learning major and Business Analytics minor.
In response to the rising trend of phishing attacks targeting UIC students, I decided to build a framework of machine learning models, with the hope of using technology to punish technology criminals!
Three ML models have been completed, and the website is in the process of being built for production. I plan to release the project in 2024.

2. Methodology:

The models were built on the dataset Phishing_Legitimate_full.csv from Mendeley Data, which contains 10,000 data points (5,000 legitimate webpages and 5,000 phishing webpages), each described by 48 website features.
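
As a quick sanity check on the dataset description above, the class balance and feature count can be tallied with the standard library. This is a minimal sketch, not the project's own preprocessing: the column names `CLASS_LABEL` and `id`, and the tiny in-memory sample, are assumptions made for illustration.

```python
import csv
import io
from collections import Counter


def summarize(csv_file, label_col="CLASS_LABEL", id_col="id"):
    """Return (feature column names, rows per class) for an open CSV file.

    label_col and id_col are assumed column names, not confirmed by the README.
    """
    reader = csv.DictReader(csv_file)
    # Every column that is neither the row id nor the label counts as a feature.
    feature_cols = [c for c in reader.fieldnames if c not in (label_col, id_col)]
    class_counts = Counter(row[label_col] for row in reader)
    return feature_cols, class_counts


# Hypothetical four-row sample standing in for the real file.
sample = io.StringIO(
    "id,NumDots,UrlLength,CLASS_LABEL\n"
    "1,3,72,1\n"
    "2,1,24,0\n"
    "3,2,51,1\n"
    "4,1,30,0\n"
)
features, counts = summarize(sample)
print(len(features), dict(counts))
```

Run against the real file, for example with `open("Phishing_Legitimate_full.csv", newline="")`, the same function should report 48 features and 5,000 rows per class if the description above holds and the assumed column names match.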

The project uses three main machine learning models:

Decision Tree Model
Logistic Regression Model
Random Forest Classification Model
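
A minimal sketch of how the three models listed above could be trained and compared with scikit-learn. The synthetic data from `make_classification`, the 80/20 split, and all hyperparameters are assumptions for illustration; the project's notebooks may use different settings, and the scores printed here say nothing about the real dataset.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): 1,000 rows, 48 features, 2 classes.
X, y = make_classification(
    n_samples=1000, n_features=48, n_informative=10, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Hyperparameters below are illustrative defaults, not the project's choices.
models = {
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = accuracy_score(y_test, model.predict(X_test))

for name, acc in scores.items():
    print(f"{name}: {acc:.3f}")
```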

In addition to the ML models, the project also uses other concepts, including mutual information, the Spearman coefficient, and the Gini index, among others.
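
For readers unfamiliar with these measures, the sketch below computes each one in plain Python on toy data. It illustrates the textbook definitions only; the function names are hypothetical, and the project report describes how the measures are actually applied.

```python
import math
from collections import Counter


def gini_impurity(labels):
    """Gini index of a label set: 1 - sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * math.log2((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in pxy.items()
    )


def average_ranks(values):
    """1-based ranks, with tied values sharing the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman coefficient: Pearson correlation of the rank vectors.

    Undefined (division by zero) when either input is constant.
    """
    rx, ry = average_ranks(xs), average_ranks(ys)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# Toy data: a binary feature that perfectly predicts a balanced binary label.
feature = [1, 1, 0, 0]
label = [1, 1, 0, 0]
gini = gini_impurity(label)
mi = mutual_information(feature, label)
rho = spearman([10, 20, 30, 40], [1, 2, 3, 5])
print(gini, mi, rho)
```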

Please consult the project report to learn more about why and how these concepts are implemented within the scope of this project.

3. Result:

After training the models and testing them on the dataset, the results are as follows:

In brief, all three models produce strong results, with Random Forest the best model in terms of the numbers. However, based on our experience running the models, as well as the theory behind Random Forest (which essentially runs multiple Decision Trees at runtime), the Decision Tree Model could be considered for the next step of putting the model into production because of its runtime efficiency.
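
The runtime argument above can be made concrete with a toy example: a forest prediction walks every tree and takes a majority vote, so its cost is roughly the sum of the per-tree costs. The tree encoding, thresholds, and function names below are hypothetical and chosen only for illustration.

```python
# A tree is either a leaf label (int) or a tuple:
# (feature_index, threshold, left_subtree, right_subtree).


def predict_tree(tree, x):
    """Walk one tree; return (predicted label, number of comparisons made)."""
    comparisons = 0
    node = tree
    while isinstance(node, tuple):
        feature, threshold, left, right = node
        comparisons += 1
        node = left if x[feature] <= threshold else right
    return node, comparisons


def predict_forest(trees, x):
    """Majority vote over all trees; total cost is the sum over the trees."""
    votes, total = [], 0
    for tree in trees:
        label, comparisons = predict_tree(tree, x)
        votes.append(label)
        total += comparisons
    # Ties are broken arbitrarily in this sketch.
    majority = max(set(votes), key=votes.count)
    return majority, total


# Three hand-written toy trees over x = [num_dots, url_length].
tree_a = (0, 2.5, 0, (1, 50.0, 0, 1))
tree_b = (1, 40.0, 0, 1)
tree_c = (0, 1.5, 0, 1)

x = [3, 72]
single_label, single_cost = predict_tree(tree_a, x)
forest_label, forest_cost = predict_forest([tree_a, tree_b, tree_c], x)
print(single_label, single_cost, forest_label, forest_cost)
```

With 100 trees instead of three, the per-prediction work grows accordingly, which is the trade-off weighed above against the forest's better scores.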

4. Further Steps:

I am working with two other members of my team to develop a website where the user can input a webpage URL; we then use an NLP model and web scraping to extract all the necessary features for our ML model. After that, the data point is processed by our ML model, which outputs the result.
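
A minimal sketch of the URL-to-features step described above, covering only features that can be read from the URL string itself. The function name and the feature names (`NumDots`, `NumDash`, `UrlLength`, `AtSymbol`, `NoHttps`, `IpAddress`) are assumptions for illustration; the planned pipeline also relies on web scraping and an NLP model, which are not shown here.

```python
import ipaddress
from urllib.parse import urlparse


def extract_url_features(url):
    """Derive a few lexical features from a URL string (hypothetical helper)."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    try:
        # Succeeds only when the host is a literal IPv4 or IPv6 address.
        ipaddress.ip_address(host)
        is_ip = 1
    except ValueError:
        is_ip = 0
    return {
        "NumDots": url.count("."),
        "NumDash": url.count("-"),
        "UrlLength": len(url),
        "AtSymbol": int("@" in url),
        "NoHttps": int(parsed.scheme != "https"),
        "IpAddress": is_ip,
    }


url_features = extract_url_features("http://192.168.0.1/secure-login.php")
print(url_features)
```

The resulting dictionary would still need to be extended to the full feature set and ordered to match the training columns before being passed to a trained model.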

Our team is in the process of developing the wireframe in Figma, and we hope to bring this project into production in 2024.

5. Acknowledgements

This work was conducted under the supervision of Professor Mitchell Theys and with feedback from Professor Xinhua Zhang from the Department of Computer Science at UIC.

Folders and files

Data Preprocessing
Decision Tree for Phishing Attack
Decision_Tree_for_Phishing_Attack.ipynb
Logistic Regression.ipynb
Phishing Attacks Poster.pdf
Phishing Detection Using Machine Learning .ipynb
Phishing_Detection_Using_Logistic_Regression_and_Random_Forest_Classifier.ipynb
Phishing_Legitimate_full.csv
README.md
SecureSense_ A Data-Driven Framework for Phishing Attack Prevention.pdf
Web Scraping.ipynb
decision_tree_for_phishing_attack.py
phishing_detection_using_logistic_regression_and_random_forest_classifier.py
