Multi-class Classifications of Malicious URLs using Deep Learning

SCS 3546 Deep Learning

Jupyter Notebooks:

Main Experiment

Team members:

Name	GitHub Repo
Arjie Cristobal	https://github.com/quickheaven

Introduction

Artificial Intelligence (AI) and cybersecurity are two of the most rapidly growing sectors in the technology industry.

The global AI in cybersecurity market was valued at $19.2 billion in 2022 and is projected to reach $154.8 billion by 2032, growing at a CAGR of 23.6% from 2023 to 2032.

Both sectors are expected to keep growing, and applying AI to cybersecurity problems will become increasingly important.

Objective

This study explores a lightweight approach to identifying and classifying malicious URLs using deep learning with Keras.

Dataset

URL dataset (ISCX-URL2016)

University of New Brunswick
Canadian Institute for Cybersecurity

Mohammad Saiful Islam Mamun, Mohammad Ahmad Rathore, Arash Habibi Lashkari, Natalia Stakhanova and Ali A. Ghorbani, "Detecting Malicious URLs Using Lexical Analysis", Network and System Security, Springer International Publishing, pp. 467-482, 2016.

Link: URL dataset (ISCX-URL2016)

Loading and Preparing the dataset

This study reuses the UrlDatasetLoader from the Machine Learning (ML) project Detection and categorization of malicious URLs for data cleaning and preparation. The loader handles null and NaN values, feature selection, and anomaly detection.
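The three preparation steps named above can be illustrated with a short pandas sketch. This is not the UrlDatasetLoader code, which is not shown here. The function name `prepare_url_features`, the `label` column name, the median fill, the constant-column filter, and the z-score threshold are all assumptions chosen for illustration, and the sketch assumes every feature column is numeric.

```python
import numpy as np
import pandas as pd


def prepare_url_features(df: pd.DataFrame, label_col: str = "label",
                         z_thresh: float = 3.0) -> pd.DataFrame:
    """Illustrative cleaning pass (hypothetical; not the project's loader)."""
    features = df.drop(columns=[label_col])

    # 1. Null/NaN handling: treat infinities as missing, then fill each
    #    column with its median (an assumed strategy).
    features = features.replace([np.inf, -np.inf], np.nan)
    features = features.fillna(features.median())

    # 2. Feature selection: drop constant columns, which carry no signal.
    features = features.loc[:, features.nunique() > 1]

    # 3. Anomaly detection: keep only rows where every feature lies within
    #    z_thresh standard deviations of its column mean.
    z = (features - features.mean()) / features.std(ddof=0)
    keep = (z.abs() <= z_thresh).all(axis=1)

    # Re-attach the labels of the rows that survived.
    return features[keep].assign(**{label_col: df.loc[keep, label_col]})
```

A simple z-score filter is used here only because it is easy to read; the actual loader may use a different anomaly detection method.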

The prepared dataset is then exported to CSV files and uploaded to the Deep Learning Git repository for use in training.
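As a rough sketch of this hand-off, the snippet below writes a prepared DataFrame to CSV and reads it back as arrays suitable for a Keras model. The function names, the `label` column name, and the integer label encoding are assumptions made for illustration; the actual file names and layout in the repository's datasets folder may differ.

```python
from pathlib import Path
import tempfile

import pandas as pd


def save_prepared(df: pd.DataFrame, csv_path) -> Path:
    """Write the prepared dataset to a CSV file (hypothetical helper)."""
    csv_path = Path(csv_path)
    csv_path.parent.mkdir(parents=True, exist_ok=True)
    df.to_csv(csv_path, index=False)
    return csv_path


def load_for_training(csv_path, label_col: str = "label"):
    """Read a prepared CSV and return features, integer labels, class names."""
    df = pd.read_csv(csv_path)
    # Map class names to integer codes; sort=True makes the mapping
    # deterministic (alphabetical order of class names).
    codes, classes = pd.factorize(df[label_col], sort=True)
    # Assumes every remaining column is a numeric feature.
    X = df.drop(columns=[label_col]).to_numpy(dtype="float32")
    return X, codes, list(classes)
```

Integer labels of this form pair with a sparse categorical cross-entropy loss in Keras; one-hot labels would be an equally reasonable choice.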

