Network Anomaly Detection System

Overview

This project builds a Network Intrusion Detection System (NIDS) using machine learning, specifically k-means clustering and Random Forest classifiers. Trained and evaluated on the KDDCUP99 dataset, the system detects network anomalies and classifies them into four attack categories: Denial of Service (DoS), Remote-to-Local (R2L), User-to-Root (U2R), and Probing.

Features

Binary and Multinomial Classification: Detect normal network activity and classify specific attack types.
Feature Engineering: Attribute Ratio (AR)-based feature selection for improved performance.
Machine Learning Models: Combines unsupervised k-means clustering with supervised Random Forest classifiers.
Comprehensive Evaluation: Precision, recall, F1 score, and ROC curve for model assessment.
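Deployment Plan: Outlines how the trained models could be integrated into operational network monitoring; support for real-time data streams is listed under Future Enhancements.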
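
As an illustration of the AR-based feature selection named above, here is a minimal sketch. It assumes one common formulation of the Attribute Ratio for numeric features (the largest class mean divided by the overall mean, on non-negative scaled features); the notebook's exact formula may differ. The function name `attribute_ratio`, the toy data, and the `1.0` threshold are all hypothetical.

```python
import numpy as np
import pandas as pd


def attribute_ratio(X: pd.DataFrame, y: pd.Series) -> pd.Series:
    """Score numeric features by AR = max over classes of (class mean / overall mean).

    Assumes non-negative features, e.g. min-max scaled to [0, 1].
    """
    # Treat an all-zero column as undefined instead of dividing by zero.
    overall_mean = X.mean(axis=0).replace(0, np.nan)
    class_means = X.groupby(y).mean()          # rows: classes, columns: features
    class_ratio = class_means / overall_mean   # Series aligns on column names
    return class_ratio.max(axis=0).fillna(0.0).sort_values(ascending=False)


# Toy data, made up for illustration: "a" separates the classes, "b" does not.
X = pd.DataFrame({"a": [0.0, 0.1, 0.9, 1.0], "b": [0.5, 0.5, 0.5, 0.5]})
y = pd.Series(["normal", "normal", "dos", "dos"])

scores = attribute_ratio(X, y)
selected = scores[scores > 1.0].index.tolist()  # threshold is an assumption
print(scores)
print(selected)
```

A feature whose mean is the same in every class scores exactly 1, so higher scores indicate features that are concentrated in at least one class.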

Dataset

The project uses the KDDCUP99 dataset, a widely used benchmark dataset for network intrusion detection. It includes a variety of network traffic examples, both benign and malicious, enabling robust model training and evaluation.
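
To make the binary and multi-class targets concrete, the sketch below maps raw KDDCUP99 training labels onto the four attack categories using the grouping commonly used for this dataset. The helper name `add_targets` and the handling of unseen labels as `"unknown"` are assumptions for illustration, not necessarily what the notebook does.

```python
import pandas as pd

# Commonly used grouping of the KDDCUP99 training labels into attack families.
ATTACK_CATEGORY = {
    "normal": "normal",
    # Denial of Service
    "back": "dos", "land": "dos", "neptune": "dos",
    "pod": "dos", "smurf": "dos", "teardrop": "dos",
    # Probing
    "ipsweep": "probe", "nmap": "probe", "portsweep": "probe", "satan": "probe",
    # Remote-to-Local
    "ftp_write": "r2l", "guess_passwd": "r2l", "imap": "r2l", "multihop": "r2l",
    "phf": "r2l", "spy": "r2l", "warezclient": "r2l", "warezmaster": "r2l",
    # User-to-Root
    "buffer_overflow": "u2r", "loadmodule": "u2r", "perl": "u2r", "rootkit": "u2r",
}


def add_targets(labels: pd.Series) -> pd.DataFrame:
    """Derive a multi-class category and a binary attack flag from raw labels."""
    # Raw KDDCUP99 labels carry a trailing dot, e.g. "smurf."
    clean = labels.str.strip().str.rstrip(".")
    # Assumption: labels missing from the table are kept as "unknown" attacks.
    category = clean.map(ATTACK_CATEGORY).fillna("unknown")
    return pd.DataFrame({
        "label": clean,
        "category": category,
        "is_attack": (category != "normal").astype(int),
    })


targets = add_targets(pd.Series(["normal.", "smurf.", "rootkit", "nmap"]))
print(targets)
```

The `is_attack` column serves the binary task, while `category` serves the multi-class task.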

Methodology

Data Preprocessing: Cleaning and preparing the dataset for analysis.
Feature Selection: Using AR-based methods to identify key features.
Clustering and Classification:
- Apply k-means to group similar data points.
- Train Random Forest classifiers on clustered data for improved accuracy.
Model Evaluation: Assess performance using detailed metrics and confusion matrices.
Deployment Plan: Strategies for integrating the system into operational networks.
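
The clustering and classification step can be sketched as follows. This is one plausible reading of "Random Forest classifiers on clustered data": k-means partitions the samples, and a separate forest is trained inside each cluster. The class name `ClusteredForest`, the number of clusters, and the synthetic data from `make_classification` are assumptions; the notebook may combine the two models differently.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split


class ClusteredForest:
    """Hypothetical helper: k-means routing plus one Random Forest per cluster."""

    def __init__(self, n_clusters=4, random_state=0):
        self.kmeans = KMeans(n_clusters=n_clusters, n_init=10, random_state=random_state)
        self.random_state = random_state
        self.models = {}

    def fit(self, X, y):
        self.label_dtype_ = y.dtype
        # Majority class, used only if a cluster has no stored model.
        values, counts = np.unique(y, return_counts=True)
        self.default_ = values[np.argmax(counts)]
        clusters = self.kmeans.fit_predict(X)
        for c in np.unique(clusters):
            mask = clusters == c
            present = np.unique(y[mask])
            if len(present) == 1:
                # Pure cluster: store the single label instead of a forest.
                self.models[int(c)] = present[0]
            else:
                forest = RandomForestClassifier(n_estimators=50, random_state=self.random_state)
                self.models[int(c)] = forest.fit(X[mask], y[mask])
        return self

    def predict(self, X):
        clusters = self.kmeans.predict(X)
        pred = np.empty(len(X), dtype=self.label_dtype_)
        for c in np.unique(clusters):
            mask = clusters == c
            model = self.models.get(int(c), self.default_)
            if isinstance(model, RandomForestClassifier):
                pred[mask] = model.predict(X[mask])
            else:
                pred[mask] = model
        return pred


# Synthetic stand-in for the preprocessed KDDCUP99 features.
X, y = make_classification(n_samples=600, n_features=10, n_informative=5,
                           n_classes=3, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)

model = ClusteredForest(n_clusters=4).fit(X_train, y_train)
pred = model.predict(X_test)
print("accuracy:", float((pred == y_test).mean()))
```

Training per cluster lets each forest specialize on a more homogeneous slice of traffic, at the cost of fewer training samples per model.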

Installation

Clone the repository:

```
git clone https://github.com/shayan02/network-anomaly-detection.git
cd network-anomaly-detection
```

Notebook Overview

The project is implemented in a Jupyter Notebook, consisting of 89 code cells covering the following aspects:

Data preprocessing and cleaning.
Feature engineering and selection.
Implementation of k-means clustering and Random Forest classifiers.
Detailed evaluation and visualization of results.

Sample Code Snippets

Initial Setup:

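```
import os
from collections import defaultdict
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from google.colab import drive  # available only when running on Google Colab
import warnings

# IPython magic: works only inside a Jupyter/Colab notebook cell
%matplotlib inline
```

Warnings Filter:
```
# Suppress all warnings to keep notebook output readable
warnings.filterwarnings('ignore')
```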

Usage

Modify the configuration file to specify dataset paths and parameters.
Run the scripts to preprocess data, train models, and evaluate performance.
Use the trained model to monitor network traffic for anomalies.

Results

The system demonstrates high accuracy in detecting and classifying various types of network attacks. Performance metrics and visualizations are included in the project for detailed analysis.
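
For reference, the metrics listed under Features (precision, recall, F1 score, ROC) and the confusion matrix can be computed with scikit-learn as in the sketch below. The labels and scores are made-up toy values, not results from this project, and the `0.5` decision threshold is an assumption.

```python
import numpy as np
from sklearn.metrics import (
    confusion_matrix,
    precision_recall_fscore_support,
    roc_auc_score,
)

# Toy binary example: 0 = normal, 1 = attack (values invented for illustration).
y_true = np.array([0, 0, 0, 0, 1, 1, 1, 1])
y_score = np.array([0.1, 0.4, 0.35, 0.8, 0.2, 0.7, 0.9, 0.6])  # predicted attack probability
y_pred = (y_score >= 0.5).astype(int)  # assumed decision threshold

precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="binary"
)
cm = confusion_matrix(y_true, y_pred)   # rows: true class, columns: predicted class
auc = roc_auc_score(y_true, y_score)    # area under the ROC curve

print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f} auc={auc:.3f}")
print(cm)
```

For the multi-class task, passing `average="macro"` or `average=None` to `precision_recall_fscore_support` reports per-category behavior, which matters for rare classes such as U2R.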

Future Enhancements

Incorporate deep learning models for better handling of complex patterns.
Adapt the system for real-time data streams.
Expand testing with more recent and diverse datasets.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contributing

Contributions are welcome! Please open an issue or submit a pull request to propose changes or report bugs.

Acknowledgments

Dataset: KDDCUP99 Dataset
Techniques: Inspired by advancements in machine learning for cybersecurity.

