GitHub - Shanthi-1821/Software_Development_ML_project: Software Development : AI-ML Supervised Classifier project on Security Authentication Algorithm

Overview

This project uses a Machine Learning model to authenticate users based on typing behavior and allows only authenticated users to encrypt and send messages. The messages are securely decrypted by the receiver. Unauthorized users are blocked from accessing the encryption process.

Dataset Used

Dataset Source: Kaggle (Keystroke Dynamics)

Link: https://www.kaggle.com/datasets/carnegiecylab/keystroke-dynamics-benchmark-data-set

the libraries used

Machine Learning & Data Handling

pandas – for handling and analyzing the dataset (import pandas as pd)
numpy – for numerical operations (import numpy as np)
scikit-learn (sklearn) – for ML model and evaluation:
- train_test_split – to split data into train/test
- RandomForestClassifier – ML model
- accuracy_score – to check model accuracy
- classification_report, confusion_matrix (if used)

Security & Encryption

cryptography – for message encryption/decryption (from cryptography.fernet import Fernet)
hashlib – to generate SHA-256 hash of a message

Visualization

matplotlib.pyplot – to plot feature importance chart (import matplotlib.pyplot as plt)

How It Works

Data Preprocessing: Clean the dataset, label users as real or fake

Model Training: Split data and train ML model for authentication

User Authentication: Predict if the user is authorized

Encryption: If authenticated, encrypt message using Fernet

Decryption: Decrypt the message for authorized receivers

Hashing (optional): Ensure message integrity

ML Model Details

Algorithm: Random Forest Classifier

Input Features: Typing durations, delays

Target: 1 for authenticated user, 0 for others

Accuracy: ~100% (on test set)

Demo

Run the full notebook on Google Colab

Input a message manually or select from file

Get encrypted and decrypted results

View real/fake user authentication in action

Key Functionalities

Upload Dataset (.csv) from local machine

Preprocess Data and label a specific user (e.g., s027) as the real user

Train a Machine Learning Model (Random Forest Classifier)

Authenticate User (real or fake)

Encrypt Message using Fernet (symmetric AES encryption)

Decrypt Message on receiver side if authenticated

Dataset Format

Columns: subject, sessionIndex, rep, H.period, DD.period.t, UD.period.t, etc.

Real user label is set based on the subject column (e.g., 's027' is real, others are fake)

Future Enhancements

Real-time user typing input capture (instead of static data)

GUI-based front end or integration with a messaging platform

Add hashing or digital signature for integrity verification

Sample Outputs

User Authenticated. Proceeding to encrypt message...

Encrypted: gAAAAABi..

Decrypted: This is a confidential message.

Authentication Failed. Message encryption blocked.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
Secure_message_Authentication.ipynb		Secure_message_Authentication.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages