SentinEdge: Real-Time Fraud Detection Engine

SentinEdge is a high-throughput, stateful streaming pipeline designed to identify financial anomalies in real-time. Built for the modern data stack, it leverages the Medallion Architecture to move data from raw ingestion to audited, high-performance storage.

🏗️ Architecture

The system is composed of four distinct layers:

Ingestion (Bronze): A Python-based producer simulating live financial traffic, streaming JSON transactions into Apache Kafka (KRaft mode).
Processing (Silver): PySpark Structured Streaming serves as the brain, performing real-time ETL, schema enforcement, and anomaly detection.
Feature Store: Amazon DynamoDB (Local) acts as a low-latency state provider, tracking historical user benchmarks (rolling averages) for millisecond lookups.
Storage (Gold): Enriched transactions are persisted as Snappy-compressed Parquet files, enabling post-hoc analytical queries and MLOps evaluation.

📊 Performance Benchmarks

As of February 2026, the engine achieves the following metrics on a standard transaction stream:

Overall Accuracy: 99.21%
Precision: 94.03% (Low false-positive rate for customer experience)
Recall: 90.00% (High fraud capture rate)

📂 Project Structure

SentinEdge/
├── data_lake/               # Local Parquet storage (Silver/Gold layers)
├── images/                  # Performance report screenshots
├── docker-compose.yml       # Infrastructure (Kafka & DynamoDB)
├── processor.py             # PySpark streaming logic & DynamoDB UDFs
├── producer.py              # Synthetic transaction generator
├── init_db.py               # Feature Store schema initialization
├── test_attack.py           # Manual fraud event simulator
├── check_metrics.py         # Real-time Feature Store monitor
├── evaluator.py             # Accuracy/Precision/Recall auditor
├── run_all.sh               # Automation orchestrator
└── requirements.txt         # Project dependencies

🚀 Getting Started

Prerequisites

Docker & Docker Compose
Python 3.9+
Java 17 (for PySpark)

Installation & Execution

Clone the repository:

git clone [https://github.com/yourusername/SentinEdge.git](https://github.com/yourusername/SentinEdge.git)
cd SentinEdge

Setup Environment:

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt

Start Infrastructure:

docker-compose up -d

Launch the Pipeline: The orchestrator handles database initialization and launches the Processor and Producer in separate terminal windows.

chmod +x run_all.sh
./run_all.sh

🛠️ Key Technical Challenges Solved

Distributed Serialization: Resolved _thread.lock pickling errors by implementing Lazy Initialization of Boto3 resources within the Spark UDF executor context.
Cold Start Handling: Implemented "Initializing" states for new users to prevent false positives before a reliable behavioral baseline is established.
Fault Tolerance: Utilized Spark Checkpointing to ensure exactly-once processing and pipeline resilience across system restarts.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SentinEdge: Real-Time Fraud Detection Engine

🏗️ Architecture

📊 Performance Benchmarks

📂 Project Structure

🚀 Getting Started

Installation & Execution

🛠️ Key Technical Challenges Solved

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
images		images
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
analyze_lake.py		analyze_lake.py
check_metrics.py		check_metrics.py
docker-compose.yml		docker-compose.yml
evaluator.py		evaluator.py
init_db.py		init_db.py
processor.py		processor.py
producer.py		producer.py
requirements.txt		requirements.txt
run_all.sh		run_all.sh
test_attack.py		test_attack.py

License

rukmini-17/SentinEdge

Folders and files

Latest commit

History

Repository files navigation

SentinEdge: Real-Time Fraud Detection Engine

🏗️ Architecture

📊 Performance Benchmarks

📂 Project Structure

🚀 Getting Started

Installation & Execution

🛠️ Key Technical Challenges Solved

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages