🛡️ CodeGuardian

Autonomous Code Security & Quality Review Agent

Submit a Pull Request — CodeGuardian scans it with 3 detection layers running in parallel, generates validated code fixes, writes missing tests, and posts a structured security report directly on your PR. Automatically. In ~40 seconds.

Built by Dhanush Kumar · Final Year B.E. (AIML) · 2026

🎯 What It Does

CodeGuardian is a production-grade multi-agent AI system that automates the entire code security review process. When a developer opens a Pull Request, CodeGuardian:

Detects vulnerabilities using 3 parallel scanning layers (static analysis + ML + LLM)
Filters false positives using an LLM that explains why each finding is real or noise
Generates code fixes with category-specific templates, validated with ast.parse()
Writes unit tests covering the fixed functions, targeting the exact vulnerabilities found
Posts a structured report directly as a GitHub PR comment with verdict, severity breakdown, and collapsible issue details
Generates a PDF report for documentation and audit trails

⚙️ How It Works

┌─────────────────────────────────────────────────────────────┐
│                    GitHub Pull Request                       │
└─────────────────────────┬───────────────────────────────────┘
                          │ webhook
                          ▼
┌─────────────────────────────────────────────────────────────┐
│                     PARSER AGENT                            │
│         Language detection · AST extraction                  │
│         Function-level code unit isolation                   │
└──────────────────┬──────────────────────────────────────────┘
                   │
        ┌──────────┴──────────┐
        │   LangGraph fan-out  │  ← runs in parallel
        ▼                     ▼
┌───────────────┐    ┌─────────────────┐
│ SECURITY      │    │ QUALITY AGENT   │
│ AGENT         │    │                 │
│               │    │ · Radon CC/MI   │
│ · Bandit      │    │ · AST checks    │
│   68 checks   │    │   - Long funcs  │
│ · Semgrep     │    │   - Deep nesting│
│   1000+ rules │    │   - Missing docs│
│ · CodeBERT    │    │ · LLM review    │
│   ML model    │    │   - Naming      │
│ · LLM enrich  │    │   - SRP         │
│   FP filter   │    │   - Duplication │
└───────┬───────┘    └────────┬────────┘
        └──────────┬──────────┘
                   │ merged results
                   ▼
┌─────────────────────────────────────────────────────────────┐
│                      FIX AGENT                              │
│   Category-specific fix templates · ast.parse() validation  │
│   SQL injection → parameterized queries                      │
│   Shell injection → list args, no shell=True                 │
│   Hardcoded secrets → os.getenv()                            │
│   Weak crypto → SHA-256 / bcrypt                             │
└─────────────────────────────────────────────────────────────┘
                          │
                          ▼
┌─────────────────────────────────────────────────────────────┐
│                   TEST WRITER AGENT                         │
│   pytest (Python) · jest (JS/TS)                            │
│   Happy path + edge cases + security regression tests        │
└─────────────────────────────────────────────────────────────┘
                          │
                          ▼
┌─────────────────────────────────────────────────────────────┐
│                    REVIEWER AGENT                           │
│   Verdict · GitHub PR comment · PDF report                  │
└─────────────────────────────────────────────────────────────┘

🔍 Detection Layers

#	Layer	Tool	What It Catches
1	Static Analysis	Bandit	68 Python checks — SQL injection (CWE-89), shell injection (CWE-78), hardcoded passwords (CWE-259), weak crypto (CWE-327), insecure deserialization (CWE-502)
2	Pattern Matching	Semgrep	1000+ community rules — Python, JavaScript, TypeScript, JSX, TSX. Custom YAML rules supported.
3	ML Classification	CodeBERT	Function-level binary classifier fine-tuned on CodeXGLUE Defect Detection (Devign dataset). Catches contextual/logic-level vulnerabilities static tools miss.
4	LLM Enrichment	LLaMA 3.3 70B	Reviews all scanner findings, dismisses false positives with explanation, adds exploitation context, surfaces issues scanners missed.

🏆 Why CodeGuardian

Feature	GitHub Copilot	SonarQube	CodeGuardian
Multi-layer detection	❌	Partial	✅ Bandit + Semgrep + CodeBERT
Fine-tuned ML classifier	❌	❌	✅ CodeBERT on CodeXGLUE
Auto-generated fixes	❌	❌	✅ Syntax validated
Auto-generated tests	❌	❌	✅ pytest + jest
LLM false positive filtering	❌	❌	✅ Explains every dismissal
Parallel agent execution	❌	❌	✅ LangGraph fan-out
Posts PR comment	✅	❌	✅ Structured markdown
PDF audit report	❌	Paid	✅ Free
Total cost	Paid	Paid	$0

📊 Real Results

Tested on data/samples/vulnerable_example.py — 9 deliberate vulnerabilities:

CODEGUARDIAN REPORT
════════════════════════════════════════════════════
Verdict  : REQUEST_CHANGES
Summary  : Found 24 security issue(s) and 18 quality issue(s)
           across 1 file(s). 0 false positive(s) dismissed.
           10 fix(es) and 0 test file(s) generated.

Severity Breakdown:
  🔴 Critical : 0
  🟠 High     : 13
  🟡 Medium   : 7
  🔵 Low      : 4

Issues Detected:
  [HIGH  ] Subprocess Popen With Shell Equals True (line 24) [bandit]
  [HIGH  ] Request With No Cert Validation        (line 68) [bandit]
  [HIGH  ] Start Process With A Shell             (line 75) [bandit]
  [HIGH  ] Subprocess Shell True                  (line 24) [semgrep]
  [HIGH  ] Shell Injection                        (line 24) [codebert]
  [HIGH  ] Insecure Deserialization               (line 40) [codebert]
  [HIGH  ] Arbitrary Code Execution               (line 52) [codebert]
  [HIGH  ] TLS Verification Disabled              (line 65) [codebert]
  [HIGH  ] Insecure API Key Storage               (line 20) [llm]

Detection Rate : 9/9 vulnerabilities found = 100%
Pipeline Time  : ~40 seconds end-to-end
Fix Validation : 9/10 fixes passed ast.parse()
════════════════════════════════════════════════════

🚀 Quick Start

Prerequisites

Python 3.10+
Node.js 18+ (frontend only)
Groq API key (free)
GitHub Token (free)

Setup

git clone https://github.com/Dhanushhuu/codeguardian
cd codeguardian

# Create virtual environment
python -m venv venv
venv\Scripts\activate          # Windows
# source venv/bin/activate     # Mac/Linux

# Install dependencies
pip install -r requirements.txt

# Configure environment
cp .env.example .env
# Edit .env — add GROQ_API_KEY and GITHUB_TOKEN

Run

# Start the API server
uvicorn api.main:app --reload --port 8000

# In a new terminal — run a review
python run_review.py

# Open Swagger UI
# http://localhost:8000/docs

Frontend

cd frontend
npm install
npm run dev
# Open http://localhost:5173

Test

# Tool-level tests (no API key needed)
python tests/test_pipeline.py --tools-only

# Full pipeline test (needs GROQ_API_KEY)
python tests/test_pipeline.py

📡 API Reference

Method	Endpoint	Description
`GET`	`/`	Health check
`POST`	`/review`	Submit code for manual review
`GET`	`/status/{id}`	Job status + live event log
`GET`	`/stream/{id}`	SSE stream of agent activity
`GET`	`/results/{id}`	Full JSON report
`GET`	`/results/{id}/pdf`	Download PDF report
`POST`	`/webhook/github`	GitHub PR webhook receiver

Example:

python run_review.py
# Submits vulnerable_example.py and polls for results
# Saves full report to review_result.json

🛠 Tech Stack — Total Cost: $0

Component	Technology	Why
Agent Framework	LangGraph	Parallel fan-out, state management
LLM	Groq — LLaMA 3.3 70B	Free tier, fastest inference (~200 tok/s)
Python Security	Bandit	68 checks, CWE + OWASP mapping
Multi-lang Security	Semgrep	1000+ maintained community rules
ML Classifier	CodeBERT (microsoft)	Function-level vulnerability detection
ML Dataset	CodeXGLUE Defect Detection	21,854 labeled C functions
Code Quality	Radon	Cyclomatic complexity + maintainability index
GitHub Integration	PyGithub + Webhooks	PR comment posting
PDF Reports	ReportLab	Audit-ready PDF generation
Backend	FastAPI + SSE	Async, streaming, auto-docs
Frontend	React + Vite	Dark dashboard, live event log

📁 Project Structure

codeguardian/
├── agents/
│   ├── parser_agent.py        # File parsing + AST function extraction
│   ├── security_agent.py      # 3-layer parallel scan + LLM enrichment
│   ├── quality_agent.py       # Radon + AST + LLM quality review
│   ├── fix_agent.py           # Template-guided fixes + ast.parse() validation
│   ├── test_writer_agent.py   # pytest/jest test generation
│   └── reviewer_agent.py      # Verdict + GitHub PR comment + PDF report
│
├── tools/
│   ├── pr_parser.py           # Language detection + AST function extraction
│   ├── bandit_scanner.py      # Bandit with CWE + OWASP mapping
│   ├── semgrep_scanner.py     # Multi-language Semgrep integration
│   ├── codebert_classifier.py # CodeBERT base + fine-tuned inference
│   └── quality_analyzer.py   # Radon CC/MI + AST quality checks
│
├── graph/
│   ├── state.py               # ReviewState TypedDict (Annotated for parallel nodes)
│   └── pipeline.py            # LangGraph graph — fan-out + fan-in
│
├── api/
│   └── main.py                # FastAPI + webhook + SSE streaming + PDF download
│
├── frontend/
│   └── src/App.jsx            # React dark dashboard with live agent log
│
├── finetuning/
│   └── train.py               # CodeBERT LoRA fine-tuning on CodeXGLUE
│
├── data/
│   └── samples/
│       └── vulnerable_example.py  # 9 deliberate vulnerabilities for testing
│
├── tests/
│   └── test_pipeline.py       # Integration + tool-level tests
│
├── run_review.py              # Submit + poll script
├── requirements.txt
├── .env.example
└── EVALUATION.md              # Real benchmark numbers after testing

🔐 Environment Variables

# Required
GROQ_API_KEY=your_groq_api_key          # console.groq.com (free)
GITHUB_TOKEN=your_github_token           # Fine-grained, repo read + PR write
GITHUB_WEBHOOK_SECRET=your_secret        # openssl rand -hex 32

# Optional
USE_FINETUNED_MODEL=false                # true after running finetuning/train.py
CODEBERT_MODEL_PATH=data/models/codebert-vulnerability-detector
REPORT_OUTPUT_DIR=data/reports
LOG_LEVEL=INFO

Built with LangGraph · Groq · Bandit · Semgrep · CodeBERT · FastAPI · React

Final Year Project — B.E. Artificial Intelligence & Machine Learning — 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛡️ CodeGuardian

Autonomous Code Security & Quality Review Agent

📋 Table of Contents

🎯 What It Does

⚙️ How It Works

🔍 Detection Layers

🏆 Why CodeGuardian

📊 Real Results

🚀 Quick Start

Prerequisites

Setup

Run

Frontend

Test

📡 API Reference

🛠 Tech Stack — Total Cost: $0

📁 Project Structure

🔐 Environment Variables

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
agents		agents
api		api
data/samples		data/samples
finetuning		finetuning
frontend		frontend
graph		graph
tests		tests
tools		tools
.gitignore		.gitignore
EVALUATION.md		EVALUATION.md
README.md		README.md
check_result.py		check_result.py
conftest.py		conftest.py
requirements.txt		requirements.txt
run_review.py		run_review.py
send_review.py		send_review.py

Folders and files

Latest commit

History

Repository files navigation

🛡️ CodeGuardian

Autonomous Code Security & Quality Review Agent

📋 Table of Contents

🎯 What It Does

⚙️ How It Works

🔍 Detection Layers

🏆 Why CodeGuardian

📊 Real Results

🚀 Quick Start

Prerequisites

Setup

Run

Frontend

Test

📡 API Reference

🛠 Tech Stack — Total Cost: $0

📁 Project Structure

🔐 Environment Variables

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages