Skip to content

ashish993/GenAI-Comprehensive-Security

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GenAI Comprehensive Security Guide

A curated reference for securing generative AI systems.

OWASP Top 10 for LLMs (2025)

# Vulnerability Plain English
LLM01 Prompt Injection Attacker hijacks model via crafted input
LLM02 Insecure Output Handling Untrusted output causes XSS/SQLi
LLM03 Training Data Poisoning Malicious training data alters behavior
LLM04 Model Denial of Service Expensive inputs exhaust resources
LLM05 Supply Chain Compromised base models or plugins
LLM06 Sensitive Info Disclosure Model leaks PII or training data
LLM07 Insecure Plugin Design Over-permissioned plugins exploited
LLM08 Excessive Agency Agent acts too autonomously
LLM09 Overreliance Users trust outputs without verification
LLM10 Model Theft Model extracted via repeated queries

Defensive Tools

Tool What It Does
LLM Guard Scans prompts/responses for threats
NeMo Guardrails Defines allowed conversation flows
Guardrails AI Validates LLM output schema
Rebuff Detects prompt injection

Red Team Tools

Tool Purpose
Garak Automated LLM vulnerability scanner
PyRIT Microsoft AI red-teaming framework
ART IBM Adversarial Robustness Toolbox

Real-World Incidents

Samsung (2023): Engineers pasted proprietary code into ChatGPT. Data entered training set. Lesson: Never paste confidential data into public AI services.

Chevrolet Chatbot (2023): Customer tricked dealership bot into $1 truck price. Lesson: System constraints matter, not just prompts.

Air Canada (2024): Chatbot gave wrong refund policy. Company held legally liable. Lesson: Add human escalation for policy questions.

About

No description, website, or topics provided.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors