Awesome Attacks AI

A curated list of modern attacks against Artificial Intelligence

AI Attacks

ChatGPT "DAN" (and other "jailbreaks")

Papers

Blog Post

Prompt injection explained, with video, slides, and a transcript

Tools

AI Detectors

GPTZero - An app that quickly detects whether an essay was written by ChatGPT or by a human.
DetectGPT - Zero-Shot Machine-Generated Text Detection using Probability Curvature
AI Content Detector - Copyleaks - Verify what content was written by a human or an AI chatbot with the AI Content Detector Chrome extension from Copyleaks.
ai-text-classifier - The AI Text Classifier (from OpenAI) is a fine-tuned GPT model that predicts how likely it is that a piece of text was generated by AI from a variety of sources, such as ChatGPT.
Hello-SimpleAI/chatgpt-detector-roberta - A ChatGPT text detector fine-tuned from the roberta-base checkpoint on all Hello-SimpleAI/HC3 data (without a held-out split) for 1 epoch.

AI Anti Detectors

AI Security Community

AI Village
