Promptfoo Deep Dive Documentation

This is a companion documentation repo for promptfoo — the open-source LLM evaluation and red-teaming framework.

For the source code, official docs, and latest releases, visit the original repo: github.com/promptfoo/promptfoo

Official website: promptfoo.dev

Why This Repo Exists

The official promptfoo project is a large, sophisticated codebase. This repo provides comprehensive, beginner-friendly documentation that covers the architecture, internals, and usage of promptfoo in depth — designed for developers and users who want to truly understand how everything works under the hood, not just follow quick-start recipes.

Whether you come from a C/C++/Java background and are new to full-stack web development, or you're a user who wants to go beyond the basics, these guides will take you from zero to hero.

Documentation Index

Document	Audience	Description
Architecture Guide	Developers	System design with ASCII diagrams at multiple abstraction levels, evaluation pipeline, provider/assertion/red-team systems, database schema, communication flows, key file reference
Developer Guide	New Developers	Step-by-step environment setup, tech stack explained for C/Java developers, project structure walkthrough, how to read the code, making your first change, testing, debugging, git workflow
User Quick Start	Users	11 hands-on use cases with complete configs: translation eval, model comparison, JSON validation, LLM-as-judge, RAG evaluation, moderation, multi-turn conversations, local models (Ollama), red teaming, CI/CD, custom assertions
Study Plan	Learners	11-phase zero-to-hero learning path with theory + implementation + hands-on exercises + checkpoints for each module, recommended file reading order, capstone projects
Security Guide	Security Engineers	LLM threat landscape, how promptfoo addresses each threat, red team architecture, attack plugins and strategies, grading system, promptfoo's own security model, real-world scenarios

What is Promptfoo?

Promptfoo is a CLI and library for evaluating and red-teaming LLM applications. It lets you:

Test prompts and models with automated evaluations and 70+ assertion types
Compare models side-by-side (OpenAI, Anthropic, Google, Azure, Ollama, and 50+ more)
Red team your AI with automated adversarial attacks to find security vulnerabilities
Automate checks in CI/CD to catch regressions before deployment
Scan code for LLM-related security issues in pull requests

Quick Links to the Official Project

These docs were created by studying the promptfoo codebase (v0.121.1). For the latest changes, always check the original repo.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
docs		docs
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Promptfoo Deep Dive Documentation

Why This Repo Exists

Recommended Video

Documentation Index

What is Promptfoo?

Quick Links to the Official Project

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Promptfoo Deep Dive Documentation

Why This Repo Exists

Recommended Video

Documentation Index

What is Promptfoo?

Quick Links to the Official Project

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages