An agent skill for eval-driven development of LLM-powered applications.
The eval-driven-dev skill guides your coding agent through the full QA loop for LLM applications:
- Understand the app — read the codebase, trace the data flow, learn what the app is supposed to do
- Instrument it — add `enable_storage()` and `@observe` so every run is captured to a local SQLite database
- Build a dataset — save representative traces as test cases with `pixie dataset save`
- Write eval tests — generate `test_*.py` files with `assert_dataset_pass` and appropriate evaluators
- Run the tests — `pixie test` to run all evals and report per-case scores
- Investigate failures — look up the stored trace for each failure, diagnose, fix, repeat
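To make the "instrument it" step concrete, here is a toy stand-in for what the skill wires up. pixie's real `enable_storage()` and `@observe` differ in implementation and signature (this sketch is not the package's API), but the idea is the same: every decorated call is recorded as a trace row in a local SQLite database.

```python
import functools
import json
import sqlite3

# Hypothetical path for illustration; the real package chooses its own storage location.
DB_PATH = "traces.db"

def enable_storage(db_path: str = DB_PATH) -> None:
    """Create the local trace store (toy version)."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS traces "
        "(id INTEGER PRIMARY KEY, fn TEXT, inputs TEXT, output TEXT)"
    )
    conn.commit()
    conn.close()

def observe(fn):
    """Record each call's inputs and output as a trace row (toy version)."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        result = fn(*args, **kwargs)
        conn = sqlite3.connect(DB_PATH)
        conn.execute(
            "INSERT INTO traces (fn, inputs, output) VALUES (?, ?, ?)",
            (fn.__name__, json.dumps([list(args), kwargs]), json.dumps(result)),
        )
        conn.commit()
        conn.close()
        return result
    return wrapper

enable_storage()

@observe
def answer(question: str) -> str:
    # stand-in for the real LLM call / agent logic
    return f"echo: {question}"

answer("What does the app do?")
```

Once runs are captured like this, each stored trace can later be promoted to a dataset test case.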
```
npx openskills install yiouli/pixie-qa
```

The accompanying Python package is installed automatically by the skill when it's used.
When developing a Python-based AI project, open a conversation and say something like:
"setup QA for my agent"
Your coding agent will read your code, instrument it, build a dataset from a few real runs, write and run eval-based tests, investigate failures, and fix them.
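The eval-test step of that loop can be illustrated with a minimal stand-in. pixie's `assert_dataset_pass` and the `pixie test` runner replace all of this (the function bodies and names below are illustrative, not the package's API), but conceptually an eval test replays each saved case through the app, scores the result with an evaluator, and reports per-case scores:

```python
# Toy stand-in for an eval test over a saved dataset.

def answer(question: str) -> str:
    # stand-in for the instrumented app under test
    return "Paris" if "France" in question else "unknown"

# a "dataset" of saved cases: (input, expected) pairs
DATASET = [
    ("What is the capital of France?", "Paris"),
    ("What is the capital of Atlantis?", "unknown"),
]

def exact_match(output: str, expected: str) -> float:
    """A simple evaluator: 1.0 on exact match, else 0.0."""
    return 1.0 if output == expected else 0.0

def assert_dataset_pass(dataset, evaluator, threshold=1.0):
    """Run every case, collect per-case scores, fail below threshold."""
    scores = {q: evaluator(answer(q), exp) for q, exp in dataset}
    failures = {q: s for q, s in scores.items() if s < threshold}
    assert not failures, f"failing cases: {failures}"
    return scores

scores = assert_dataset_pass(DATASET, exact_match)
```

When a case fails, the stored trace for that run is what you would inspect to diagnose the failure before fixing and re-running.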
The pixie-qa Python package (imported as `pixie`) is what Claude installs and uses inside your project. For the package API and CLI reference, see `docs/package.md`.