Test your AI agents. Catch failures before your users do.
Quick Links: Quickstart · Assertions · Failure Types · Contributing
```shell
pip install "playagent[all]"
```

```python
from playagent import record
from playagent.adapters.openai import OpenAI

client = OpenAI()

@record
def run_agent(user_input: str):
    return client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": user_input}],
    )
```

Recording a run produces a session trace like this:

```
session   sess_a1b2c3d4
agent     run_agent
started   2026-04-05 09:14:32
duration  3.24s
status    passed
──────────────────────── turn 1 ─────────────────────────
model     gpt-4o
latency   812ms
▸ user
  What's the weather in Lagos today?
▸ assistant
  I'll look that up for you.
⬡ tool call  get_weather
  location  "Lagos, NG"
  units     "celsius"
```
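Conceptually, a recording decorator like `@record` wraps the agent function and captures its inputs, output, and timing. The sketch below is a minimal plain-Python illustration of that idea, not PlayAgent's actual implementation (which persists to SQLite rather than an in-memory list):

```python
import functools
import time

# In-memory trace store for illustration only; PlayAgent itself
# writes sessions to SQLite on disk.
TRACES = []

def record(fn):
    """Toy recording decorator: captures args, result, and duration."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        result = fn(*args, **kwargs)
        TRACES.append({
            "agent": fn.__name__,
            "args": args,
            "kwargs": kwargs,
            "duration_s": time.perf_counter() - start,
            "result": result,
        })
        return result
    return wrapper

@record
def run_agent(user_input: str):
    # Stand-in for a real model call.
    return f"echo: {user_input}"

run_agent("What's the weather in Lagos today?")
```

The key design point is that recording is transparent: the wrapped function's return value is unchanged, so callers don't need to know tracing is happening.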
| Command | What it does |
|---|---|
| `playagent trace list` | Lists recent trace sessions. |
| `playagent trace view <trace_id>` | Shows turn-by-turn trace details. |
| `playagent report` | Shows aggregate pass/fail counts and failure breakdowns. |
| `playagent report --format json` | Emits report stats as JSON for CI pipelines. |
| `playagent --version` | Prints the installed PlayAgent version. |
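The JSON report is meant for CI consumption. The schema below is a hypothetical example (the real field names may differ; check the output of your installed version), but a CI gate script would follow this shape:

```python
import json

# Hypothetical output of `playagent report --format json`; the real
# field names may differ -- adjust to what your version actually emits.
raw = '{"sessions": 20, "passed": 19, "failed": 1}'

report = json.loads(raw)
pass_rate = report["passed"] / report["sessions"]
print(f"pass rate: {pass_rate:.0%}")

# Fail the CI job if the pass rate drops below a threshold.
if pass_rate < 0.95:
    raise SystemExit(f"pass rate {pass_rate:.0%} below 95% threshold")
```

In a pipeline you would pipe the command's stdout into a script like this and let the nonzero exit code fail the build.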
- You stay local-first. PlayAgent writes to SQLite on your machine; nothing is sent to a hosted dashboard by default.
- You can test behavior, not only outputs. Assertions check tool-call order, parameters, and call counts directly.
- If you already use LangSmith, PlayAgent is a lighter-weight option for local, SDK-level checks; if you need hosted traces, team collaboration, and observability dashboards, LangSmith is the better fit.
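Behavioral checks of the kind described above can be expressed as plain assertions over a recorded trace. This sketch uses a hand-built trace dict and a hypothetical second tool (`format_reply`) rather than PlayAgent's actual assertion API or storage format:

```python
# A simplified trace: the ordered tool calls recorded during one session.
# The shape and the `format_reply` tool are illustrative assumptions.
trace = {
    "tool_calls": [
        {"name": "get_weather", "args": {"location": "Lagos, NG", "units": "celsius"}},
        {"name": "format_reply", "args": {"style": "short"}},
    ]
}

call_names = [c["name"] for c in trace["tool_calls"]]

# Order: the weather lookup must happen before the reply is formatted.
assert call_names.index("get_weather") < call_names.index("format_reply")

# Parameters: the lookup used the expected location.
assert trace["tool_calls"][0]["args"]["location"] == "Lagos, NG"

# Call count: exactly one weather lookup per run.
assert call_names.count("get_weather") == 1
```

This is what "testing behavior, not only outputs" means in practice: the assertions target the sequence and arguments of tool calls, not just the final text the model returns.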
MIT