ClawSpec

Contract-first QA for OpenClaw skills and agents.

ClawSpec gives you one testing surface for four things:

Contract	File	Purpose
Structural validation	`SKILL.md` / `SOUL.md`	Enforce required sections, permissions, gates, and safety rules
Scenario contract	`tests/scenarios.yaml`	Test a single skill or agent outcome
Handoff contract	`tests/handoffs/*.yaml`	Test what one stage passes to another
Pipeline contract	`tests/pipeline.yaml`	Test multi-stage orchestration health

ClawSpec ships:

clawspec validate
clawspec run
clawspec init
clawspec coverage
clawspec baseline — performance baseline capture and regression detection
29 shipped assertion types (19 standard + 10 trace-aware)
Observability integration via Opik — one-click trace debugging from any failing assertion
a Python API and an OpenClaw skill wrapper

Install

uv tool install .
# or
uv pip install -e .

Quick start

clawspec validate examples/basic/skills/hello
clawspec init examples/basic/skills/hello --force
clawspec run examples/basic/skills/hello --scenario hello-smoke --dry-run
clawspec coverage --ledger examples/ceremonia/coverage-ledger.yaml --json

Opik observability integration

Connect ClawSpec to Opik to get trace URLs in every report and unlock 10 trace-aware assertion types.

pip install clawspec[opik]
export OPIK_API_KEY=your-key

Add to clawspec.yaml:

observability:
  backend: opik
  opik:
    project_name: my-project
    workspace: my-workspace

When observability is active, each report includes a trace_url pointing directly to the run in the Opik dashboard. See docs/observability-integration.md for full setup instructions.

Baselines and regression detection

Capture performance baselines to detect regressions in cost, latency, and token usage:

clawspec baseline capture skills/my-skill --runs 20
clawspec baseline show skills/my-skill

Add a regression: block to any scenario contract to enforce drift thresholds in CI. See docs/baselines-and-regression-detection.md.

Using with ClawScaffold

ClawScaffold is the spec-first target lifecycle manager for OpenClaw. When used together, ClawScaffold auto-generates ClawSpec test scenarios during agent/skill adoption and creation.

pip install clawscaffold[spec]   # installs both packages

When both are installed:

clawscaffold adopt auto-generates tests/scenarios.yaml with ClawSpec assertions
clawscaffold audit uses ClawSpec validation for structural checks
clawscaffold create scaffolds test contracts alongside the target spec

ClawSpec works independently — you don't need ClawScaffold to write and run test contracts.

Related Projects

ClawWrap — Spec-first outbound message routing gate (policy enforcement, audit trail)
ClawScaffold — Spec-first target lifecycle manager (interactive interviews, adoption, auditing)
OpenClaw — Local-first AI agent framework (LLM + chat channels + Markdown skills)

Config

ClawSpec looks for clawspec.yaml from the target directory upward.

gateway_base_url: "http://127.0.0.1:18789"
gateway_webhook_endpoint: "/webhook/mcp-skill-invoke"
report_dir: "reports"
gateway_log_pattern: "/tmp/openclaw/openclaw-{date}.log"
scenario_patterns:
  - "skills/**/tests/scenarios.yaml"
  - "agents/**/tests/scenarios.yaml"
ledger_path: "coverage-ledger.yaml"

Compiler-managed repos can point ClawSpec at generated QA artifacts by overriding scenario_patterns and ledger_path. For example:

scenario_patterns:
  - "compiler/generated/qa/**/scenarios.yaml"
ledger_path: "docs/testing/coverage-ledger.yaml"

Python API

from clawspec import coverage, init, run, validate

validate("examples/basic/skills/hello")
init("examples/basic/skills/hello", force=True)
run("examples/basic/skills/hello", scenario="hello-smoke", dry_run=True)
coverage("examples/ceremonia/coverage-ledger.yaml")

Examples

Docs

OpenClaw Gateway Plugin

ClawSpec ships with a gateway plugin that registers the clawspec tool directly in the OpenClaw gateway. The plugin delegates all actions to the Python CLI via child_process.

Installation

Copy the extensions/clawspec/ directory into your OpenClaw workspace:

cp -r extensions/clawspec/ ~/.openclaw/extensions/clawspec/

Register the plugin in your ~/.openclaw/openclaw.json:

{
  "extensions": {
    "clawspec": {
      "repoRoot": "/path/to/your/workspace",
      "timeoutMs": 60000
    }
  }
}

Restart the gateway:

openclaw gateway restart

Plugin Configuration

Option	Type	Default	Description
`repoRoot`	string	`$OPENCLAW_WORKSPACE` or `cwd`	Workspace root containing the clawspec package
`pythonBin`	string	`python3`	Path to Python binary
`timeoutMs`	integer	`60000`	CLI execution timeout in milliseconds

Supported Actions

validate, run, init, coverage, baseline-capture, baseline-show, baseline-reset

See the openclaw.plugin.json for the full config schema.

ClawSuite

This package is part of ClawSuite — the OpenClaw agent infrastructure toolkit.

Package	Description	Repo
ClawPipe	Config-driven pipeline orchestration	austinmao/clawpipe
ClawSpec	Contract-first testing for skills & agents	austinmao/clawspec
ClawWrap	Outbound policy & conformance engine	austinmao/clawwrap
ClawAgentSkill	Skill discovery, scanning & adoption	austinmao/clawagentskill
ClawScaffold	Agent/skill scaffold interviews	austinmao/clawscaffold
ClawInterview	Pipeline interview compilation & execution	(coming soon)

All packages include OpenClaw gateway plugins for autonomous agent access.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
clawspec		clawspec
docs		docs
examples		examples
extensions/clawspec		extensions/clawspec
mcp		mcp
skill		skill
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ClawSpec

Install

Quick start

Opik observability integration

Baselines and regression detection

Using with ClawScaffold

Related Projects

Config

Python API

Examples

Docs

OpenClaw Gateway Plugin

Installation

Plugin Configuration

Supported Actions

ClawSuite

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ClawSpec

Install

Quick start

Opik observability integration

Baselines and regression detection

Using with ClawScaffold

Related Projects

Config

Python API

Examples

Docs

OpenClaw Gateway Plugin

Installation

Plugin Configuration

Supported Actions

ClawSuite

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages