skill-check

Skill contract testing for the agentskills.io ecosystem.

skill-check is an open testing framework for verifying that a skill works the way it claims.

Today, the published CLI ships the offline lint pillar. The broader framework is designed around three pillars:

Pillar	Command	What it catches
Lint	`skill-check lint`	Schema violations, missing files, insecure patterns — fully offline
Scan	`skill-check scan`	Planned: LLM-powered semantic security analysis (prompt injection, tool poisoning)
Behavior	`skill-check assert`	Planned: trace-based assertions for hooks, order, and outcomes

The repo already includes the draft trace/assertion specs and example suites for the planned scan, assert, and record commands, but those commands are not implemented in 0.1.0.

Install

npm install -g @mnvsk97/skill-check
# or run without installing:
npx @mnvsk97/skill-check lint ./my-skill

The installed binary is still skill-check.

Quick start

# Lint a skill directory (SKILL.md must be present)
skill-check lint ./my-skill

# Lint with JSON output for CI
skill-check lint ./my-skill --format json

Example output

✖  ERROR  SKILL.md            `name` field is missing  (schema.missing_name)
⚠  WARN   SKILL.md            description is only 8 chars — aim for 20+  (desc.too_short)
✖  ERROR  scripts/deploy.sh   Script is not marked executable  (scripts.not_executable)
⚠  WARN   SKILL.md            Toxic-flow: skill has read + write + network tools  (security.toxic_flow)

  FAILED  2 errors, 2 warnings
  Skill root: /path/to/my-skill

Release

Publishes are handled by GitHub Actions.

Use Node 22+ locally (nvm use 22).
Bump the version in package.json using npm version patch|minor|major.
Push the commit and tag with git push origin main --follow-tags.

The publish workflow runs on v* tags, rebuilds the package, runs tests, verifies npm pack --dry-run, and then publishes to npm. The NPM_TOKEN repository secret must be configured in GitHub before the first release.

Commands

`lint [path]`

Runs static checks against a skill. No API keys required.

skill-check lint [path] [options]

Arguments:
  path          Skill directory or SKILL.md path (defaults to cwd)

Options:
  -f, --format  Output format: pretty (default) | json
  --no-security Skip security checks

Exit code: 0 = passed (errors only; warnings don't fail), 1 = one or more errors.

`scan [path]` (planned)

LLM-powered security scan. Uses Claude to detect prompt injection, tool poisoning, and split-file attacks that static patterns miss. Requires ANTHROPIC_API_KEY.

`assert <suite>` (planned)

Validates a trace file against a YAML assertion suite. Used in CI after recording a live run.

`record <suite>` (planned)

Runs a skill against a real agent and captures a normalized trace. The trace can then be replayed with assert.

Lint rules

Six rule families covering schema, description quality, file references, scripts, and security. See docs/lint-rules.md for the full reference.

Family	What it checks
`schema.*`	Frontmatter structure (name, description, allowed-tools)
`desc.*`	Description quality and body content
`files.*`	File reference existence and path safety
`scripts.*`	Executable bit, shebang, non-empty
`security.*`	Injection patterns, secrets, tool capability analysis

Assertion suite format

Assertion suites are YAML files validated against spec/assertion-schema.json.

version: "0.1"
suite: "my-skill-tests"

tests:
  - id: "happy_path"
    kind: "end_to_end"
    prompt: "Do the thing"
    steps:
      - "preflight"
      - "action"
    outcome: "pass"

See examples/cloud-deploy/suite.yaml for a full working example.

Trace format

Traces are JSON files following spec/trace-schema.json. Events are grouped into eight families:

lifecycle.* — harness events (install, sandbox, test lifecycle)
skill.* — routing events (discovered, matched, activated)
hook.* — workflow step events (started, succeeded, failed, skipped)
tool.* — tool/function call events
command.* — shell command events
file.* — filesystem events
api.* — external API call events
outcome.* — final result events

See spec/event-families.md for the full reference.

Contributing

See CONTRIBUTING.md.

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.github/workflows		.github/workflows
docs		docs
examples/cloud-deploy		examples/cloud-deploy
scripts		scripts
skill		skill
spec		spec
src		src
.gitignore		.gitignore
.npmignore		.npmignore
.nvmrc		.nvmrc
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
jest.config.js		jest.config.js
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

skill-check

Install

Quick start

Example output

Release

Commands

`lint [path]`

`scan [path]` (planned)

`assert <suite>` (planned)

`record <suite>` (planned)

Lint rules

Assertion suite format

Trace format

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

skill-check

Install

Quick start

Example output

Release

Commands

lint [path]

scan [path] (planned)

assert <suite> (planned)

record <suite> (planned)

Lint rules

Assertion suite format

Trace format

Contributing

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`lint [path]`

`scan [path]` (planned)

`assert <suite>` (planned)

`record <suite>` (planned)

Packages