ToolMark 🔨

ESLint + Jest + npm publish — for AI Agent Tools.

Build, test, scan, and ship tools across OpenClaw/ClawHub, Claude Code, Cursor, and Windsurf — from a single CLI.

Why ToolMark?

13,000+ tools are published on ClawHub. 13% contain critical security flaws (Snyk ToxicTools Report, Feb 2026). Tools break silently on platforms other than the one they were tested on. There is no pytest for agent tools — until now.

toolmark init my-tool --template github-api
toolmark test          # LLM-as-judge evaluation
toolmark scan          # prompt injection, dynamic fetch, credential leaks
toolmark compat        # check all 4 platforms at once
toolmark publish       # sign with Ed25519, push to ClawHub + Claude Code

Install

pip install toolmark

Requires Python 3.12+.

Quick Start

# 1. Scaffold
toolmark init my-github-tool --template github-api

# 2. Edit tool.md and tests/
cd my-github-tool

# 3. Test
ANTHROPIC_API_KEY=sk-ant-... toolmark test

# 4. Scan
toolmark scan

# 5. Check platform compatibility
toolmark compat

# 6. Publish
toolmark publish --platforms clawhub,claude-code

Commands

Command	What it does
`toolmark init`	Scaffold a new tool from a template
`toolmark test`	LLM-as-judge evaluation against YAML test cases
`toolmark scan`	Security scanner (prompt injection, dynamic fetch, creds)
`toolmark compat`	Cross-platform compatibility check (4 platforms)
`toolmark bench`	Benchmark latency, tokens, compute quality score (0–100)
`toolmark publish`	Sign with Ed25519, publish to configured registries

Templates

toolmark init my-tool --template github-api      # GitHub REST API wrapper
toolmark init my-tool --template file-ops         # Local filesystem tool
toolmark init my-tool --template mcp-integration  # Wraps an MCP server tool
toolmark init my-tool --template web-search       # Search API tool
toolmark init my-tool --template loom-query       # Loom knowledge graph tool
toolmark init my-tool --template blank            # Minimal scaffold

Test Cases (YAML)

# tests/test_search.yaml
- id: search_open_prs
  input: "find my open pull requests"
  expect_invoked: true
  expect_tool: search_pull_requests
  expect_params:
    state: open
    assignee: "@me"
  tolerance: fuzzy     # strict | fuzzy | invoked
  tags: [smoke]

Run: toolmark test --tags smoke

Security

toolmark catches:

SF001 — Dynamic fetch (curl | bash, eval(fetch(...)))
SF002 — Hardcoded credentials (API keys, passwords)
SF003 — Prompt injection phrases in tool descriptions
SF004 — Undeclared network endpoints
SNYK-* — 138 rules via Snyk agent-scan (if installed)

Provenance Signing

Every published tool is signed with Ed25519:

toolmark keygen              # creates ~/.toolmark/signing.key
toolmark publish --sign      # signs + publishes
toolmark verify my-tool     # verify any published tool

GitHub Actions

Every toolmark init project includes a ready-to-use workflow:

# .github/workflows/toolmark.yml — already in your project
- toolmark compat    # platform check
- toolmark scan      # security gate
- toolmark test      # LLM evaluation (needs ANTHROPIC_API_KEY secret)

Quality Leaderboard

See how your tool ranks: toolmark.dev/leaderboard

Quality Score = test pass rate (50%) + security score (30%) + compat score (20%).

Roadmap

Contributing

See CONTRIBUTING.md. We always have good first issues.

License

MIT — see LICENSE.

Built by @ddevilz as part of the Loom AI tooling ecosystem.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github		.github
templates		templates
tests		tests
toolmark		toolmark
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
BADGES.md		BADGES.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ToolMark 🔨

Why ToolMark?

Install

Quick Start

Commands

Templates

Test Cases (YAML)

Security

Provenance Signing

GitHub Actions

Quality Leaderboard

Roadmap

Contributing

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ToolMark 🔨

Why ToolMark?

Install

Quick Start

Commands

Templates

Test Cases (YAML)

Security

Provenance Signing

GitHub Actions

Quality Leaderboard

Roadmap

Contributing

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages