Skip to content

MohammadBafkar/polymath

Repository files navigation

Polymath

A public, open-source Claude Code marketplace of work-shaped plugins covering the full lifecycle of building software products — from idea to incident, from thinking-craft to platform engineering. Installable a-la-carte.

The marketplace

43 plugins across foundation, mind & craft, product & discovery, engineering, quality & safety, platform & operate, infra, people & content, connectors, orchestration, and authoring. 17 workflows that compose those plugins into proven SDLC scenarios — activateProject, deliberationLoop, shipFeature, reviewPR, respondToIncident, bugTriage, perfRegression, refactorWithSafety, securityFinding, bumpDependency, migrateLanguageVersion, sunsetCapability, featureFromIdea, experimentToGA, weeklyReleaseTrain, incidentRetroToActions, deprecationToRemoval.

Install only what you need. Per-plugin always-on listing cost stays under 400 tokens. Most teams install 5–10.

Quick start

# 1. Add the marketplace from this checkout (or its GitHub URL)
claude plugin marketplace add /path/to/polymath

# 2. Install a starter set
claude plugin install \
  polymath-core@polymath polymath-engineering@polymath \
  polymath-release@polymath polymath-flows@polymath

# 3. Run a workflow
claude
> /polymath-flows:run-workflow shipFeature title="Rate-limit /login" scope=small

For a new target repository, run /polymath-core:init-project or /polymath-flows:run-workflow activateProject first. It creates .polymath/project.yaml, maps known capability providers, and writes docs/polymath-onboarding.md so agents know the stack, conventions, required tools, environment variables, recommended plugins, and first steps. See docs/polymath-onboarding.md.

The full catalog is published as the marketplace's GitHub Pages site under docs/site/.

Catalog by tier

  • Foundationpolymath-core (conventions, glossary, project initialization, project-context loader, plugin-budget, SessionStart hook).
  • Mind & craftpolymath-thinking, polymath-planning, polymath-writing, polymath-decisions, polymath-learning.
  • Product & discoverypolymath-product, polymath-research, polymath-design.
  • Engineeringpolymath-engineering, polymath-frontend, polymath-backend, polymath-mobile, polymath-data, polymath-ai.
  • Quality & safetypolymath-qa, polymath-security, polymath-performance.
  • Platform & operatepolymath-platform, polymath-devops, polymath-sre, polymath-observability, polymath-incident, polymath-release.
  • Infrapolymath-infra-kubernetes, polymath-infra-postgres, polymath-infra-cloud.
  • People & contentpolymath-communication, polymath-leadership, polymath-content.
  • Connectors (MCP + hooks)polymath-connector-github, -jira, -linear, -pagerduty, -datadog, -snyk, -slack, -sentry, -statuspage, -terraform, -monitoring.
  • Orchestrationpolymath-flows.
  • Authoringpolymath-author.

For language-specific depth (.NET, Python, Laravel, …), Polymath defers to external skill catalogs such as dotnet/skills and Laravel Boost. .polymath/project.yaml declares the external sources the project recommends and polymath-engineering:verify-change adapts to the project's language at runtime.

Design principles

  • Work-shaped plugins. One role / lifecycle stage / target per plugin. Not primitive-shaped catch-alls.
  • Token budget discipline. ≤ 400 tokens always-on per plugin; the CLI's claude plugin details is the authoritative measurement, the heuristic in tools/token-budget.sh is informational.
  • Commands vs. skills. Skills bundle templates / scripts / references. Commands are thin aliases (≤ 20 lines).
  • Agents only when justified. Reserved for forked context or panel critique.
  • Workflows are YAML. Driven by a deterministic executable (polymath-flows/bin/polymath-flow) that owns validation, state, and mustPass checks. The skill drives the loop; the script owns state.
  • Deterministic enforcement. mustPass: types: fileExists, fileMatches, commandSucceeds, stepSummaryMatches, command, artifactValid, artifactSchemaStrict, diffConstraint. AI cross-checks are advisory.
  • Capability abstraction. Workflows declare what they need (issue tracker, observability, pager, vulnerability scanner) via requires.capabilities. Projects pick the provider once in .polymath/capabilities.yaml. See docs/CAPABILITIES.md.
  • Project localization. .polymath/project.yaml describes the project's stack, conventions, external skill catalogs, and per-skill overrides. The SessionStart hook loads it and exposes a resolved snapshot to every skill. See docs/PROJECT-LOCALIZATION.md.
  • Each plugin owns its templates. Artifact templates live under plugins/<plugin>/templates/. Frontmatter on canonical artifacts (PRD, ADR, Plan, RFC, Runbook, ArchitectureDoc, DACIDecision, TradeoffMatrix, Postmortem, ThreatModel, PRDescription) is gated by the matching JSON schema.
  • Connectors share a shape. .mcp.json for the upstream MCP server, userConfig for credentials (sensitive: true), hooks for event-driven reactions, a references/<service>-tools.md doc.
  • No native scheduler. Recurring work lives in external schedulers (Cloud Routines, GitHub Actions, OS cron) that write to a queue file the polymath-core SessionStart hook surfaces.
  • No telemetry. See docs/PRIVACY.md.
  • agentskills.io v1.0 compatible. Skills are portable across any harness implementing the open standard (Claude Code, Codex, Cursor, Gemini CLI, Goose, JetBrains Junie).

Repo layout

polymath/
├── .claude-plugin/marketplace.json
├── .polymath/                          # project-context + capabilities examples
├── plugins/                            # one directory per plugin
│   └── polymath-<name>/
│       ├── .claude-plugin/plugin.json
│       ├── skills/<skill>/SKILL.md
│       ├── templates/                  # plugin-owned artifact templates
│       ├── commands/<cmd>.md
│       ├── agents/<role>.md
│       ├── hooks/                      # event-driven hook scripts
│       ├── .mcp.json                   # connector plugins only
│       ├── workflows/*.yaml            # polymath-flows only
│       ├── bin/                        # bundled executables (polymath-flow, scaffolders)
│       ├── README.md, CHANGELOG.md
│       └── tests/
├── shared/schemas/                     # workflow + artifact + capability + project + conformance schemas
├── tools/                              # scaffolders, validators, conformance, catalog, bakeoff, token analyzer
├── tests/
│   ├── golden/                         # fixtures per plugin / workflow
│   ├── bakeoff/                        # per-plugin/scenario bakeoff cases
│   └── skill-triggering/               # naive-prompt triggering tests per skill
├── docs/                               # CAPABILITIES, PROJECT-LOCALIZATION, PLUGIN-AUTHORING, WORKFLOW-SCHEMA, …
└── .github/workflows/                  # validate, token-budget, lint, link-check, golden-tests, evaluation, pages, release

See docs/PLUGIN-AUTHORING.md and docs/WORKFLOW-SCHEMA.md.

Gates

Every change runs locally and in CI:

  • tools/validate-all.shclaude plugin validate --strict at marketplace root + per plugin; catches version drift between a marketplace entry and its plugin.json.
  • tools/lint-skills.sh — description ≤ 200 chars, SKILL.md ≤ 500 lines.
  • tools/token-budget.sh — per-plugin cap of 400 tokens; total target scales with plugin count.
  • tools/conformance.sh --all — structural check, including MANIFEST-3 (maturity tier in shared/polymath-catalog.json), CONNECTOR-2 (connector / infra plugins audited in docs/CONNECTOR-POLICY.md), SKILL-1, TEMPLATE-1, WORKFLOW-1, FIXTURE-1. The cross-check via tools/check-catalog.py verifies plugin sets and versions agree across marketplace.json, every plugin.json, and shared/polymath-catalog.json.
  • tools/build-catalog.py --check — verifies the GitHub Pages catalog regenerates reproducibly.
  • plugins/polymath-flows/bin/polymath-flow validate — every workflow YAML against the schema.
  • python3 -m unittest discover -s plugins/polymath-flows/tests and -s plugins/polymath-core/tests — executable unit tests.
  • tools/bakeoff.py check — baseline-vs-Polymath quality cases are parseable; optional --judge mode scores with an LLM judge.
  • tools/skill-triggering.py check — every skill-triggering test's frontmatter is valid.

The claude-cli-fixtures job runs tests/golden/run-fixtures.sh against the Claude Code CLI on every push to main. The job requires CLAUDE_CODE_OAUTH_TOKEN (preferred — Claude.ai subscription) or ANTHROPIC_API_KEY set in repo secrets, and hard-fails main-branch pushes when no auth is present. PR jobs without secrets (e.g. fork PRs) emit a warning and skip the live run.

Quality

Contributing

See docs/CONTRIBUTING.md and polymath-author — the meta-plugin that scaffolds new plugins, validates them against every gate, reviews SKILL.md quality, and measures the listing-token cost.

Use outside Claude Code

Polymath skills are written to the agentskills.io v1.0 standard. Export the 126 skills to a portable bundle and drop them into Codex CLI, Cursor, GitHub Copilot, VS Code, Gemini CLI, Goose, JetBrains Junie, and other listed clients:

python3 tools/export-agents-skills.py --clean
# → dist/agents-skills/<plugin>-<skill>/SKILL.md + manifest.json

See docs/PORTABILITY.md for the drop-location per harness and the honest list of surfaces that do not port (commands, hooks, MCP config, workflows, agents).

License

MIT.

About

No description, website, or topics provided.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors