Schema Reference

mbtiongson1 edited this page May 18, 2026 · 5 revisions

Authoritative field definitions for every JSON and Markdown schema in the repository. Canonical schema files live in `registry/schema/`.


`registry/nodes/` — Modular Skill Definitions (SOURCE OF TRUTH)

The canonical source of truth for all skills. Each skill is stored as an individual JSON file within domain-specific subdirectories (e.g., `registry/nodes/basic/`).

Skill Node Structure

```jsonc
{
  "id": "web-scrape",          // kebab-case, unique
  "name": "Web Scrape",        // Title Case display name
  "type": "extra",             // "basic" | "extra" | "ultimate" | "unique"
  "level": "3★",               // "0★" through "6★"
  "rarity": "common",          // "common" | "uncommon" | "rare" | "epic" | "legendary"
  "description": "...",        // one or two sentences, agent-agnostic
  "prerequisites": ["web-search", "parse-html", "extract-entities"],
  "derivatives": ["knowledge-harvest"],
  "conditions": "",            // optional prerequisite conditions
  "demerits": [                // optional; demotes effective rank by 1★ per entry
    "niche-integration"        // "niche-integration" | "experimental-feature" | "heavyweight-dependency"
  ],
  "evidence": [
    {
      "class": "B",            // "A" | "B" | "C"
      "source": "https://...",
      "evaluator": "github-username",
      "date": "2026-05-11",
      "notes": "..."
    }
  ],
  "knownAgents": [],           // GitHub usernames of agents with this skill
  "status": "provisional",     // "provisional" | "validated" | "disputed" | "deprecated"
  "createdAt": "2026-04-26",
  "updatedAt": "2026-05-11",
  "version": "0.1.0"
}
```
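The `demerits` comment says each entry demotes the effective rank by 1★. A minimal sketch of that arithmetic follows. The helper names `parse_level` and `effective_level` are hypothetical, and clamping the result at `0★` is an assumption, since the schema text does not say what happens below zero.

```python
def parse_level(level: str) -> int:
    """Turn a level string such as "3★" into the integer 3."""
    if not level.endswith("★"):
        raise ValueError(f"not a star level: {level!r}")
    stars = int(level[:-1])
    if not 0 <= stars <= 6:
        raise ValueError(f"level out of range 0★..6★: {level!r}")
    return stars


def effective_level(node: dict) -> str:
    """Subtract one star per demerit entry from the node's declared level."""
    base = parse_level(node["level"])
    demerits = node.get("demerits", [])  # the field is optional
    # Assumption: the effective rank never drops below 0★.
    return f"{max(0, base - len(demerits))}★"
```

Applied to the example node above (`"level": "3★"` with one demerit), this sketch yields `2★`.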

Type rules:

  • `basic` skills must have empty `prerequisites`.
  • `extra` skills must have ≥ 2 `prerequisites`.
  • `unique` skills must have empty `prerequisites`, be at `level` `4★` or above, and be graph-isolated (0 derivatives referenced by other nodes).
  • `ultimate` skills must have ≥ 3 `prerequisites` and require 3 Class A/B evidence sources.
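The type rules above can be sketched as a per-node check. This is an illustration, not the repository's validator: the function name `check_type_rules` is hypothetical, "3 Class A/B evidence sources" is read as "at least 3" (an assumption), and the graph-isolation rule for `unique` skills is left out because it needs the whole graph rather than a single node.

```python
def check_type_rules(node: dict) -> list[str]:
    """Return human-readable violations of the per-type rules (Python 3.9+)."""
    errors = []
    kind = node.get("type")
    prereqs = node.get("prerequisites", [])
    stars = int(node.get("level", "0★").rstrip("★"))

    if kind == "basic":
        if prereqs:
            errors.append("basic skills must have empty prerequisites")
    elif kind == "extra":
        if len(prereqs) < 2:
            errors.append("extra skills need at least 2 prerequisites")
    elif kind == "unique":
        if prereqs:
            errors.append("unique skills must have empty prerequisites")
        if stars < 4:
            errors.append("unique skills must be level 4★ or above")
        # Graph isolation is not checked here; it requires all nodes.
    elif kind == "ultimate":
        if len(prereqs) < 3:
            errors.append("ultimate skills need at least 3 prerequisites")
        strong = [e for e in node.get("evidence", []) if e.get("class") in ("A", "B")]
        # Assumption: "require 3" means three or more Class A/B sources.
        if len(strong) < 3:
            errors.append("ultimate skills need 3 Class A/B evidence sources")
    return errors
```

An empty list means the node passes these checks; the `web-scrape` example (type `extra`, three prerequisites) would pass.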

`registry/gaia.json` — Canonical Skill Graph (GENERATED)

This file is now an auto-generated artifact used for backward compatibility with the MCP server and external agents. Do not edit this file directly; changes will be overwritten by `scripts/assemble_gaia.py`.

Top-Level

| Field | Type | Description |
| --- | --- | --- |
| `version` | string | Semantic version of the graph |
| `generatedAt` | string | ISO timestamp of the last assembly |
| `skills` | Skill[] | Consolidated array of all skill nodes from `registry/nodes/` |
| `edges` | Edge[] | Generated prerequisite edges (derived from `prerequisites` in each node) |
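To illustrate how `edges` can be derived from each node's `prerequisites`, here is a minimal sketch. It is not the logic of `scripts/assemble_gaia.py`: the function name `derive_edges` and the edge keys `source` and `target` are hypothetical (the real edge shape is defined in `registry/schema/combination.schema.json`), and rejecting unknown prerequisite IDs is an assumption.

```python
def derive_edges(skills: list[dict]) -> list[dict]:
    """Build one edge per (prerequisite -> skill) pair (Python 3.9+)."""
    known_ids = {skill["id"] for skill in skills}
    edges = []
    for skill in skills:
        for prereq in skill.get("prerequisites", []):
            # Assumption: a prerequisite that names no existing node is an error.
            if prereq not in known_ids:
                raise ValueError(f"{skill['id']} references unknown skill {prereq!r}")
            # Hypothetical edge shape; see combination.schema.json for the real one.
            edges.append({"source": prereq, "target": skill["id"]})
    return edges
```

Because the edges are a pure function of the nodes, regenerating the graph from `registry/nodes/` keeps the two from drifting apart.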

`registry/named/{contributor}/{skill}.md` — Named Skill

YAML frontmatter defines the named skill; the Markdown body is human-readable documentation.

```yaml
---
id: karpathy/autoresearch                   # contributor/skill-name
name: AutoResearch                          # display name
contributor: karpathy                       # GitHub username
origin: true                                # true = first in this bucket
genericSkillRef: autonomous-research-agent  # must exist in registry/nodes/
status: awakened                            # "awakened" | "named"
level: 4★                                   # 2★–6★ (never 0★ or 1★)
description: >
  Autonomous research agent that iteratively searches, reads, and
  synthesizes academic papers into structured summaries.
links:
  github: https://github.com/karpathy/autoresearch
  docs: https://...
createdAt: 2026-04-29
updatedAt: 2026-05-11
---

## Overview

...
```

Validation rules (`registry/schema/namedSkill.schema.json`):

  • `id` must match pattern `^[a-z0-9][a-z0-9_-]*/[a-z0-9][a-z0-9_-]*$`.
  • `genericSkillRef` must resolve to a valid skill node ID.
  • At most one `origin: true` per `genericSkillRef` bucket.
  • `level` must be `2★` or above.
  • `status: awakened` for contributor submissions; only reviewers set `status: named`.
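The validation rules above can be sketched in Python on already-parsed frontmatter dictionaries. This is an illustration under stated assumptions, not the contents of `namedSkill.schema.json`: the function names are hypothetical, and the `id` pattern is written with a `*` quantifier after each character class so that an ID such as `karpathy/autoresearch` matches. The reviewer-only rule for `status: named` is a process rule and is not checked.

```python
import re
from collections import Counter

# Assumption: each segment is one leading [a-z0-9] followed by zero or more [a-z0-9_-].
ID_PATTERN = re.compile(r"^[a-z0-9][a-z0-9_-]*/[a-z0-9][a-z0-9_-]*$")


def _stars(level) -> int:
    """Parse "4★" into 4; return -1 when the value is not a star level."""
    try:
        return int(str(level).rstrip("★"))
    except ValueError:
        return -1


def check_named_skill(frontmatter: dict, node_ids: set) -> list:
    """Check one named skill against the id, reference, and level rules."""
    errors = []
    if not ID_PATTERN.match(str(frontmatter.get("id", ""))):
        errors.append("id does not match contributor/skill-name pattern")
    if frontmatter.get("genericSkillRef") not in node_ids:
        errors.append("genericSkillRef does not resolve to a skill node")
    if _stars(frontmatter.get("level", "")) < 2:
        errors.append("level must be 2★ or above")
    return errors


def duplicate_origins(all_frontmatter: list) -> list:
    """Return genericSkillRef buckets claimed by more than one origin: true."""
    counts = Counter(
        fm["genericSkillRef"] for fm in all_frontmatter if fm.get("origin") is True
    )
    return sorted(ref for ref, n in counts.items() if n > 1)
```

`check_named_skill` covers the per-file rules, while `duplicate_origins` needs every named skill at once because the `origin: true` limit applies per bucket.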

`registry/named-skills.json` — Named Skill Index (Generated)

Do not edit this file directly; it is regenerated by `scripts/generateNamedIndex.py`.


`registry-for-review/skill-batches/{id}.json` — Skill Batch

Written by `gaia push`. One batch per invocation.


`.gaia/config.json` — Project Config


`skill-trees/{username}.json` — User Skill Tree


Edge Schema (Legacy/Generated)

Defined in `registry/schema/combination.schema.json`. Edges are no longer a source of truth; they are generated from the skill nodes for the consolidated graph.


Schema Metadata

`registry/schema/meta.json` is the single source of truth for nomenclature, theming, and validation rules.
