Trident

One agent skill. Three adversarial minds.

"As iron sharpens iron, so one person sharpens another." — Proverbs 27:17

Why "Trident"?

A trident has three prongs — none works alone. Remove one and it's just a stick. Trident applies the same idea to AI agents: Generator, Discriminator, and Arbiter — three roles that hold each other accountable. No single agent can declare its own work "done."

The Problem

You ask your AI agent to build something. It does. You ask it to review its own work. It says "looks good." You ship it. Then production breaks.

This keeps happening because:

What you expect	What actually happens
"Handle all edge cases"	Agent handles the happy path, skips the rest
"Write production-quality code"	No input validation, no error boundaries
"Review your work"	Agent reviews its own code and finds nothing wrong
"Fix the bug"	Agent fixes one bug, introduces two more

The root cause: an AI agent reviewing its own work is like a student grading their own exam. The incentive is to pass, not to find problems.

The Fix

Let a second AI agent grade the exam. And a third one make sure neither is cheating.

Trident turns one agent's monologue into a three-way adversarial review. The Generator builds. The Discriminator attacks. The Arbiter keeps them honest. Your code ships only when all three agree it's ready.

Works with any model. Even non-tier-1 models produce senior-level output with Trident — because quality comes from the process, not just the model.

Installation

For Humans

Copy and paste this to your AI agent (Claude Code, OpenCode, Cursor, etc.):

Install Trident design review skill by following these instructions:
https://raw.githubusercontent.com/0xFiMo/trident/main/docs/installation.md

Or just run the installer yourself:

# Linux / macOS / WSL
git clone https://github.com/0xFiMo/trident.git /tmp/trident
cd /tmp/trident && ./install.sh

# Windows (PowerShell)
git clone https://github.com/0xFiMo/trident.git $env:TEMP\trident
cd $env:TEMP\trident; .\install.ps1

For AI Agents

Fetch and follow:

curl -s https://raw.githubusercontent.com/0xFiMo/trident/main/docs/installation.md

Supported Platforms

Platform	Status
Claude Code	Supported
OpenCode	Supported
Any agent with SKILL.md support	Compatible

OS	Status
macOS	Supported (`heartbeat.sh`)
Linux	Supported (`heartbeat.sh`)
Windows (WSL)	Supported (`heartbeat.sh`)
Windows (native)	Supported (`heartbeat.ps1`)

Commands

/tri — sounds like "try." Every great creation starts with a try.

Command	What It Does
`/tri new "description"`	Try a new design. `Generator` designs, `Discriminator` scores, `Arbiter` verifies. Iterates until all dimensions >= 9.
`/tri apply`	Try to build it. Three Strikes — 3 rounds of `Generator` + `Discriminator` + `Arbiter` verification.
`/tri archive`	Done trying. Archive it and extract what you learned.
`/tri status`	Check what you're trying and what's done.
`/tri models`	Configure which models power each role.
`/tri auto "description"`	Full cycle — design + implement in one go.

The lifecycle is strictly sequential. The agent cannot skip ahead:

Does It Actually Work?

Same model (Kimi K2.5). Same prompt. Same task. Only difference: one used Trident, one didn't.

Without Trident — agent delivers code with 6 hidden issues. Tests pass. Looks fine. Ships broken.

With Trident — Discriminator catches all 6 in adversarial review. Fixed before a single line ships.

Tests catch logic bugs. Trident catches everything tests can't:

	Tests alone	+ Trident
Input validation gaps	❌	✅
Encapsulation leaks	❌	✅
API design flaws	❌	✅
Concurrency semantics	❌	✅

Full benchmark data with reproduction steps →

See It In Action

Weather animation built with MiniMax M2.7 — left is the original code, right is after Trident optimization:

trident_weather_compare_small.mp4

How It Works

The Triforce — Three Roles

Role	Analogy	Memory	What It Actually Does
`Generator`	GAN Generator	Persistent	Explores codebase. Produces designs with root cause analysis, state transition tables, change surface estimates. Implements code. Self-audits against its own spec.
`Discriminator`	GAN Discriminator	Session continuity	Scores every design across all applicable dimensions. Cites specific methods, line numbers, data flow. Classifies issues as MUST FIX or NICE TO HAVE. Accumulates knowledge — never re-checks what it already verified.
`Arbiter`	Independent evaluator	None (always fresh)	Zero context, zero bias. Checks if `Generator` and `Discriminator` are colluding. Catches blind spots neither addressed. Can override READY if convergence looks artificial.

Why three, not two? A Generator + Discriminator pair converges too easily. The Discriminator gets lenient after watching the Generator improve. The Arbiter prevents this — fresh every time, no sympathy.

Scoring Dimensions

Every design is scored across 5 core dimensions. All must reach >= 9/10:

Dimension	What It Measures
Correctness	Logic correct, all scenarios, boundaries, no crash on any input
Safety	Defensive input validation, fail-safe, backward compat
Testability	Test coverage, edge cases, verification coverage
Minimality	Minimal change surface
Conventions	Matches existing codebase patterns

For tasks that produce visual output (HTML, CSS, UI, animations), two additional dimensions apply:

Dimension	What It Measures
Visual Quality	Aesthetics, animation smoothness, color coherence, responsive behavior
Creative Impact	Originality, memorability, avoids generic template output

Missing input validation? MUST FIX, never NICE TO HAVE. If any input can crash your code, Safety cannot be >= 9/10.

Design Phase (`/tri new`)

Generator and Discriminator iterate until all applicable dimensions pass. No round limit.

Implementation Phase (`/tri apply`)

Three Strikes — three rounds, then escalate to human:

Round	Roles	On `Discriminator` Pass	On `Discriminator` Fail
1	`Generator` implements + `Discriminator` reviews	`Arbiter` verifies → done or keep fixing	Round 2
2	`Generator` fixes + `Discriminator` re-reviews (same session)	`Arbiter` verifies → done or keep fixing	Round 3
3	`Generator` + `Arbiter` collaborate	Done	Human-in-the-loop escalation

Key rule: Discriminator FAIL consumes a round. Arbiter FAIL does NOT — Generator fixes and re-submits within the same round.

Round 3 exhausted? Generator, Discriminator, and Arbiter each submit their perspective. The human decides.

Archive Phase (`/tri archive`)

Done? Archive it. Trident moves the working files to .trident/archive/ and asks one question:

"This review uncovered insights about [domain]. Want me to create a skill?"

If yes — the agent distills root causes, design decisions, and pitfalls into a reusable skill. Your agent gets smarter with every review.

Model Configuration (`/tri models`)

Not all roles need the same brain. The Generator writes code — a fast, affordable model works fine. The Discriminator and Arbiter only review — a stronger model catches more issues. Use /tri models to configure each role independently:

/tri models

  Trident Model Configuration:

  | Role          | Job                             | Model                            |
  |---------------|---------------------------------|----------------------------------|
  | Generator     | designs and implements code     | minimax-coding-plan/MiniMax-M2.7 |
  | Discriminator | scores and reviews (the critic) | anthropic/claude-opus-4-6        |
  | Arbiter       | independent final check         | anthropic/claude-opus-4-6        |

This isn't just about cost — it's about blind spot diversity. If all three roles use the same model, they share the same weaknesses. A bug that one model can't see, all three will miss. Different models have different blind spots — what MiniMax overlooks, Claude might catch, and vice versa.

One Command, Full Cycle (`/tri auto`)

Don't want to run /tri new and /tri apply separately? Use /tri auto — it runs the full design-to-implementation cycle in one go, then stops for you to review before archiving.

/tri auto "fix memory leak on disconnect"

→ Design phase (Generator ↔ Discriminator ↔ Arbiter) → READY
→ Implementation phase (Three Strikes) → Done
→ Waits for you to /tri archive

Design Philosophy

Three Pillars

Pillar	Origin	What It Prevents
GAN	Generative Adversarial Networks	Complacency — `Generator` can't ship until `Discriminator` is satisfied, `Discriminator` can't rubber-stamp because `Arbiter` is watching
Three Strikes	Baseball + Chinese proverb	Infinite loops — three rounds max, then escalate to human
Triforce	Three-role balance	Collusion — `Generator` and `Discriminator` can't agree to lower standards because `Arbiter` arrives fresh with no history

Space for Memory

AI agents forget. Context windows fill up. Sessions expire. Trident solves this by trading disk space for memory — each role writes its knowledge to markdown files that persist across sessions, rounds, and even platform restarts.

These files serve three purposes at once:

Agent memory — Generator, Discriminator, and Arbiter recover context from their own files instead of relying on session state
Role isolation — each role can only read the other's file, never write to it. No contamination.
Human auditability — open any .md file to see exactly what each role thought, scored, and decided. No black box.

.trident/{task-slug}/
├── generator.md        ← Generator's memory: design + version history + feedback
├── discriminator.md    ← Discriminator's memory: verified facts, patterns, blind spots
├── tasks.md            ← Implementation checklist (created by /tri apply)
├── apply-log.md        ← Round log with dimension scores
└── .done               ← Signal file for background agent completion

File	Who Writes	Who Reads	Survives Session Loss?
`generator.md`	`Generator`	All roles	Yes — full design history
`discriminator.md`	`Discriminator`	All roles	Yes — `Discriminator` rebuilds context from this
`tasks.md`	`Generator`	`Generator`	Yes — resume from last checkbox
`apply-log.md`	`Generator`	All roles	Yes — round scores preserved
`.done`	`Discriminator` or `Arbiter`	heartbeat.sh	Transient — deleted before each invocation

Contributing

PRs welcome. Test with both Claude Code and OpenCode before submitting.

Support ⭐☕

If Trident made your agent smarter, three things that help the most:

Star this repo — it helps others discover Trident
Share it — tell your team, post it, spread the word
Follow @FiMoTW on X — updates, tips, and new features

And if you're feeling generous, buy me a coffee:

Network	Address
	`0xB95f5EC545F1650Ce43200F016A0C92A56C36513`
	`7KZ7KBVCKAgX7TCAMRQyFctaVbKtTaAPVjoEfQKqUPFt`

Star History

References

GAN: Goodfellow, I.J. et al. (2014). Generative Adversarial Networks. arXiv:1406.2661 — the adversarial tension between Generator and Discriminator that inspired Trident's core mechanic.
Three Strikes: Three-strikes law — borrowed from baseball ("three strikes and you're out"), codified in law as automatic escalation after three offenses. Rooted in the Chinese proverb 事不過三 ("things shall not exceed three"), popularized by Wu Cheng'en's Journey to the West (c. 1592) — where Sun Wukong's battles famously follow a three-attempt pattern. The proverb teaches perseverance through three tries, but also the wisdom to escalate when three attempts aren't enough. Trident applies this: three rounds of review, then the human decides.
Triforce: Triforce — three golden triangles representing Power, Wisdom, and Courage that must be in balance. In Trident: Generator (creates), Discriminator (evaluates), Arbiter (balances).
Kimi K2.5: Kimi K2.5 by Moonshot AI — the model used in our benchmark experiments.
MiniMax M2.7: MiniMax M2.7 by MiniMax — the model used in the weather animation demo.

License

MIT LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
agents		agents
benchmarks		benchmarks
commands		commands
docs		docs
scripts		scripts
skills/trident		skills/trident
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
install.ps1		install.ps1
install.sh		install.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Trident

Why "Trident"?

The Problem

The Fix

Installation

For Humans

For AI Agents

Supported Platforms

Commands

Does It Actually Work?

See It In Action

How It Works

The Triforce — Three Roles

Scoring Dimensions

Design Phase (`/tri new`)

Implementation Phase (`/tri apply`)

Archive Phase (`/tri archive`)

Model Configuration (`/tri models`)

One Command, Full Cycle (`/tri auto`)

Design Philosophy

Three Pillars

Space for Memory

Contributing

Support ⭐☕

Star History

References

License

About

Uh oh!

Releases 6

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Trident

Why "Trident"?

The Problem

The Fix

Installation

For Humans

For AI Agents

Supported Platforms

Commands

Does It Actually Work?

See It In Action

How It Works

The Triforce — Three Roles

Scoring Dimensions

Design Phase (/tri new)

Implementation Phase (/tri apply)

Archive Phase (/tri archive)

Model Configuration (/tri models)

One Command, Full Cycle (/tri auto)

Design Philosophy

Three Pillars

Space for Memory

Contributing

Support ⭐☕

Star History

References

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Design Phase (`/tri new`)

Implementation Phase (`/tri apply`)

Archive Phase (`/tri archive`)

Model Configuration (`/tri models`)

One Command, Full Cycle (`/tri auto`)

Packages