yagent

「AIに bash 渡しとけば賢くやるでしょ」をやめた yagent は Bash tool を置かず task catalog/file permission/MCP wrappingでコンテキスト問題対策/subagent orchestration を最初から設計に入れた、LLMを信用しない 前提の coding agent。でも task や agent 追加は簡単! https://github.com/yappo/yagent/

yagent のユニークなポイント

yagent は、「LLM に bash を渡しておけば勝手に賢くやる」という発想ではなく、「LLM は便利だが、そのまま広い権限を与えるには信用しきれない」という前提から設計した coding agent です。そのため、あえて汎用の Bash tool は持たせていません。自由な shell 実行は強力ですが、モデルの誤解や暴走がそのまま危険操作につながりやすく、実運用では不安要素にもなります。yagent はそこを便利さで押し切らず、最初から危険な操作を簡単には通さない構造を選んでいます。

ただし、bash を消すだけでは不便になります。そこで yagent では、必要な作業を tasks.toml の task catalog として追加できるようにし、ビルド、テスト、検証、補助コマンド、MCP server の bind などを、安全性を意識した独自ツール経由で扱えるようにしています。つまり「何でも shell でやらせる」のではなく、「このリポジトリでは何を許可するか」を宣言的に定義して実行する形です。安全側に倒しつつ、運用上必要な task は素直に足せるので、窮屈な固定製品ではなく、現場仕様に育てやすい構成になっています。

ファイルアクセスも同じ思想です。広く自由に読める・書ける方が短期的には便利ですが、実際にはそれがそのまま不安につながります。yagent ではアクセス可能なパスを明示的に絞り込みつつ、作業の流れを止めすぎないよう permission UI を通じて問題ない範囲だけ段階的に緩められます。厳しく閉じるだけでもなく、最初から全部開けるだけでもなく、現場で扱いやすい粒度で制御できることを重視しています。

MCP Server についても、そのまま無造作に露出させるのではなく、task catalog と bind フローの中で独自にラップしています。これは安全性のためだけではありません。MCP は便利な一方で、server や tool の情報を常時抱え込むとコンテキストを食いやすく、必要のない情報まで LLM に渡し続ける構造になりがちです。yagent では必要なときにだけ task_bind で server を bind し、必要な tool だけを公開できるので、コンテキスト消費を抑えながら拡張性も確保しやすくなっています。

さらに yagent は、単一 agent に全部背負わせるのではなく、subagent / multiagent 前提で設計されています。manager planner researcher coder tester reviewer のような役割分担をベースに、intake -> plan -> execute -> verify -> recover -> finalize の harness で仕事を分け、必要なら ephemeral agent もその場で生成できます。最近の runtime では会話 summary ではなく typed artifact / reusable observation / workspace snapshot を主記憶に寄せているので、長めの run でも次の agent に必要な事実だけを scoped packet として渡しやすくなっています。

要するに yagent は、単なる「TUI から LLM を叩くツール」でも、単なる「自律実行が派手な agent」でもありません。Bash を安易に渡さず、task catalog・file permission・MCP wrapping・subagent orchestration を最初から設計に入れたうえで、それでも task や agent は簡単に拡張できるようにした、“LLM を信用しない” 前提の実運用志向 coding agent です。

特徴

yagent --config <file> でそのまま TUI 起動
yagent exec --prompt ... で単発実行
yagent exec --resume latest --profile strong のような resume / routing 指定
yagent benchmark --feature-profile legacy --feature-profile current --prompt ... で feature flag 差分を比較
Orchestrator-first のサブエージェント実行
planner-driven execution plan で必要な agent だけを選択
intake -> plan -> execute -> verify -> recover -> finalize の run harness
adaptive packet compaction と state-first workspace memory
role / phase ベースの model routing
lazy capability discovery (list_capabilities / enable_capability)
built-in agent catalog と user-defined agent DSL
built-in agent に加えて ephemeral agent の即席生成
Bubble Tea / Bubbles ベースの TUI
Run Graph / Plan / Verification / Memory の切替 panel
OpenAI 互換の chat/completions API に対応
Agent Status viewport による実行状況モニター
tool call card / tool log card による実行中ツール表示
ツールレジストリ経由で fs search git task patch を実行
Claude Code 風の permission UI
--log <path> による JSON Lines イベントログ
execution.max_parallel_agents による並列実行制御 (min = 1)
task_list / task_run / task_bind による安全寄りの Task Catalog
tasks.toml の [[mcpservers]] と遅延 bind による MCP 連携
internal/ ベースで責務分離した構成

コマンド

# TUI を起動
yagent

# イベントログをファイルに保存
yagent --log yagent.log

# 設定ファイル付きで TUI を起動
yagent --config ~/.yagent.toml

# 明示的に TUI を起動
yagent tui --config ~/.yagent.toml

# 単発実行
yagent exec --config ~/.yagent.toml --prompt "こんにちは"

# 単発実行 + イベントログ
yagent exec --log yagent.log --prompt "こんにちは"

# 前回 run を復元して続ける
yagent exec --resume latest --prompt "続きの修正をして"

# routing profile を指定
yagent exec --profile strong --prompt "慎重にレビューして"

# feature flag の比較
yagent benchmark --feature-profile legacy --feature-profile current --prompt "この repo の改善点を提案して"

設定ファイル

[server]
default = "lmstudio"

[[server.servers]]
name = "lmstudio"
url = "http://127.0.0.1:1234"
token = ""
model = "gpt-5"
timeout = "20m"

[file]
allow_paths = ["/Users/you/Projects"]

[execution]
max_parallel_agents = 2
max_handoff_depth = 2
default_timeout = "120s"

[features]
phase_harness = true
adaptive_compaction = true
role_routing = true
repo_memory = true

[routing.profiles.fast]
server = "lmstudio"
model = "gpt-5-mini"

[routing.profiles.strong]
server = "lmstudio"
model = "gpt-5"

[harness]
max_verification_attempts = 2
force_planner = true

[context]
max_recent_messages = 12
max_artifacts = 8
max_relevant_files = 16
compact_after_turns = 12
compact_after_tool_calls = 12
compact_after_est_tokens = 12000
compact_after_verify_cycles = 2

[memory]
enabled = true
state_dir = ".yagent/state"
max_runs = 20
max_facts = 50

[benchmark]
default_runs = 2

[agent_catalog]
paths = ["~/.config/yagent/agents"]

[agents.coder]
model = "gpt-5"
routing_profile = "strong"

[agents.reviewer]
instruction = "Review for regressions first."

設定項目

server.default: 使用するサーバー名
server.servers: 接続先一覧
server.servers[].model: そのサーバーを使うときの既定 model。CLI の --model や agent override がなければこれが使われます
server.servers[].timeout: LLM client の HTTP timeout。未設定時は 20m
file.allow_paths: ツールからアクセス可能なパス一覧
execution.max_parallel_agents: 並列実行する subagent 数の上限。1 で逐次実行
execution.max_handoff_depth: handoff の最大深度
execution.default_timeout: agent ごとの LLM 呼び出し timeout
features.phase_harness: phased multi-agent harness を使うか
features.adaptive_compaction: packet を軽くする compaction を使うか
features.role_routing: role / phase ベースの router を使うか
features.repo_memory: workspace memory / observation cache の読込と更新を使うか
routing.profiles.*: role / phase ごとの model routing 先
harness.max_verification_attempts: verify -> recover の最大試行回数
harness.force_planner: 実装前に planner を強制するか
context.*: recent messages / artifacts / compaction 閾値
memory.*: ./.yagent/state/ 配下の session state / workspace memory / observation cache 設定
benchmark.default_runs: yagent benchmark の既定試行回数
agent_catalog.paths: user-defined agent DSL のファイルまたはディレクトリ一覧
agents.<id>.instruction: built-in agent の instruction 上書き
agents.<id>.model: built-in agent のモデル上書き
agents.<id>.routing_profile: built-in agent の routing profile 上書き
agents.<id>.allowed_tools: built-in agent の tool allowlist 上書き
agents.<id>.disabled: built-in agent の無効化

model の優先順は次です。

agents.<id>.model
リクエスト時の --model
server.servers[].model

起動時のカレントディレクトリは自動で許可パスに追加されます。

memory.state_dir 配下の主なレイアウトは次です。

sessions/: 各 session の state
workspace/facts.json: stable workspace facts / known failures / reusable observations の要約
workspace/snapshot.json: 観測 freshness 判定に使う workspace snapshot
artifacts/: typed artifact
observations/: reusable observation record
executions/: tool execution record
mutations/: mutation record
latest_session: /resume が参照する最新 session id

TUI Panels

右側 pane は Run Graph / Plan / Verification / Memory を切り替えられます。/graph /plan /verification /memory で直接開けて、Ctrl+←/→ でも順送りできます。/resume を使うと latest_session を読み込み、Plan / Verification / Memory panel もその内容で更新されます。/memory panel には stable facts / known failures / reusable observations / recent artifacts を表示します。

Benchmark

yagent benchmark は feature flag の組み合わせを比較するための簡易コマンドです。既定では legacy と current を比較します。

legacy: phase harness / compaction / role routing / repo memory をすべて無効化した single-manager baseline
current: features.* に設定された現在の構成
no-harness, no-routing, no-memory, no-compaction: 個別 flag を切って比較する補助 profile

出力は各 profile ごとの成功数、平均時間、平均イベント数、verification failure 数、各 run の phase / attempt / tool call 数をまとめます。

Agent DSL

yagent には最初からいくつかの built-in agent が入っていますが、リポジトリ固有の役割は TOML で追加できます。
この TOML を README では Agent DSL と呼んでいます。

planner-driven execution plan では、この Agent DSL で定義した metadata も agent inventory に入ります。task_kinds capabilities preferred_phases scope_hints を明示しておくと、planner が built-in agent と同じ基準で user-defined agent を選びやすくなります。

Agent DSL を使うと、たとえば次のような専用 agent を追加できます。

ドキュメント更新専用 agent
テスト観点の洗い出し専用 agent
特定ディレクトリだけを読む調査 agent

Agent DSL のファイルは agent_catalog.paths に指定したファイルまたはディレクトリから読み込みます。
ディレクトリを指定した場合は、その中の .toml ファイルをすべて読み込みます。

まず built-in agent の基本セットがあり、その上に user-defined agent を追加するイメージです。
agents.<id> は built-in agent の上書き設定で、Agent DSL は新しい agent を追加するための仕組みです。

標準では次の built-in agent を同梱しています。

manager: ユーザー窓口と全体の取り回しを担当
planner: タスク分解を担当
researcher: 関連ファイルの探索と要点抽出を担当
coder: 実装ターンを担当
tester: 検証を担当
reviewer: レビューを担当

built-in agent には、tool discovery workflow に関する共通 instruction も入っています。
特に次の前提を明示しています。

visible な mcp__* が無くても、ただちに「MCP は使えない」とは限らない
MCP が必要そうなら、まず task_list で kind = "mcp_server" の entry を確認する
relevant な server が未 bind なら task_bind(task_id=...) を呼ぶ
bind 成功後は返却された tool_names をそのまま使う
write-capable agent では fs_write / patch_apply を直接呼び、承認は会話ではなく approval dialog で行う
write tool 実行時の承認は会話ではなく approval dialog で行われる
現在の agent が read-only で write が必要なときは、write-capable agent に delegate / handoff する

user-defined agent は 1 ファイルにつき 1 agent を定義します。最小例は次です。

id = "docs-writer"
name = "Docs Writer"
description = "README や設計メモの更新を担当"
instruction = "Write concise docs with concrete examples."
mode = "handoff"
allowed_tools = ["fs_read", "fs_write", "patch_apply"]
read_only = false
max_turns = 4
timeout = "30s"
tags = ["docs"]
task_kinds = ["docs", "mutate"]
capabilities = ["documentation"]
preferred_phases = ["execute", "recover"]
scope_hints = ["README", "design docs"]
verification_required = true
verification_max_attempts = 2

主な項目の意味:

id: agent の識別子。tool / handoff からこの ID で参照されます
name: UI に表示する名前。省略時は id
description: その agent が何を担当するかの短い説明
instruction: その agent に常に与える追加指示
mode: agent の使われ方。tool は補助役、handoff は実装の主担当、manager は窓口用です
allowed_tools: その agent が使ってよい tool 一覧
read_only: 読み取り専用 agent として扱いたいときに指定
read_only = true の agent は write tool を新たに得るわけではありません。ファイル変更が必要なときは write-capable agent へ委譲する前提です
model: その agent だけ別 model を使いたいときに指定
max_turns: その agent の最大継続ターン数
timeout: その agent の LLM 呼び出し timeout
routing_profile: その agent をどの routing profile で実行するか
tags: 人間向けの整理用ラベル
task_kinds: その agent が向いている要求種別。question research docs review test mutate を指定
capabilities: planner に見せる能力ラベル
preferred_phases: その agent が入りやすい phase。plan execute verify recover finalize
scope_hints: planner に伝える担当範囲のヒント
verification_required / verification_max_attempts: その agent を primary にしたときの verify policy

agents.<id> では built-in agent の instruction / model / allowed tools / disabled を上書きできます。
built-in agent の基本セットは最初から使えますが、追加の DSL で repo 専用 agent を拡張する前提です。

Task Catalog (`tasks.toml`)

task_list / task_run / task_bind で見える task は、自由なシェルコマンドではなく、事前登録した command task と MCP server task を扱う仕組みです。
その登録ファイルが tasks.toml です。

置き場所は次の 2 つです。

リポジトリごとの設定: .yagent/tasks.toml
ユーザー共通の設定: ~/.config/yagent/tasks.toml

これに加えて、go.mod や package.json などを見て自動で task テンプレートも追加されます。
読み込み順は「自動検出テンプレート → ユーザー設定 → リポジトリ設定」で、同じ id があれば後から読んだ定義が上書きします。

最小例:

[[tasks]]
id = "go:test"
description = "Go のテストを実行"
command = "go"
args = ["test", "./..."]
risk = "medium"
allow_network = false
timeout = 300

主な項目の意味:

id: task の識別子。task_run ではこの ID を指定します
description: 人間や agent が task_list で読む説明
command: 実行するコマンド本体
args: コマンド引数の配列
cwd: 実行ディレクトリ。省略時は現在のリポジトリ root
risk: low / medium / high。未指定時は medium
allow_network: ネットワーク利用を伴う task かどうか
timeout: タイムアウト秒数

実用例:

[[tasks]]
id = "go:test"
description = "Go の全テストを実行"
command = "go"
args = ["test", "./..."]
risk = "medium"
allow_network = false
timeout = 300

[[tasks]]
id = "go:build"
description = "Go のビルドを実行"
command = "go"
args = ["build", "./..."]
risk = "medium"
allow_network = false
timeout = 300

[[tasks]]
id = "frontend:test"
description = "フロントエンドの単体テストを実行"
command = "npm"
args = ["test"]
cwd = "web"
risk = "medium"
allow_network = false
timeout = 600

cwd を相対パスで書いた場合は、現在の作業リポジトリを基準に解決されます。
まずは description を人間が見て分かる文にしておくと、task_list の一覧がかなり使いやすくなります。

MCP Server Catalog (`[[mcpservers]]`)

同じ tasks.toml 内に、MCP server を [[mcpservers]] として登録できます。
MCP server は task_run ではなく task_bind で起動・bind します。bind 後にだけ、その server の tool が mcp__... 形式で LLM に公開されます。

[[mcpservers]]
id = "docs"
description = "Docs search MCP"
transport = "stdio"
command = "npx"
args = ["-y", "@example/docs-mcp"]
tool_prefix = "docs"
parallel_safe = false
include_tools = ["search_docs"]

主な項目の意味:

id: MCP server の識別子。task_bind ではこの ID を指定します
description: 人間や agent が task_list で読む説明
transport: 現在は stdio のみ対応
command: MCP server を起動するコマンド本体
args: コマンド引数の配列
cwd: 実行ディレクトリ。省略時は現在のリポジトリ root
env: 追加環境変数
risk: low / medium / high
allow_network: ネットワーク利用を伴う server かどうか
timeout: タイムアウト秒数
tool_prefix: bind 後の tool 名プレフィックス。未指定時は id
parallel_safe: true のときだけ orchestrator が並列実行候補として扱います
include_tools / exclude_tools: 公開する MCP tool のフィルタ

典型的な流れ:

task_list で kind = "mcp_server" の entry を確認
task_bind で対象 server を bind
bind 後に追加された mcp__... tool を agent が利用

補足:

task_list の mcp_server entry には bind_required, usage_hint, bind_hint, exposed_tool_prefix, exposed_tools などの補助情報が含まれます
task_bind の返却には tool_names に加えて server_tool_names, next_action_hint, exposed_tool_prefix が含まれます
tool_names は bind 後にそのまま呼べる qualified MCP tool 名です
server_tool_names は MCP server 側の元の tool 名です
task_bind と bind 後の mcp__* tool は既定で visible です。追加の capability 有効化を挟まずに task_list -> task_bind -> mcp__* の流れを取れます

実行ログ

--log <path> を指定すると、実行イベントを JSON Lines 形式で出力します。

execution_event.agent_started
execution_event.llm_called
execution_event.tool_called
execution_event.tool_failed
execution_event.agent_failed
execution_event.agent_completed
execution_event.agent_continued
permission.requested
permission.resolved

tool_failed と agent_failed には失敗理由も記録されるので、subagent の失敗調査に使えます。
権限確認や継続確認が複数同時に発生した場合でも、TUI 側で順番に処理できます。

TUI 操作

Enter: 送信
Ctrl+J: 改行
PgUp / PgDn: ログスクロール
Alt+↑ / Alt+↓: 1 行ずつログスクロール
/ 入力中: 候補コマンドを表示
@ 入力中: pwd 基準の相対パス候補を表示
Tab: 候補コマンドまたはファイルパスを補完
/help: ヘルプ表示
/plan: 直近 run の plan を表示
/artifacts: 直近 run の artifacts を表示
/memory: workspace memory を表示
/resume: 最新 run を会話に復元
/approvals: approval の状態を表示
/clear: 会話ログをクリア
/exit: 終了

Permission UI

ファイル操作や task 実行などで許可が必要なときは、下部に permission card を表示します。

←/→ または Tab: 選択移動
Enter: 確定
Esc: 拒否

非ファイル系の選択肢:

今回だけ許可
同じ操作を以後許可
拒否

ファイル系の選択肢:

今回だけ許可
同じ操作を以後許可
ファイルパターン指定で以後許可
拒否

同じ操作を以後許可 は、同じツール・同じ action・同じ scope・同じ risk に対して再利用されます。
ファイルパターン指定で以後許可 は、指定した glob パターンに一致する同種のファイル系リクエストを、そのセッション中は自動で許可します。
パターンは *.go のような basename glob と internal/* のような path glob を使えます。
サブエージェント導入後も permission の判定ルール自体は変わりません。
permission card には、どの agent が要求したかも表示されます。

Tool Call UI

LLM が tool を呼び出すと、TUI 下部に tool call card を表示します。

実行しようとしている tool 名
対象 path や task id などの target
limit_bytes, recursive, overwrite などの主なオプション
実行状態 (requesting, completed, failed)

tool の出力は Tool Logs card に蓄積されます。

Agent Status

TUI では会話ログとは別に Agent Status viewport を表示し、サブエージェントの状態を監視できます。

実行中 / 完了 / 失敗した agent 数
親子関係を含む agent ツリー
handoff / delegate の流れ
直近イベント
elapsed
ctx

ctx は、参照中の message + unique file refs + artifact refs の合計です。
elapsed は 1 秒単位で表示されます。

継続確認

built-in agent の既定 max_turns は 200 です。
上限に達すると、permission UI と同じ導線で「継続実行するか」を確認します。

許可するとその agent の turn カウンタをリセットして続行
拒否すると agent_failed として終了

権限確認と継続確認の待ち時間は agent timeout に含まれません。

アーキテクチャ

主要コードは internal/ 配下にあります。

internal/
  app/       起動 wiring と依存解決
  cli/       Cobra command 定義
  config/    TOML 設定読込
  domain/    中核型と interface
  infra/     LLM client / tools / policy / agent catalog 実装
  tui/       Bubble Tea / Bubbles の UI
  usecase/   orchestrator / task catalog / 実行ロジック

実行モデル

manager がユーザー入力を受ける
planner が agent inventory を見て execution plan を作り、必要な agent を選ぶ
user-defined agent も built-in agent と同じ inventory として plan に組み込まれる
planner が無効な plan を返した場合は orchestrator が検証し、必要なら保守的な fallback に切り替える
それ以外は planner が agent inventory を見て structured な execution plan を返す
delegate_to_<agent> で bounded task を subagent に委譲
handoff_to_<agent> で専門 agent に現在ターンを handoff
run_ephemeral_agent で一時的な subagent を即席生成
リポジトリ探索には fs_list を優先し、単純な一覧取得のためにスクリプト生成へ逃がさない
planner の返した plan は orchestrator が validate し、無効なら 1 回だけ repair を要求して、それでもだめなら deterministic fallback に落とす
並列化は非破壊系 task のみ。書き込み系は常に直列化

ツール

現在の主な tool は次です。

fs_read / fs_write / fs_list / fs_stat / fs_remove / fs_move
search_text / search_files
git_status / git_diff / git_log / git_show
task_list / task_run / task_bind
patch_apply
list_capabilities / enable_capability

Task catalog は .yagent/tasks.toml の [[tasks]] / [[mcpservers]] とユーザー共通の ~/.config/yagent/tasks.toml から構築されます。
MCP まわりでは、visible な mcp__* が空でも task_list / task_bind による lazy-bind が使える場合があります。
write-capable agent では fs_write / patch_apply を直接実行でき、その後の書き込み承認は通常の assistant 返答ではなく approval dialog で行われます。
一方で write tool が見えない理由が current agent の read-only 制約である場合は、「tool が存在しない」のではなく write-capable agent へ委譲すべきケースです。

開発

# ビルド
go build -o yagent .

# テスト
go test ./...

# 実行
./yagent --config ~/.yagent.toml

テスト方針

internal/config: 設定読込テスト
internal/infra/agents/catalog: built-in catalog / user DSL 読込テスト
internal/infra/llm: fake server を使った HTTP クライアントテスト
internal/infra/policy: permission policy / path policy テスト
internal/infra/tools: file/fs/task/registry のユニットテスト
internal/usecase/orchestrator: delegation / handoff / ephemeral agent の実行テスト
internal/usecase/taskcatalog: task catalog の読込テスト
internal/tui: state transition と viewport / permission UI / status UI のテスト

ライセンス

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
.yagent		.yagent
docs		docs
internal		internal
.gitignore		.gitignore
AGENTS.md		AGENTS.md
Makefile		Makefile
README.md		README.md
TODO.md		TODO.md
build.sh		build.sh
config_example.toml		config_example.toml
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

yagent

yagent のユニークなポイント

特徴

コマンド

設定ファイル

設定項目

TUI Panels

Benchmark

Agent DSL

Task Catalog (`tasks.toml`)

MCP Server Catalog (`[[mcpservers]]`)

実行ログ

TUI 操作

Permission UI

Tool Call UI

Agent Status

継続確認

アーキテクチャ

実行モデル

ツール

開発

テスト方針

ライセンス

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

yagent

yagent のユニークなポイント

特徴

コマンド

設定ファイル

設定項目

TUI Panels

Benchmark

Agent DSL

Task Catalog (tasks.toml)

MCP Server Catalog ([[mcpservers]])

実行ログ

TUI 操作

Permission UI

Tool Call UI

Agent Status

継続確認

アーキテクチャ

実行モデル

ツール

開発

テスト方針

ライセンス

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Task Catalog (`tasks.toml`)

MCP Server Catalog (`[[mcpservers]]`)

Packages