MCS Agent Builder

Automate end-to-end Microsoft Copilot Studio agent builds — from customer intake through architecture, build, evaluation, and automated fix loops.

Two AI models work in parallel: Claude orchestrates, writes code, and executes against MCS APIs. GPT-5.4 runs alongside as a second pair of eyes — reviewing every instruction, topic, brief, and eval score in real-time. Different models, different biases, better coverage.

Quick Start

npm install -g mcs-agent-builder
mcs start

That's it. One install, one command. Opens your browser to the dashboard automatically. No Python, no git clone, no setup scripts.

Commands:

mcs start       # launch the dashboard
mcs stop        # stop the dashboard
mcs restart     # stop + start
mcs update      # update to latest version (auto-restarts if running)
mcs health      # check if running
mcs doctor      # check all prerequisites

For developers (working from the repo):

git clone https://github.com/microsoft/MCS-Agent-Builder.git
cd MCS-Agent-Builder
npm start

Running from the repo enables auto-update via git pull, frontend hot-reload, and git hooks.

What It Does

  Documents  ──>  Research  ──>  Build  ──>  Evaluate  ──>  Fix  ──>  Deploy
  (SDR, reqs)     (brief.json)   (MCS API)   (Direct Line)   (auto)   (prod env)
                       │                          │
                  GPT reviews                GPT scores
                  in parallel                in parallel

Upload customer documents (SDR, requirements, notes)
Research — Claude reads everything, identifies agents, researches MCS components, generates the full design (brief.json)
Build — Claude builds the agent in Copilot Studio using a hybrid API stack
Evaluate — automated tests run against the published agent, scored by both heuristics and GPT-5.4
Fix — if eval pass rate is below target, Claude classifies failures, fixes instructions/topics, and re-evaluates
Deploy — promote from dev to production (agent-level or solution-level)

The dashboard shows everything in real-time with an embedded Claude Code terminal. Multiple tabs let you work on several agents in parallel.

CLI Skills

/mcs-init ProjectName                    Create project, detect SDR files
/mcs-context CustomerName                Pull M365 history (emails, meetings, docs, Teams)
/mcs-research ProjectName                Full research + architecture + eval generation
/mcs-build ProjectName agentId           Build agent(s) in Copilot Studio
/mcs-eval ProjectName agentId            Run eval tests, write results
/mcs-fix ProjectName agentId             Fix eval failures, re-evaluate
/mcs-deploy ProjectName agentId          Deploy to target environment
/mcs-report ProjectName agentId          Generate reports (brief/build/customer/deployment)
/mcs-library list                        Browse team solution library
/mcs-refresh                             Refresh knowledge cache

Updates

The tool checks for updates every 4 hours when you run mcs start. If a new version is available:

  Update available: 1.0.2 → 1.1.0
  Run: mcs update

Run mcs update to install the latest version. If the dashboard is running, it auto-restarts with the new version.

Dual-Model Review (Claude + GPT-5.4)

Every non-trivial task gets two AI perspectives automatically:

What Happens	Claude	GPT-5.4 (parallel)
Instructions written	Writes them	Reviews for gaps, contradictions, anti-patterns
Topic YAML generated	Generates it	Reviews for dead ends, variable issues, UX problems
Brief completed	Designs it	Reviews for completeness, blocking issues
Eval tests run	Scores with heuristics	Scores with LLM understanding
Failures analyzed	Classifies root causes	Cross-checks the analysis

When they disagree: both positions are shown, the stricter finding wins. If eval scores diverge >20 points, the test is flagged for human review.

Setup (one-time, 30 seconds):

gh auth login                       # sign in with your GitHub account
gh auth refresh --scopes copilot    # add Copilot API access

GPT is fully optional — if not configured, everything works with Claude alone. Run mcs doctor to check.

Hybrid Build Stack

Each build step uses the best tool — fully API-native, zero browser automation:

Tool	Handles
PAC CLI	Listing agents, solution ALM
MCS LSP Wrapper	Instructions, model, topics, knowledge sync
Island Gateway API	Model catalog, component reads, routing, settings, eval upload
Flow Manager	Power Automate flow CRUD + composition
Dataverse API	File uploads, bot name, publish, security
Direct Line API	Eval testing (+ GPT-5.4 scoring with `--gpt` flag)
GPT-5.4 Review	Cross-model review of instructions, topics, briefs, scores

YAML Validation Pipeline

Topic YAML goes through 4 layers before reaching Copilot Studio:

Layer	Tool	Catches
Pre-generation	`gen-constraints.py`	Missing required fields
Structural	`om-cli.exe`	Unknown nodes, invalid structure (357 types)
Semantic	`semantic-gates.py`	PowerFx errors, cross-refs, variable flow, channel compat
Spec drift	`drift-detect.py`	Missing topics, trigger mismatches vs brief

Agent Teams

Complex builds use 7 AI teammates + GPT-5.4 that challenge each other's work:

Teammate	Role
Research Analyst	Discovers MCS capabilities, prevents false limitation claims
Prompt Engineer	Writes agent instructions, reviews prompt quality
Topic Engineer	Generates YAML topics + adaptive cards
QA Challenger	Reviews all outputs, challenges claims, generates eval sets
Flow Designer	Designs Power Automate flow specs
Repo Checker	Validates repo integrity after changes
Repo Optimizer	Finds dead code, duplication, bloat
GPT-5.4	Parallel second opinion on every review (via Copilot API)

You interact with the lead only. The lead delegates, teammates debate and iterate, then the lead executes validated outputs in Copilot Studio.

Knowledge System

The tool continuously learns and improves:

Layer	What	Stays Current
Cache (20 files)	MCS capabilities — models, connectors, MCP servers, triggers	Auto-refreshed at session start
Learnings (8 files)	Experience from past builds — what worked, what didn't	Captured after each build
Patterns	YAML syntax, Dataverse API, 11 topic templates, 9 flow templates	Stable reference
Frameworks	Component selection, architecture scoring, eval scenarios	Stable reference

Prerequisites

Run mcs doctor to check everything.

Requirement	Required	Why
Node.js 18+	Yes	Server and terminal
Claude Code	Yes	AI agent that runs the builds
Git	Optional	Auto-updates (repo mode only)
GitHub CLI + copilot scope	Optional	GPT-5.4 cross-model reviews
Azure CLI	Optional	ADO work items (bug/suggest)
PAC CLI	Optional	Power Platform operations
.NET 10 Runtime	Optional	YAML validation (om-cli)
VS Code + MCS Extension	Optional	Headless LSP sync

Architecture

Single Node.js process serves the dashboard (Express HTTP), REST API, and Claude Code terminal (WebSocket) on one port.

app/
  server.js                   Express server (HTTP + WebSocket on one port)
  lib/                        Readiness calc, document conversion, project CRUD, terminal
  frontend/                   React + TypeScript SPA (Vite + shadcn/ui)
  dist/                       Pre-built frontend (ships with npm package)

Port	Service	Binding
8000-8020	Dashboard + Terminal	localhost only

Single port, auto-discovered. If 8000 is busy, the next available port is used.

Project Data

Install method	Projects stored at
npm global (`mcs start`)	`~/MCS-Agent-Builder/`
Git repo (`npm start`)	`./Build-Guides/`

Project Structure

bin/
  cli.js                      CLI (start, stop, health, doctor, update)
  postinstall.js              Post-install setup

.claude/
  settings.json               MCP servers, permissions
  skills/                     13 skills (11 workflow + 2 utility)
  agents/                     7 AI teammate definitions

app/
  server.js                   Express backend + WebSocket terminal
  lib/                        Readiness, documents, projects, terminal, brief migration
  frontend/                   React + TypeScript SPA (Vite + shadcn/ui)

knowledge/
  cache/                      20 MCS capability cheat sheets (auto-refreshed)
  learnings/                  Experience from past builds
  patterns/                   YAML, Dataverse, solution patterns + topic/flow templates
  frameworks/                 Decision frameworks + eval scenarios

tools/
  lib/openai.js               GPT-5.4 client (GitHub Copilot Responses API)
  lib/http.js                 Shared HTTP + Azure CLI token helpers
  lib/flow-composer.js        Flow composition (builders, wiring, validation)
  multi-model-review.js       GPT review CLI (instructions, topics, briefs, scoring)
  eval-scoring.js             Scoring module (7 methods, dual heuristic+GPT)
  direct-line-test.js         Direct Line eval runner
  mcs-lsp.js                  MCS Language Server wrapper (push/pull)
  island-client.js            Island Gateway API client
  flow-manager.js             Power Automate flow CRUD + composition
  add-tool.js                 Headless tool/connector addition
  solution-library.js         Team SharePoint solution library
  replicate-agent.js          Cross-environment agent replication
  om-cli/                     ObjectModel CLI — YAML validation (357 types)

templates/                    brief.json schema (single source of truth)
Build-Guides/                 Per-project work (gitignored)

Troubleshooting

Problem	Fix
Something not working	`mcs doctor` — checks all prerequisites with fix instructions
Port conflict	Auto-discovered. Run `mcs health` to see actual port
GPT reviews not working	`gh auth login && gh auth refresh --scopes copilot`
PAC CLI not working	Ask Claude: "set up PAC CLI auth for me"
Wrong MCS environment	Claude detects mismatches and asks you to switch
Terminal not connecting	Close the tab and click "+" for a new session

Feedback

Click Bug or Suggest in the dashboard header. Claude creates a GitHub issue for you with auto-gathered context. Or file directly on GitHub.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 241 Commits
.claude		.claude
.github/workflows		.github/workflows
app		app
bin		bin
knowledge		knowledge
templates		templates
tools		tools
.gitignore		.gitignore
.mcp.json		.mcp.json
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
backlog.md		backlog.md
es-metadata.yml		es-metadata.yml
package-lock.json		package-lock.json
package.json		package.json
start.js		start.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MCS Agent Builder

Quick Start

What It Does

CLI Skills

Updates

Dual-Model Review (Claude + GPT-5.4)

Hybrid Build Stack

YAML Validation Pipeline

Agent Teams

Knowledge System

Prerequisites

Architecture

Project Data

Project Structure

Troubleshooting

Feedback

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MCS Agent Builder

Quick Start

What It Does

CLI Skills

Updates

Dual-Model Review (Claude + GPT-5.4)

Hybrid Build Stack

YAML Validation Pipeline

Agent Teams

Knowledge System

Prerequisites

Architecture

Project Data

Project Structure

Troubleshooting

Feedback

License

About

Resources

License

Code of conduct

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages