The AI Counsel

Collective AI Intelligence — Convene a council of AI models that deliberate, peer-review, and synthesize the best answer — or assemble a panel of named advisor personas that debate your question and deliver a structured verdict.

📺 Video Overview & Demo
Click below to watch the video demonstration of The AI Counsel:

What is The AI Counsel?

The AI Counsel is a dual-mode multi-model AI deliberation system. Instead of relying on a single LLM for answers, it orchestrates multiple models working together — either through anonymous peer review or persona-driven debate.

Choose your experience:

🏛️ LLM Council — Multiple AI models independently answer your question, anonymously peer-review each other's responses, and a chairman model synthesizes the collective wisdom into a final answer.
🎭 LLM Advisors — Named advisor personas (The Skeptic, The Strategist, The Ethicist, etc.) debate your question across configurable rounds, reaching consensus or voting to deliver a structured verdict with an action plan.

Choosing the right mode: use Council for direct answers, creative prompts, factual questions, and "give me the best response" synthesis. Use Advisors when the question has real tradeoffs, disagreement, risk, strategy, ethics, prioritization, or a decision to make. Simple prompts such as "give me one amazing animal fact" are usually Council prompts; advisor personas will naturally turn them into a debate over criteria.

Installation

You can clone, install dependencies, and start the application in one shot:

git clone https://github.com/jacob-bd/the-ai-counsel.git && \
cd the-ai-counsel && \
uv sync && \
npm install --prefix frontend && \
./start.sh

(Note: uv sync installs the backend dependencies, npm install --prefix frontend installs the frontend dependencies, and ./start.sh spins up both servers together).

Then open http://localhost:5173 and configure your API keys in Settings.

Prerequisites: Python 3.10+, Node.js 18+, uv

Two Modes of Deliberation

🏛️ LLM Council — Multi-Model Deliberation

The original three-stage pipeline where raw model diversity produces vetted answers:

YOUR QUESTION (+ optional web search / file uploads)
         │
         ▼
  ┌─────────────────────────────────┐
  │   STAGE 1: DELIBERATION         │
  │   Claude, GPT-4, Gemini, Llama  │
  │   Each answers independently    │
  └──────────────┬──────────────────┘
                 ▼
  ┌─────────────────────────────────┐
  │   STAGE 2: PEER REVIEW          │
  │   Anonymized as A, B, C, D      │
  │   Each model ranks all others   │
  └──────────────┬──────────────────┘
                 ▼
  ┌─────────────────────────────────┐
  │   STAGE 3: CHAIRMAN SYNTHESIS   │
  │   Reviews all + rankings        │
  │   Delivers the final answer     │
  └─────────────────────────────────┘

Execution modes control deliberation depth:

Mode	Stages	Best For
Chat Only	Stage 1 only	Quick responses, comparing model outputs
Chat + Ranking	Stages 1 & 2	Peer review without synthesis
Full Deliberation	All 3 stages	Complete council synthesis (default)

Multi-Round Iterative Debate (v0.7.0)

Council mode also supports multi-round iterative debate — models refine their answers across multiple rounds based on peer critiques, with convergence detection and early stopping:

  ┌─────────────────────────────────┐
  │   ROUND 1: Initial Responses    │
  │   + Peer Critique               │
  └──────────────┬──────────────────┘
                 ▼
  ┌─────────────────────────────────┐
  │   ROUNDS 2–5: Refinement        │
  │   Cross-pollination of top      │
  │   claims + targeted critique    │
  │   (auto-stops on convergence)   │
  └──────────────┬──────────────────┘
                 ▼
  ┌─────────────────────────────────┐
  │   STAGE 4: CORRECTED DRAFT      │
  │   Chairman synthesizes final    │
  │   draft with [REVISED]/[NEW]    │
  └─────────────────────────────────┘

Three critique modes control how models evaluate each other:

Mode	How It Works
Free-form	Open-ended feedback on the full response
Paragraph-level	Structured per-paragraph evaluation with stable `[Para N]` markers
Claim-level	Chairman extracts falsifiable claims; peers verdict each claim (strong/weak/flawed)

Configure rounds (1–5), critique mode, and convergence threshold in Settings > Council Debate, or via the run_iterative_debate MCP tool. See docs/COUNCIL-DEBATE-CONFIG.md for a full walkthrough.

🎭 LLM Advisors — Persona-Driven Debate

A fundamentally different approach: named personas with distinct thinking styles argue your question in structured rounds.

Advisor mode works best when there is something meaningful to debate: a strategic choice, a product decision, a risk review, an ethical question, or competing options. For simple answer generation, use Council mode instead.

YOUR QUESTION (+ optional web search)
         │
         ▼
  ┌─────────────────────────────────┐
  │   ROUND 1: OPENING POSITIONS    │
  │   Each advisor states their case │
  └──────────────┬──────────────────┘
                 ▼
  ┌─────────────────────────────────┐
  │   ROUND 2–N: DEBATE             │
  │   Rotating order, respond to    │
  │   each other by name            │
  │   (auto-stops on consensus)     │
  └──────────────┬──────────────────┘
                 ▼
  ┌─────────────────────────────────┐
  │   VERDICT (or TIEBREAKER)       │
  │   Summary, consensus points,    │
  │   disagreements table, verdict, │
  │   next steps, open questions    │
  └─────────────────────────────────┘

12 built-in advisor personas:

Persona	Role	Style
🔍 The Skeptic	Critical Thinker	Challenges assumptions, demands evidence
🔧 The Pragmatist	Practical Advisor	Focuses on feasibility and real-world constraints
💡 The Innovator	Creative Thinker	Pushes boundaries, explores unconventional solutions
📜 The Historian	Pattern Analyst	Draws lessons from historical patterns
⚖️ The Ethicist	Moral Compass	Examines decisions through ethics and fairness
📊 The Data Analyst	Evidence Evaluator	Brings quantitative rigor and measurable evidence
🎭 The Contrarian	Devil's Advocate	Deliberately argues the opposing position
♟️ The Strategist	Big-Picture Thinker	Thinks long-term about positioning and leverage
🤝 The Humanist	People-First Advocate	Centers the human experience and well-being
🛡️ The Risk Assessor	Risk Analyst	Identifies worst-case scenarios and mitigations
🎤 The Comedian	Humorist Critic	Uses wit to expose absurdity and weak framing
📈 The Economist	Incentives Analyst	Analyzes incentives, scarcity, and unintended consequences

All personas are fully customizable — edit name, role, description, system prompt, and emoji. Changes persist across sessions with per-persona reset to defaults.

Features

Multi-Provider Support

Mix and match models from 12 different provider types:

Provider	Type	Description
OpenRouter	Cloud	100+ models via single API (GPT-4, Claude, Gemini, Mistral, etc.)
Ollama	Local	Run open-source models locally (Llama, Mistral, Phi, etc.)
Groq	Cloud	Ultra-fast inference for Llama and Mixtral models
NVIDIA NIM	Cloud	NVIDIA Build models via `integrate.api.nvidia.com`
OpenCode Zen	Cloud	Direct connection to opencode.ai/zen (chat/completions only, v1)
OpenCode Go	Cloud	Direct connection to OpenCode Go (subscription, chat/completions only, v1)
OpenAI Direct	Cloud	Direct connection to OpenAI API
Anthropic Direct	Cloud	Direct connection to Anthropic API
Google Direct	Cloud	Direct connection to Google AI API
Mistral Direct	Cloud	Direct connection to Mistral API
DeepSeek Direct	Cloud	Direct connection to DeepSeek API
Custom Endpoint	Any	Any OpenAI-compatible API (Together AI, Fireworks, vLLM, LM Studio, GitHub Models, etc.)

Web Search Integration

Ground your council's or advisors' responses in real-time information:

Provider	Type	Notes
DuckDuckGo	Free	Hybrid web+news search, no API key needed
TinyFish	Free	Batch Fetch API for fast multi-URL fetching
Serper	API Key	Real Google results, 2,500 free queries
Tavily	API Key	Purpose-built for LLMs, rich content
Brave Search	API Key	Privacy-focused, 2,000 free queries/month

Full Article Fetching: Uses Jina Reader to extract full article content from top search results (configurable 0–10 results).

Temperature Controls

Fine-tune creativity vs consistency per stage:

Council Heat (Stage 1): Individual response creativity (default: 0.5)
Peer Ranking Heat (Stage 2): Ranking consistency (default: 0.3)
Chairman Heat (Stage 3): Final synthesis creativity (default: 0.4)

Some provider/model combinations only accept their default temperature. The app automatically omits temperature for those models so preflight and runs do not fail on provider-specific temperature restrictions.

Additional Features

Live Progress Tracking — See each model or advisor respond in real-time with streaming; reconnect to active runs via GET /api/conversations/{id}/progress
Multi-turn Conversations — Follow-up questions carry full context automatically
Text File Uploads — Attach PDFs and text/code/config files in Council or Advisor mode; extracted text is sent as normalized prompt context across all providers while conversation history stores attachment metadata only
Council Sizing — Adjust council from 1 to 8 models; advisors from 2 to 4 personas (select from 12)
Advisor Presets — Save and load named advisor lineups (personas, model mode, optional rounds/web search) from Advisor Setup
Abort Anytime — Cancel in-progress requests
Conversation History — All conversations saved locally with search; sidebar cards show stacked date/time, compact run summaries (rounds, critique mode, personas, search), and cumulative cost per thread
Customizable System Prompts — Edit Stage 1, 2, and 3 prompts for Council mode
Run Cost Reporting — See total cost, input/output token split, call count, pricing confidence, and per-model breakdowns for council and advisor runs
Rate Limit Warnings — Alerts when your config may hit API limits
"I'm Feeling Lucky" — Randomize your council composition
Import & Export — Backup and share your settings, API keys, and prompts
Per-request Model Overrides — Use different models for individual requests without changing global config
One-shot API — POST /api/ask for scripts and MCP agents; each completed run is saved to the UI and returns a conversation_id
Docker Deployment — Single-container production deployment via docker compose

File Uploads

Attach PDFs and text-like files from the Council input or Advisor setup. The backend extracts text once before model calls, so uploads work the same way across OpenRouter, Ollama, Groq, direct providers, custom endpoints, and MCP.

Supported v1 formats include .pdf, .txt, .md, .csv, .json, .yaml, .xml, .html, logs, code files, and common config files. Conversation history stores file name/type/size metadata only; it does not store raw file bytes or extracted text.

PDFs use embedded text extraction by default. OCR for scanned or image-only PDFs is optional: set LLM_COUNCIL_OCR_ENABLED=1 and install OCRmyPDF, Tesseract, Ghostscript, and qpdf in the backend runtime. If OCR is unavailable, the run continues with extracted text and warnings.

Quick Start

Prerequisites

Python 3.10+
Node.js 18+
uv (Python package manager)

Running the Application

Option 1: Use the start script (recommended)

./start.sh

Option 2: Run manually

Terminal 1 (Backend):

uv run python -m backend.main

Terminal 2 (Frontend):

cd frontend
npm run dev

Then open http://localhost:5173 in your browser.

Docker / VPS Deployment

docker compose up -d --build

Then open http://YOUR_SERVER_IP:8001. Conversations and settings persist to ./data automatically.

For Ollama integration, reverse proxy setup, environment variables, and upgrade instructions, see docs/DOCKER.md.

Coming from LLM Council Plus? See the Migration Guide for step-by-step upgrade instructions. Your data and configs carry over without changes.

Network Access

The start script exposes both frontend and backend on the network automatically:

Local: http://localhost:5173
Network: http://YOUR_IP:5173

For manual setup:

# Backend with network access
LLM_COUNCIL_BIND_HOST=0.0.0.0 uv run python -m backend.main

# Frontend with network access
cd frontend && npm run dev -- --host

Remote admin endpoints (/api/settings/export, /api/settings/import, /api/settings/reset) require LLM_COUNCIL_ADMIN_TOKEN when accessed by proxied or remote clients.

Configuration

First-Time Setup

On first launch, configure at least one LLM provider in Settings:

LLM API Keys — Enter API keys for your chosen providers (and Ollama URL / custom endpoint if used)
Council Config (Settings) or welcome-screen Council Setup — add members and chairman; both edit the same saved lineup (auto-saves)

Settings changes save automatically (~1 second after you stop editing). API keys auto-save when you click "Test" and the connection succeeds.

Provider toggles are global: Settings → Council Config provider toggles control which sources appear in all model pickers — Council Setup and Advisor Setup alike. A provider must be both configured (API key) and enabled (toggle on) to show its models.

Advisor presets: In Advisor Setup, save named lineups (personas, models, optional rounds/web search) from the Model Assignment section. Presets persist in settings.json as advisor_presets (max 20; one default).

LLM API Keys

Provider	Get API Key
OpenRouter	openrouter.ai/keys
Groq	console.groq.com/keys
NVIDIA	build.nvidia.com
OpenAI	platform.openai.com/api-keys
Anthropic	console.anthropic.com
Google AI	aistudio.google.com/apikey
Mistral	console.mistral.ai/api-keys
DeepSeek	platform.deepseek.com

Ollama (Local Models)

Install Ollama
Pull models: ollama pull llama3.1
Start Ollama: ollama serve
In Settings, enter your Ollama URL (default: http://localhost:11434)
Click "Connect" to verify

Custom OpenAI-Compatible Endpoint

Connect to any OpenAI-compatible API:

Go to LLM API Keys → Custom OpenAI-Compatible Endpoint
Enter Display Name, Base URL, and API Key (optional for local servers)
Click "Connect" to test and save

Compatible services: Together AI, Fireworks AI, vLLM, LM Studio, GitHub Models, and more.

MCP Server

The AI Counsel exposes a powerful Model Context Protocol (MCP) server that lets AI tools like Claude Code and Gemini CLI interact directly with your local or remote instance.

The server exposes 10 action-based tools grouped by domain:

Deliberation: council_deliberate (stage1/stage2/stage3/full), model_chat (quick/multi_turn), advisor_debate, run_iterative_debate
Configuration: council_settings, advisor_settings, personas, providers, config_backup
History: conversations (list/get)

Legacy 25-tool names were removed in v0.5.2. run_iterative_debate was added in v0.7.0. See docs/mcp/TOOLS.md for the action parameter on each tool.

Deliberation tools also accept optional document inputs. Base64 files are extracted by the backend before model calls, so raw file bytes are not sent to providers.

Quick registration for Claude Code:

Option A: Local stdio (Standard for local development)

pip install -e .
claude mcp add the-ai-counsel python -m the_ai_counsel_mcp

Option B: Remote SSE (Zero-install for containers/servers)

claude mcp add --transport sse the-ai-counsel http://yourserver.com:8001/mcp/sse

Then ask Claude: "check the council health" to verify the connection (providers → action health; expect 10 tools in /api/health).

See docs/mcp/ for full setup guides, including stdio/SSE transport configurations, complete tools reference, and usage examples.

Claude Code Skill (REST fallback)

When MCP isn't available or you need preset CRUD / raw SSE, install the the-ai-counsel-api skill. When both skill and MCP are present, agents should use MCP tools first — the skill documents REST as fallback.

# Symlink from your cloned repo
mkdir -p ~/.claude/skills
ln -s "$(pwd)/skills/the-ai-counsel-api" ~/.claude/skills/the-ai-counsel-api

The skill covers all API endpoints, SSE stream parsing, advisor endpoints, and troubleshooting. See skills/the-ai-counsel-api/SKILL.md for the full reference.

Contributors: keep REST API, MCP tools, skill, and user docs in sync — see docs/DOC-SYNC.md.

Tech Stack

Component	Technology
Backend	FastAPI, Python 3.10+, httpx (async HTTP)
Frontend	React 19, Vite, react-markdown
Styling	CSS with "Midnight Glass" dark theme
Storage	JSON files in `data/` directory
Package Management	uv (Python), npm (JavaScript)

Data Storage

All data is stored locally in the data/ directory:

data/
├── settings.json              # Configuration (includes API keys)
├── persona_overrides.json     # Advisor persona customizations
└── conversations/             # Conversation history
    ├── {uuid}.json
    └── ...

Privacy: Prompts and responses are sent only to your configured LLM/search providers. Cost reporting also fetches public model-pricing catalogs; it does not send prompt text, responses, or API keys.

⚠️ Security Warning: API Keys Stored in Plain Text

API keys are stored in clear text in data/settings.json. The data/ folder is included in .gitignore by default.

Do NOT remove data/ from .gitignore

Never commit data/settings.json to version control

If you accidentally expose your keys, rotate them immediately

Troubleshooting

"Failed to load conversations"

Backend might still be starting up — the app retries automatically

Models not appearing in dropdown

Ensure the provider toggle is enabled in Settings → Council Config (toggles are global — apply to both Council and Advisor pickers)
Check that the API key is configured and tested successfully
For Ollama, verify connection is active

Jina Reader returns 451 errors

HTTP 451 = site blocks AI scrapers (common with news sites)
Try Tavily/Brave instead, or set full_content_results to 0

Rate limit errors (OpenRouter)

Free models: 20 requests/min, 50/day
Consider using Groq (14,400/day) or Ollama (unlimited)

Binary compatibility errors (node_modules)

When syncing between Intel/Apple Silicon Macs:

rm -rf frontend/node_modules && npm install --prefix frontend

Logs:

Backend: Terminal running uv run python -m backend.main
Frontend: Browser DevTools console

Credits & Acknowledgements

This project builds upon the original llm-council by Andrej Karpathy.

The AI Counsel extends that foundation with dual-mode deliberation (Council + Advisors), 12 provider integrations (including NVIDIA NIM and OpenCode Zen/Go), web search, persona-driven debates, customizable prompts, an MCP server, Docker deployment, and much more.

We gratefully acknowledge Andrej Karpathy for the original inspiration and codebase.

License

MIT License — see LICENSE for details.

Contributing

Contributions are welcome! This project embraces the spirit of "vibe coding" — feel free to fork and make it your own.

Built with the collective wisdom of AI
Ask the council. Debate with advisors. Get better answers.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
assets		assets
backend		backend
docs		docs
frontend		frontend
skills/the-ai-counsel-api		skills/the-ai-counsel-api
the_ai_counsel_mcp		the_ai_counsel_mcp
.dockerignore		.dockerignore
.gitignore		.gitignore
.python-version		.python-version
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
docker-entrypoint.sh		docker-entrypoint.sh
pyproject.toml		pyproject.toml
start.sh		start.sh
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

The AI Counsel

What is The AI Counsel?

Installation

Two Modes of Deliberation

🏛️ LLM Council — Multi-Model Deliberation

Multi-Round Iterative Debate (v0.7.0)

🎭 LLM Advisors — Persona-Driven Debate

Features

Multi-Provider Support

Web Search Integration

Temperature Controls

Additional Features

File Uploads

Quick Start

Prerequisites

Running the Application

Docker / VPS Deployment

Network Access

Configuration

First-Time Setup

LLM API Keys

Ollama (Local Models)

Custom OpenAI-Compatible Endpoint

MCP Server

Claude Code Skill (REST fallback)

Tech Stack

Data Storage

Troubleshooting

Credits & Acknowledgements

License

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages