Quorum is an evidence gate for enterprise AI agents.
It checks AI-generated business claims against approved company sources before those claims reach customers, employees, tickets, workflows, or downstream systems.
Enterprise teams are moving from AI experiments to agents that answer policy, support, HR, product, and operations questions. Those answers often sound confident even when they drift from approved company knowledge.
Quorum focuses on a narrow but high-value problem:
Given an AI-generated answer and approved source documents, identify which claims are supported, contradicted, unsupported, or need human review.
The goal is not to replace reviewers. The goal is to give reviewers a clear, auditable evidence report before an agent response becomes a business action.
The current CLI can:
- read an AI-generated answer from a Markdown or text file
- read one or more approved Markdown, text, or exported HTML source documents
- load source metadata such as
title,updatedAt, andtrustLevel - split the answer into atomic claims
- compare each claim against approved source snippets
- label each claim as
verified,contradicted,unsupported, orneeds_review - print a human-readable report
- write a JSON report for workflow automation
- write a Markdown reviewer report for approvals and handoff
- fail a CI job when selected risky verdicts appear
npm run dev -- verify \
--answer examples/answers/hr-answer.md \
--source-dir examples/sources \
--out reports/hr-report.json \
--markdown-out reports/hr-report.mdExample output:
Quorum Verification Report
Sources: HR Benefits Policy, Customer Support Playbook
Summary: 1 verified, 1 contradicted, 1 unsupported, 0 needs review
CONTRADICTED Employees receive 18 weeks of paid parental leave.
Reason: A closely matching approved source uses different numeric terms.
Evidence (HR Benefits Policy, high trust, score 0.857):
Employees receive 12 weeks of paid parental leave.
For CI-style blocking:
npm run dev -- verify \
--answer examples/answers/hr-answer.md \
--source-dir examples/sources \
--fail-on contradicted \
--fail-on unsupportedWhen a selected verdict is present, Quorum exits with status code 2.
git clone https://github.com/nash226/quorum.git
cd quorum
npm install
npm test
npm run build
npm run dev -- verify --answer examples/answers/hr-answer.md --source-dir examples/sources --out reports/hr-report.json --markdown-out reports/hr-report.mdSource files may include optional frontmatter metadata:
---
title: HR Benefits Policy
updatedAt: 2026-05-31
trustLevel: high
---Quorum includes this metadata in reports so reviewers can see which approved
source supported or contradicted each claim. trustLevel accepts high,
medium, or low and helps Quorum prefer stronger approved sources when
multiple passages are similarly relevant. Sources without a trust level default
to medium.
quorum verify --answer <path> (--source <path> | --source-dir <path>) [--json] [--out <path>] [--markdown-out <path>] [--fail-on <verdict>]
Options:
--answer <path>: AI-generated answer to verify--source <path>: approved source document; may be repeated--source-dir <path>: directory of approved source documents--json: print the full JSON report--out <path>: write the JSON report to disk--markdown-out <path>: write a reviewer-friendly Markdown report to disk--fail-on <verdict>: exit with code2when that verdict appears; may be repeated
Supported source extensions today:
.md.markdown.txt.html.htm
src/
claim-extractor.ts answer-to-claim extraction
claim-verifier.ts evidence matching and verdict logic
cli.ts command-line interface
report-policy.ts fail-on verdict policy
source-loader.ts source metadata and HTML loading
tests/ unit and fixture coverage
examples/ HR and support demo inputs
docs/ product notes, demo, roadmap, decision queue
npm test
npm run buildThe CI workflow runs both commands on pushes and pull requests.
The first wedge is HR and customer-support policy verification. These workflows are document-grounded, high-volume, and risky when AI answers drift from approved sources.
Near-term work:
- evaluation harness for labeled verdict examples
- Markdown or HTML reviewer reports
- batch verification
- better claim extraction for bullets, lists, and compound sentences
- API surface for agent integrations
See docs/roadmap.md for the working roadmap and
docs/product-brief.md for the product brief. For an
auto-updated snapshot of what has shipped on main, see
docs/status.md.
Automation uses GitHub issues labeled needs-human-decision when it needs
product judgment, credentials, paid services, or other human sign-off.
Review the queue here:
https://github.com/nash226/quorum/issues?q=is%3Aissue+is%3Aopen+label%3Aneeds-human-decision
See docs/decision-queue.md for the workflow.
Quorum is an early MVP. It is intentionally small, deterministic, and easy to inspect while the product direction is still being validated.