feat: add CI workflow, CONTRIBUTING.md, CHANGELOG.md, and README badges/demo by Copilot · Pull Request #1 · Smithbox-ai/ControlFlow

Copilot · 2026-04-15T20:43:20Z

Repository lacked CI, contributor guidance, release history, and a compelling first impression — all friction points that reduce community traction.

Changes

`.github/workflows/ci.yml`

- Runs `cd evals && npm test` on every push/PR to master
- `permissions: contents: read` (least-privilege `GITHUB_TOKEN`)
- Unlocks a live CI status badge

`README.md`

- Four badges at the top: CI status, agent count (13), eval checks (302), MIT license
- New "How It Works" section leads with the 3-step user workflow (@Planner → approve → @Orchestrator) instead of architecture prose, which reduces time-to-value for new visitors

`CONTRIBUTING.md`

- Adding a new agent: P.A.R.T structure, schema contract, eval scenarios, governance registration
- Editing existing agents: regression testing expectations, contract/schema update rules
- Adding skills, proposing changes via Issues/PRs

`CHANGELOG.md`

- v1.0.0 entry covering all 13 agents, architecture features, governance contracts, skill library, and eval suite


Copilot

Pull request overview

Adds project “community traction” essentials (CI + contributor/release docs) and improves README first-impression by surfacing status badges and a quick-start workflow explanation.

Changes:

- Add GitHub Actions CI workflow to run `cd evals && npm test` on pushes/PRs.
- Update README with badges and a new “How It Works” section emphasizing the 3-step workflow.
- Add new `CONTRIBUTING.md` and `CHANGELOG.md` with guidance and a v1.0.0 entry.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 5 comments.

| File | Description |
| --- | --- |
| `.github/workflows/ci.yml` | Introduces CI job that installs eval deps and runs eval test suite. |
| `README.md` | Adds badges and a concise “How It Works” onboarding section. |
| `CONTRIBUTING.md` | Documents contribution workflows (running evals, adding/editing agents, skills, PR expectations). |
| `CHANGELOG.md` | Adds Keep-a-Changelog formatted v1.0.0 entry capturing system capabilities. |

Copilot · 2026-04-15T20:48:55Z

+**Eval suite (302 checks)**
+
+- Pass 1: Schema validity (Ajv strict mode, JSON Schema 2020-12)
+- Pass 2–3: Scenario integrity and cross-scenario structural regression (179 structural checks)
+- Pass 4: P.A.R.T section order enforcement
+- Pass 4b: Clarification trigger and tool routing section validation
+- Pass 5: Skill library registration integrity
+- Pass 6: Synthetic rename negative-path checks
+- Pass 7: Prompt behavior contract behavioral regression (74 checks across 9 agents)
+- Pass 8: Orchestration handoff contract regression (49 checks)


The changelog claims the eval suite has “302 checks” with a specific breakdown, but evals/README.md currently documents a different total. Consider either aligning these numbers with the authoritative source or avoiding fixed counts in the changelog to prevent staleness.

Suggested change

-**Eval suite (302 checks)**
-
-- Pass 1: Schema validity (Ajv strict mode, JSON Schema 2020-12)
-- Pass 2–3: Scenario integrity and cross-scenario structural regression (179 structural checks)
-- Pass 4: P.A.R.T section order enforcement
-- Pass 4b: Clarification trigger and tool routing section validation
-- Pass 5: Skill library registration integrity
-- Pass 6: Synthetic rename negative-path checks
-- Pass 7: Prompt behavior contract behavioral regression (74 checks across 9 agents)
-- Pass 8: Orchestration handoff contract regression (49 checks)
+**Eval suite**
+
+- Pass 1: Schema validity (Ajv strict mode, JSON Schema 2020-12)
+- Pass 2–3: Scenario integrity and cross-scenario structural regression
+- Pass 4: P.A.R.T section order enforcement
+- Pass 4b: Clarification trigger and tool routing section validation
+- Pass 5: Skill library registration integrity
+- Pass 6: Synthetic rename negative-path checks
+- Pass 7: Prompt behavior contract behavioral regression across agent prompts
+- Pass 8: Orchestration handoff contract regression

Copilot · 2026-04-15T20:48:56Z

+    branches: ["master"]
+  pull_request:
+    branches: ["master"]


The workflow is only triggered for the master branch. The repo/diff context suggests the default branch may be main, in which case CI won’t run for pushes/PRs to the default branch and the README CI badge may show no results. Consider triggering on both main and master, or updating the branch filter to match the repo’s actual default branch.

Suggested change

-    branches: ["master"]
-  pull_request:
-    branches: ["master"]
+    branches: ["main", "master"]
+  pull_request:
+    branches: ["main", "master"]
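
For context, the pieces discussed in this PR (the branch filter, the read-only token permissions, and the `npm ci` / `npm test` steps run from `evals`) could fit together roughly as follows. This is a minimal sketch and not the workflow file that was merged: the job name, runner image, action versions, and Node version are assumptions chosen for illustration.

```yaml
# Hypothetical sketch of .github/workflows/ci.yml; not the file from this PR.
name: CI

on:
  push:
    branches: ["main", "master"]   # covers either default-branch name
  pull_request:
    branches: ["main", "master"]

# Least-privilege GITHUB_TOKEN, as described in the PR summary.
permissions:
  contents: read

jobs:
  evals:                           # job name is an assumption
    runs-on: ubuntu-latest         # runner image is an assumption
    defaults:
      run:
        working-directory: evals   # applies to the `run` steps below
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version: 20         # Node version is an assumption
      - run: npm ci                # install from the committed lockfile
      - run: npm test              # run the eval suite
```

Listing both branch names keeps the badge meaningful whichever one turns out to be the default; once the default branch is confirmed, the unused name can be dropped.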

Copilot · 2026-04-15T20:48:56Z


+[![CI](https://github.com/Smithbox-ai/ControlFlow/actions/workflows/ci.yml/badge.svg)](https://github.com/Smithbox-ai/ControlFlow/actions/workflows/ci.yml)
+![Agents](https://img.shields.io/badge/agents-13-blue)
+![Eval Checks](https://img.shields.io/badge/eval%20checks-302-brightgreen)


The “Eval Checks” badge hardcodes 302, but the repository’s evals/README.md currently states a different total (283). Hardcoding the number risks the badge and docs drifting out of sync; consider removing the fixed count or deriving it from a single authoritative source and updating all references together.

Suggested change

-![Eval Checks](https://img.shields.io/badge/eval%20checks-302-brightgreen)
+![Eval Checks](https://img.shields.io/badge/eval%20checks-passing-brightgreen)

Copilot · 2026-04-15T20:48:56Z

+npm test
+```
+
+All 302 checks must pass before any PR can be merged. The suite runs fully offline.


This section states “All 302 checks must pass…”, but evals/README.md currently documents a different total (283). To avoid documentation drift, consider removing the hardcoded number (e.g., “All eval checks must pass”) or updating both files to match the authoritative count.

Suggested change

-All 302 checks must pass before any PR can be merged. The suite runs fully offline.
+All eval checks must pass before any PR can be merged. The suite runs fully offline.

Copilot · 2026-04-15T20:48:57Z

+
+```bash
+cd evals
+npm install


The contributor instructions use npm install, while CI uses npm ci. Using npm ci locally (with the committed lockfile) better matches CI’s deterministic dependency resolution and reduces “works locally but not in CI” issues.

Suggested change

-npm install
+npm ci

Copilot AI and others added 2 commits April 15, 2026 20:35

feat: add CI workflow, CONTRIBUTING.md, CHANGELOG.md, and README badg…

4b66f8b

…es/demo
Agent-Logs-Url: https://github.com/Smithbox-ai/ControlFlow/sessions/4a8567c4-a4c2-41c3-91d1-be277236ebbd
Co-authored-by: Smithbox-ai <192372536+Smithbox-ai@users.noreply.github.com>

fix: restrict GITHUB_TOKEN permissions in CI workflow

0c4e416

Agent-Logs-Url: https://github.com/Smithbox-ai/ControlFlow/sessions/4a8567c4-a4c2-41c3-91d1-be277236ebbd
Co-authored-by: Smithbox-ai <192372536+Smithbox-ai@users.noreply.github.com>

Copilot AI assigned Copilot and Smithbox-ai Apr 15, 2026

Copilot created this pull request from a session on behalf of Smithbox-ai April 15, 2026 20:43

Smithbox-ai marked this pull request as ready for review April 15, 2026 20:45

Copilot AI review requested due to automatic review settings April 15, 2026 20:45

Copilot started reviewing on behalf of Smithbox-ai April 15, 2026 20:46

Copilot AI reviewed Apr 15, 2026


Smithbox-ai merged commit 91fc86d into master Apr 15, 2026
4 of 5 checks passed

Smithbox-ai deleted the copilot/evaluate-current-project-status branch April 15, 2026 20:50

Smithbox-ai merged 2 commits into master from copilot/evaluate-current-project-status
