Skip to content

feat: add CI workflow, CONTRIBUTING.md, CHANGELOG.md, and README badges/demo#1

Merged
Smithbox-ai merged 2 commits intomasterfrom
copilot/evaluate-current-project-status
Apr 15, 2026
Merged

feat: add CI workflow, CONTRIBUTING.md, CHANGELOG.md, and README badges/demo#1
Smithbox-ai merged 2 commits intomasterfrom
copilot/evaluate-current-project-status

Conversation

Copy link
Copy Markdown
Contributor

Copilot AI commented Apr 15, 2026

Repository lacked CI, contributor guidance, release history, and a compelling first impression — all friction points that reduce community traction.

Changes

.github/workflows/ci.yml

  • Runs cd evals && npm test on every push/PR to master
  • permissions: contents: read (least-privilege GITHUB_TOKEN)
  • Unlocks a live CI status badge

README.md

  • Four badges at the top: CI status, agent count (13), eval checks (302), MIT license
  • New "How It Works" section leads with the 3-step user workflow (@Planner → approve → @Orchestrator) instead of architecture prose — reduces time-to-value for new visitors

CONTRIBUTING.md

  • Adding a new agent: P.A.R.T structure, schema contract, eval scenarios, governance registration
  • Editing existing agents: regression testing expectations, contract/schema update rules
  • Adding skills, proposing changes via Issues/PRs

CHANGELOG.md

  • v1.0.0 entry covering all 13 agents, architecture features, governance contracts, skill library, and eval suite

Copilot AI and others added 2 commits April 15, 2026 20:35
@Smithbox-ai Smithbox-ai marked this pull request as ready for review April 15, 2026 20:45
Copilot AI review requested due to automatic review settings April 15, 2026 20:45
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds project “community traction” essentials (CI + contributor/release docs) and improves README first-impression by surfacing status badges and a quick-start workflow explanation.

Changes:

  • Add GitHub Actions CI workflow to run cd evals && npm test on pushes/PRs.
  • Update README with badges and a new “How It Works” section emphasizing the 3-step workflow.
  • Add new CONTRIBUTING.md and CHANGELOG.md with guidance and a v1.0.0 entry.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 5 comments.

File Description
.github/workflows/ci.yml Introduces CI job that installs eval deps and runs eval test suite.
README.md Adds badges and a concise “How It Works” onboarding section.
CONTRIBUTING.md Documents contribution workflows (running evals, adding/editing agents, skills, PR expectations).
CHANGELOG.md Adds Keep-a-Changelog formatted v1.0.0 entry capturing system capabilities.

Comment thread CHANGELOG.md
Comment on lines +53 to +62
**Eval suite (302 checks)**

- Pass 1: Schema validity (Ajv strict mode, JSON Schema 2020-12)
- Pass 2–3: Scenario integrity and cross-scenario structural regression (179 structural checks)
- Pass 4: P.A.R.T section order enforcement
- Pass 4b: Clarification trigger and tool routing section validation
- Pass 5: Skill library registration integrity
- Pass 6: Synthetic rename negative-path checks
- Pass 7: Prompt behavior contract behavioral regression (74 checks across 9 agents)
- Pass 8: Orchestration handoff contract regression (49 checks)
Copy link

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changelog claims the eval suite has “302 checks” with a specific breakdown, but evals/README.md currently documents a different total. Consider either aligning these numbers with the authoritative source or avoiding fixed counts in the changelog to prevent staleness.

Suggested change
**Eval suite (302 checks)**
- Pass 1: Schema validity (Ajv strict mode, JSON Schema 2020-12)
- Pass 2–3: Scenario integrity and cross-scenario structural regression (179 structural checks)
- Pass 4: P.A.R.T section order enforcement
- Pass 4b: Clarification trigger and tool routing section validation
- Pass 5: Skill library registration integrity
- Pass 6: Synthetic rename negative-path checks
- Pass 7: Prompt behavior contract behavioral regression (74 checks across 9 agents)
- Pass 8: Orchestration handoff contract regression (49 checks)
**Eval suite**
- Pass 1: Schema validity (Ajv strict mode, JSON Schema 2020-12)
- Pass 2–3: Scenario integrity and cross-scenario structural regression
- Pass 4: P.A.R.T section order enforcement
- Pass 4b: Clarification trigger and tool routing section validation
- Pass 5: Skill library registration integrity
- Pass 6: Synthetic rename negative-path checks
- Pass 7: Prompt behavior contract behavioral regression across agent prompts
- Pass 8: Orchestration handoff contract regression

Copilot uses AI. Check for mistakes.
Comment thread .github/workflows/ci.yml
Comment on lines +5 to +7
branches: ["master"]
pull_request:
branches: ["master"]
Copy link

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The workflow is only triggered for the master branch. The repo/diff context suggests the default branch may be main, in which case CI won’t run for pushes/PRs to the default branch and the README CI badge may show no results. Consider triggering on both main and master, or updating the branch filter to match the repo’s actual default branch.

Suggested change
branches: ["master"]
pull_request:
branches: ["master"]
branches: ["main", "master"]
pull_request:
branches: ["main", "master"]

Copilot uses AI. Check for mistakes.
Comment thread README.md

[![CI](https://github.com/Smithbox-ai/ControlFlow/actions/workflows/ci.yml/badge.svg)](https://github.com/Smithbox-ai/ControlFlow/actions/workflows/ci.yml)
![Agents](https://img.shields.io/badge/agents-13-blue)
![Eval Checks](https://img.shields.io/badge/eval%20checks-302-brightgreen)
Copy link

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The “Eval Checks” badge hardcodes 302, but the repository’s evals/README.md currently states a different total (283). Hardcoding the number risks the badge and docs drifting out of sync; consider removing the fixed count or deriving it from a single authoritative source and updating all references together.

Suggested change
![Eval Checks](https://img.shields.io/badge/eval%20checks-302-brightgreen)
![Eval Checks](https://img.shields.io/badge/eval%20checks-passing-brightgreen)

Copilot uses AI. Check for mistakes.
Comment thread CONTRIBUTING.md
npm test
```

All 302 checks must pass before any PR can be merged. The suite runs fully offline.
Copy link

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This section states “All 302 checks must pass…”, but evals/README.md currently documents a different total (283). To avoid documentation drift, consider removing the hardcoded number (e.g., “All eval checks must pass”) or updating both files to match the authoritative count.

Suggested change
All 302 checks must pass before any PR can be merged. The suite runs fully offline.
All eval checks must pass before any PR can be merged. The suite runs fully offline.

Copilot uses AI. Check for mistakes.
Comment thread CONTRIBUTING.md

```bash
cd evals
npm install
Copy link

Copilot AI Apr 15, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The contributor instructions use npm install, while CI uses npm ci. Using npm ci locally (with the committed lockfile) better matches CI’s deterministic dependency resolution and reduces “works locally but not in CI” issues.

Suggested change
npm install
npm ci

Copilot uses AI. Check for mistakes.
@Smithbox-ai Smithbox-ai merged commit 91fc86d into master Apr 15, 2026
4 of 5 checks passed
@Smithbox-ai Smithbox-ai deleted the copilot/evaluate-current-project-status branch April 15, 2026 20:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants