Skip to content

docs(claude-md): trim root, move eval suite to evals/AGENTS.md#3288

Merged
caio-pizzol merged 2 commits into
mainfrom
caio/agent-docs-b-prime-trim
May 14, 2026
Merged

docs(claude-md): trim root, move eval suite to evals/AGENTS.md#3288
caio-pizzol merged 2 commits into
mainfrom
caio/agent-docs-b-prime-trim

Conversation

@caio-pizzol
Copy link
Copy Markdown
Contributor

The root CLAUDE.md had grown to 316 lines. An exploratory benchmark plus a static audit pointed at a smaller load-bearing core, so this PR trims it to 85 lines and moves the eval-suite content to evals/AGENTS.md.

  • Kept in root: rendering pipeline, compressed placement table, direction-aware mirroring rule, style resolution boundary with its why, document-api contract-first pattern, core commands, and testing matrix.
  • Cut or moved: project structure tree (derivable from ls), old "Where to Look" and "When to Modify" sections (subsumed by the placement table), per-test-suite detail (already in tests/behavior/AGENTS.md and tests/visual/AGENTS.md), and AI Eval Suite detail (now evals/AGENTS.md, with CLAUDE.md symlinked for Claude compatibility).
  • Behind this: a small variant benchmark (baseline / lean / lean+scoped-rules / organic bloat) measured two SuperDoc tasks: adding a doc-api operation and refactoring an RTL paragraph feature. The lean root matched baseline correctness in the sample at roughly 17% lower cost on the complex doc-api task, with 0% architecture-violation rate in the final scored sample. N=3 per cell, Claude Sonnet only, grep-based correctness. Directional, not statistically conclusive.
  • Deferred to keep this PR small: brand/design guidance to package-level docs, JSDoc/generated/comments rules to .claude/rules/ with path scoping, and a recurring audit workflow.

@caio-pizzol caio-pizzol requested a review from a team as a code owner May 14, 2026 11:32
Root CLAUDE.md goes from 316 to 85 lines. Net -264 / +33.

Keeps the load-bearing parts: Architecture: Rendering with the
placement table compressed to 7 rows, direction-aware mirroring
rule, one boundary-check rg command, Style Resolution Boundary
with the 'why', Document API Contract, Commands, Testing matrix.

Cuts content that's either derivable, redundant with the placement
table, or has a nested home: Project Structure tree (ls works),
the old 'Where to Look' table (subsumed), 'When to Modify Which
System' (subsumed), per-test-suite detail (already in tests/
behavior/AGENTS.md and tests/visual/AGENTS.md).

Eval suite (~60 lines) moves intact to a new evals/AGENTS.md so
the documentation isn't dropped; evals/CLAUDE.md symlinks to it
for cross-agent compatibility.

Brand, comments, JSDoc types, and generated-artifact tables are
move candidates for follow-up PRs (path-scoped rules under
.claude/rules/ or per-package AGENTS.md). Not in this PR to keep
it small.

Background: based on an N=3 exploratory benchmark + static audit
comparing variants of the root CLAUDE.md (baseline / lean /
lean+scoped-rules / organic bloat) against two SuperDoc tasks
(add doc-api operation, refactor RTL paragraph feature module).
Lean root matched baseline correctness in the final scored sample
at ~17% lower cost on the complex doc-api task; architecture-
violation rate was 0% in the final scored sample. C-lean-rules
had one incomplete-implementation run that flagged a real failure
mode (operation not wired into public API). N=3 per cell, Claude
Sonnet only, grep-based correctness. Directional only.
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5fb0d3523e

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Comment thread evals/AGENTS.md Outdated
@caio-pizzol caio-pizzol force-pushed the caio/agent-docs-b-prime-trim branch from 5fb0d35 to d11b227 Compare May 14, 2026 11:34
… any cwd

Per PR review: eval scripts live in evals/package.json, not the
root, so bare 'pnpm run eval'/'eval:e2e'/'view' fail with
ERR_PNPM_NO_SCRIPT from the repo root. Switched all references
to 'pnpm --filter @superdoc-testing/evals run ...' form.
Bare-form usage from inside evals/ called out separately for
operators already cd'd in.
@caio-pizzol caio-pizzol merged commit 1173432 into main May 14, 2026
11 checks passed
@caio-pizzol caio-pizzol deleted the caio/agent-docs-b-prime-trim branch May 14, 2026 11:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants