docs(claude-md): trim root, move eval suite to evals/AGENTS.md by caio-pizzol · Pull Request #3288 · superdoc-dev/superdoc

caio-pizzol · 2026-05-14T11:32:48Z

The root CLAUDE.md had grown to 316 lines. An exploratory benchmark plus a static audit pointed at a smaller load-bearing core, so this PR trims it to 85 lines and moves the eval-suite content to evals/AGENTS.md.

Kept in root: rendering pipeline, compressed placement table, direction-aware mirroring rule, style resolution boundary with its why, document-api contract-first pattern, core commands, and testing matrix.
Cut or moved: project structure tree (derivable from ls), old "Where to Look" and "When to Modify" sections (subsumed by the placement table), per-test-suite detail (already in tests/behavior/AGENTS.md and tests/visual/AGENTS.md), and AI Eval Suite detail (now evals/AGENTS.md, with CLAUDE.md symlinked for Claude compatibility).
Behind this: a small variant benchmark (baseline / lean / lean+scoped-rules / organic bloat) measured two SuperDoc tasks: adding a doc-api operation and refactoring an RTL paragraph feature. The lean root matched baseline correctness in the sample at roughly 17% lower cost on the complex doc-api task, with 0% architecture-violation rate in the final scored sample. N=3 per cell, Claude Sonnet only, grep-based correctness. Directional, not statistically conclusive.
Deferred to keep this PR small: brand/design guidance to package-level docs, JSDoc/generated/comments rules to .claude/rules/ with path scoping, and a recurring audit workflow.

Root CLAUDE.md goes from 316 to 85 lines. Net -264 / +33. Keeps the load-bearing parts: Architecture: Rendering with the placement table compressed to 7 rows, direction-aware mirroring rule, one boundary-check rg command, Style Resolution Boundary with the 'why', Document API Contract, Commands, Testing matrix. Cuts content that's either derivable, redundant with the placement table, or has a nested home: Project Structure tree (ls works), the old 'Where to Look' table (subsumed), 'When to Modify Which System' (subsumed), per-test-suite detail (already in tests/ behavior/AGENTS.md and tests/visual/AGENTS.md). Eval suite (~60 lines) moves intact to a new evals/AGENTS.md so the documentation isn't dropped; evals/CLAUDE.md symlinks to it for cross-agent compatibility. Brand, comments, JSDoc types, and generated-artifact tables are move candidates for follow-up PRs (path-scoped rules under .claude/rules/ or per-package AGENTS.md). Not in this PR to keep it small. Background: based on an N=3 exploratory benchmark + static audit comparing variants of the root CLAUDE.md (baseline / lean / lean+scoped-rules / organic bloat) against two SuperDoc tasks (add doc-api operation, refactor RTL paragraph feature module). Lean root matched baseline correctness in the final scored sample at ~17% lower cost on the complex doc-api task; architecture- violation rate was 0% in the final scored sample. C-lean-rules had one incomplete-implementation run that flagged a real failure mode (operation not wired into public API). N=3 per cell, Claude Sonnet only, grep-based correctness. Directional only.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5fb0d3523e

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

… any cwd Per PR review: eval scripts live in evals/package.json, not the root, so bare 'pnpm run eval'/'eval:e2e'/'view' fail with ERR_PNPM_NO_SCRIPT from the repo root. Switched all references to 'pnpm --filter @superdoc-testing/evals run ...' form. Bare-form usage from inside evals/ called out separately for operators already cd'd in.

caio-pizzol requested a review from a team as a code owner May 14, 2026 11:32

superdoc-bot Bot added the review: quick label May 14, 2026

tupizz approved these changes May 14, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed May 14, 2026

View reviewed changes

Comment thread evals/AGENTS.md Outdated

caio-pizzol force-pushed the caio/agent-docs-b-prime-trim branch from 5fb0d35 to d11b227 Compare May 14, 2026 11:34

caio-pizzol merged commit 1173432 into main May 14, 2026
11 checks passed

caio-pizzol deleted the caio/agent-docs-b-prime-trim branch May 14, 2026 11:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(claude-md): trim root, move eval suite to evals/AGENTS.md#3288

docs(claude-md): trim root, move eval suite to evals/AGENTS.md#3288
caio-pizzol merged 2 commits into
mainfrom
caio/agent-docs-b-prime-trim

caio-pizzol commented May 14, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

caio-pizzol commented May 14, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants