docs(pipeline): teach LLM to avoid 7 patterns audit found across last 10 posts#323
Conversation
… 10 posts After driving PR #322 score 52→est. 22-28 across 3 cold-pass sweeps, an empirical audit on the 10 most recent posts surfaced patterns the new pipeline (PR #319) doesn't catch. Add them as TEACHING (not gates). Audit findings (10 posts, ~22k words): - Long paragraphs >3 sentences in 39-58% of TDD/XP cluster (cap unenforced) - Anaphora pairs (same first-word sentences in same paragraph) in 10/10 posts - "The..." paragraph-opener density at 18-33% in 6/10 posts (target ≤20%) - "We..." paragraph-opener stamping in TDD/XP cluster - Mannerism-noun saturation: 9-18 per post (cap is 3 combined) - The post's own architectural labels become the tell once stamped repeatedly (`contract`, `gate`, `gatekeeper`, `layer`, `round`, `review`, `pass` joined the original mannerism set) - Architecture enumeration in body restating the H2 structure - Formulaic founder-anecdote hook (Priya/Marcus/Sarah opener) in last 5 founder-pain posts - became its own template Voice guide additions (new rows in Banned structural patterns table): - Anaphora pairs - read aloud, no two sentences per paragraph share first word - "The..." opener density - cap 20% of paragraphs - "We..." opener density - cap 15% of paragraphs - Long paragraphs (cap break) - 3 sentences AND 70 words working cap - Architecture enumeration - let H2s carry structure, don't list pillars in body - Formulaic founder-anecdote hook - the recurring shape is now the tell Mannerism-noun row extended: - Cap raised from 3 to 5 (acknowledges some posts genuinely need labels) - New mannerisms added to the watch list: cadence, rhythm, contract, gate, gatekeeper, layer, round, review, pass Writer prompt additions (STEP 4b cadence quotas): - Paragraph cap: 3 sentences AND 70 words (was just 3 sentences) - Anaphora rule - Opener variety (both "The..." and "We..." capped) - Mannerism cap raised to 5, watch list extended - Architecture-enumeration ban - Hook variety guidance Writer prompt additions (STEP 4c BAD/GOOD pairs): - Anaphora pair rewrite - Architecture enumeration rewrite - Long-paragraph split example - Mannerism-noun saturation rewrite - Formulaic founder-anecdote hook with alternative opener shape - "We" opener stamping with rewrite These are TEACHING additions (concrete BAD/GOOD pairs, surface-feature descriptions). No tool recommendations, no programmatic gates. The LLM learns from the patterns, not from a CI script. bin/hugo-build: passes (docs-only). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
|
Warning Rate limit exceeded
To keep reviews running without waiting, you can enable usage-based add-on for your organization. This allows additional reviews beyond the hourly cap. Account admins can enable it under billing. ⌛ How to resolve this issue?After the wait time has elapsed, a review can be triggered using the We recommend that you space out your commits to avoid hitting the rate limit. 🚦 How do rate limits work?CodeRabbit enforces hourly rate limits for each developer per organization. Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout. Please see our FAQ for further information. ℹ️ Review info⚙️ Run configurationConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Pro Run ID: 📒 Files selected for processing (2)
✨ Finishing Touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Review rate limit: 0/1 reviews remaining, refill in 15 minutes and 17 seconds.Comment |
Summary
After driving PR #322's AI-detector score 52 → est. 22-28 across 3 cold-pass sweeps, an empirical audit on the 10 most recent posts surfaced patterns the new pipeline (PR #319) didn't catch. This PR adds those patterns to the voice guide and writer prompt as teaching (concrete BAD/GOOD pairs, surface-feature descriptions). No tool recommendations, no programmatic gates - the LLM learns from the patterns.
What the audit found (last 10 posts, ~22k words)
The gap: pipeline catches lexical/sentence-level tells; misses paragraph-level and post-level structure tells.
What changed
docs/90-99-content-strategy/strategy-analysis/90.11-voice-guide.mdAdded 6 new rows to the "Banned structural patterns" table:
Extended the existing Mannerism-noun refrain row:
cadence,rhythm,contract,gate,gatekeeper,layer,round,review,passdocs/workflows/blog-pipeline.mdAdded to STEP 4b (cadence quotas):
Added to STEP 4c (BAD/GOOD pairs):
Approach: TEACHING, not gating
Per request: no tool recommendations (no Pangram, GPTZero, grep scripts), no programmatic gates. The LLM reads the patterns + BAD/GOOD pairs and learns what good writing looks like. The cold-eyes Agent J (already in pipeline) catches what the writer misses.
Test plan
bin/hugo-build— passes🤖 Generated with Claude Code