Test Planning: Usability and report output updates#43
Merged
Conversation
…on public-API tests Rework the /test-planning output so it reads as a plain-language overview first instead of a TP-item dump. The template now leads with a Summary, a What Needs Testing and Why themes section, and a What Each Test Covers walkthrough, then drops the per-item Test Plan, Deferred, Dropped, Coverage counts, and Scope into a labeled Technical Reference region below the spine. Add a Step 5 review pass that dispatches information-architect and junior-developer in parallel against the generated plan to confirm it leads with plain language and the plain-language layer stands on its own. Add a behavioral-depth operating principle: every recommended test verifies observable behavior at a public seam (caller inputs, observed outputs and side effects, collaborator interactions), never private methods or internal state, and stops at the critical behaviors a caller depends on rather than over-specifying every branch. Thread it through the agent dispatch prompts, add a behavioral sweep to the merge step, and sync the long-form operator doc.
7bd9374 to
d571966
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
This PR reworks the
/test-planningskill so its output leads with plain-language behavior and recommends only public-API behavioral tests, so that a reader without code context can grasp the plan in thirty seconds and the recommended tests survive refactors.## Technical Referenceregion below.Behavior changes
When you run
/test-planningtoday, the resulting test plan opens with a scope table and per-item technical detail (each test tagged with a priority tier, code paths, approach, and justification). After this PR, the same skill produces a plan that opens with plain-language sections written for a reader who has not seen the code: a summary paragraph, the work grouped into 2-4 themes, and one plain-language line per test led by its stable test ID. The old technical sections still exist but now live under a## Technical Referenceregion at the bottom, with the scope table moved from first to last.Two further behavioral shifts: the skill now runs a fifth step that asks two reviewer sub-agents to read the generated plan and flag structure or comprehension problems (since the plan lives in the chat rather than a file, the full plan text is embedded directly in each reviewer's prompt). And the tests it recommends are now constrained to observable behavior at a public seam (caller inputs, outputs, side effects, and interactions with collaborators), with internal-implementation recommendations rewritten to go through the public boundary, dropped if no public seam exposes them, or collapsed into the one behavioral test that catches the same failures.
What to look at first
Files of interest
han.core/skills/test-planning/references/template.md— the output template restructure; this is where the plain-language-first spine and the demoted Technical Reference region are defined.han.core/skills/test-planning/SKILL.md— the skill definition carrying the new public-API operating principle, the Step 5 review pass, and the behavioral-sweep merge step.docs/skills/test-planning.md— the operator-facing doc synced to describe all three changes and the now five-step process.