feat(extensions): add bundled bug triage workflow extension by mnriem · Pull Request #2871 · github/spec-kit

mnriem · 2026-06-05T16:35:42Z

Summary

Adds a bundled bug extension that provides a three-stage bug triage workflow any AI coding agent can drive:

speckit.bug.assess — read a bug report (pasted text or URL), judge whether it is a real bug, locate suspected code paths, and propose a remediation.
speckit.bug.fix — apply the proposed remediation and record exactly what changed.
speckit.bug.test — re-run the reproduction and any added tests, then record the verification result.

Each stage writes a Markdown report into a per-bug directory:

.specify/bugs/<slug>/
├── assessment.md   # speckit.bug.assess
├── fix.md          # speckit.bug.fix
└── test.md         # speckit.bug.test

Design

Slug

A slug is the per-bug directory name and the only handle the three commands share.

User-provided: any shape the user wants, normalized to lowercase kebab-case (e.g. login-timeout, cve-2026-001). No timestamps or numbers appended automatically.
Asked for: interactively, speckit.bug.assess asks for a slug when none is supplied, suggesting a kebab-case default derived from the bug summary.
Automated: when no human is available, the agent generates a slug itself, appending the shortest disambiguating suffix (-2, -3, …) or short date (-20260605) when needed. Existing directories are never overwritten.

Guardrails

speckit.bug.assess and speckit.bug.test never modify source code; they read the repository and write only inside .specify/bugs/<slug>/.
speckit.bug.fix is the only command that edits source code, scoped to the files listed in the assessment unless new evidence requires expanding scope (logged under Deviations from Assessment in fix.md).
No command overwrites an existing report file without explicit confirmation; in automated mode it refuses and picks a new unique slug.
Verification results are not over-claimed: reproductions that were not actually performed are reported as partial or not-run, never verified.

Changes

extensions/bug/extension.yml — manifest (id, version, three commands)
extensions/bug/commands/speckit.bug.{assess,fix,test}.md
extensions/bug/README.md
extensions/catalog.json — register bug (alphabetical, between agent-context and git)
pyproject.toml — wheel mapping to specify_cli/core_pack/extensions/bug

Mirrors the layout of the existing bundled extensions (extensions/git/, extensions/agent-context/). Uses the existing extension registration / skills pipeline — no new CLI commands or core machinery.

Validation

Manifest IDs, names, versions, and tags consistent across extension.yml and extensions/catalog.json.
Layout parity with existing bundled extensions verified.
No test changes required: bundled extensions are covered by the generic suites in tests/test_extensions.py and tests/test_extension_registration.py.

Posted on behalf of @mnriem by GitHub Copilot (model: Claude Opus 4.7).

) Add a bundled 'bug' extension providing a three-stage bug triage workflow: - speckit.bug.assess: triage a bug report (pasted text or URL), locate suspected code paths, and propose a remediation - speckit.bug.fix: apply the proposed remediation and record what changed - speckit.bug.test: validate the fix and record the verification result Each bug gets its own directory under .specify/bugs/<slug>/ with one Markdown report per stage (assessment.md, fix.md, test.md). The slug is the only handle the three commands share; existing bug directories are never overwritten. Mirrors the layout of the existing bundled extensions (git, agent-context): - extensions/bug/extension.yml, README.md, commands/ - extensions/catalog.json: register 'bug' (alphabetical, between agent-context and git) - pyproject.toml: add wheel mapping to specify_cli/core_pack/extensions/bug Closes github#2870

Copilot

Pull request overview

Adds a new bundled bug extension to Spec Kit that defines a three-stage bug triage workflow (assess → fix → test) with standardized per-bug Markdown reports written under .specify/bugs/<slug>/. This fits the existing bundled-extension model (manifest + command markdowns) and registers the extension in the bundled catalog and wheel packaging.

Changes:

Register bundled bug extension in extensions/catalog.json.
Add the bug extension manifest + command markdowns + README under extensions/bug/.
Include extensions/bug in wheel force-include mappings in pyproject.toml.

Show a summary per file

File	Description
pyproject.toml	Bundles the `extensions/bug` directory into the wheel.
extensions/catalog.json	Registers the new bundled `bug` extension metadata and tags.
extensions/bug/README.md	Documents the workflow, commands, slug conventions, and guardrails.
extensions/bug/extension.yml	Declares the extension manifest + the three provided commands.
extensions/bug/commands/speckit.bug.assess.md	Defines the assess-stage workflow and report contract.
extensions/bug/commands/speckit.bug.fix.md	Defines the fix-stage workflow and report contract.
extensions/bug/commands/speckit.bug.test.md	Defines the test-stage workflow and report contract.

Copilot's findings

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Files reviewed: 7/7 changed files
Comments generated: 3

- speckit.bug.assess.md: drop POSIX-specific 'mkdir -p' example; reword the prerequisite to describe the requirement (ensure BUG_DIR exists) without assuming a specific shell. - speckit.bug.fix.md: fix the slug-resolution fallback wording. It listed '.specify/bugs/*/assessment.md' but then keyed off whether 'exactly one bug directory' existed; now it correctly keys off whether exactly one matching 'assessment.md' was found and uses the slug from its parent directory. - tests/extensions/bug/test_bug_extension.py: add a smoke test analogous to the agent-context extension's coverage. Validates the bundled layout, catalog registration, '_locate_bundled_extension("bug")' resolution, and that 'ExtensionManager.install_from_directory' installs the three commands. All 333 tests in tests/extensions/, tests/test_extensions.py, and tests/test_extension_registration.py pass.

Copilot

Copilot's findings

Files reviewed: 8/9 changed files
Comments generated: 2

- Import _locate_bundled_extension from the public 'specify_cli' package (it is re-exported in __init__.py) instead of the private 'specify_cli._assets' module, so the test does not depend on internal module layout. - Clarify module docstring: install_from_directory is called with register_commands=False, so commands are copied and recorded in the installed manifest but not registered with AI agents. Wording updated to avoid implying otherwise.

Copilot

Copilot's findings

Files reviewed: 8/9 changed files
Comments generated: 4

- tests/extensions/bug/test_bug_extension.py: read extension.yml as UTF-8 explicitly to avoid platform-dependent default encoding (notably on Windows). Matches how the README is read in the same module. - extensions/bug/commands/speckit.bug.assess.md: add a 'Safety When Fetching URLs' section. Instructs the agent to treat fetched page content as untrusted input (no obeying embedded prompt-injection directives), forbids supplying credentials/secrets that a page asks for, scopes the fetch to the URL the user provided (no following redirects to other resources), and requires suspicious content to be quoted verbatim under an 'Unverified' heading rather than acted on. - extensions/catalog.json: bump 'updated_at' to today (2026-06-05) so consumers that cache by this field invalidate when 'bug' is added. - extensions/bug/README.md: minor grammar fix ('a reproduction that was not actually performed'). All 251 tests in tests/extensions/bug/, tests/test_extensions.py, and tests/test_extension_registration.py pass.

Builds on the 'Safety When Fetching URLs' section by adding a tiered classification rule the agent applies before any fetch: 1. Refuse outright (no fetch, no prompt) for non-http(s) schemes, loopback, link-local, RFC1918 private space, and known cloud instance-metadata endpoints (169.254.169.254, metadata.google.internal, 100.100.100.200, metadata.azure.com). This closes the SSRF / internal-recon vector opened by 'paste any URL'. 2. Fetch silently for an explicit allowlist of widely-used public bug-report sources (github, gitlab, bitbucket, atlassian.net, linear, stackoverflow/stackexchange, sentry). This preserves the paste-a-URL ergonomics the workflow is built for. 3. Otherwise prompt once in interactive mode (default 'no', naming the resolved host explicitly); in automated mode skip the fetch and record '[UNVERIFIED - fetch skipped: host not on safe list: <host>]' in assessment.md so a human can decide later. In every case, assessment.md records the verbatim URL, the resolved host, and which branch of the policy was taken (allowlisted / confirmed-by-user / auto-refused: <reason>) so the per-bug directory's audit trail is complete. Preflight HEAD probes are explicitly forbidden since the probe itself is the request the policy gates. Execution step 1 now defers to the policy before fetching.

Copilot

Copilot's findings

Files reviewed: 8/9 changed files
Comments generated: 1

The URL Trust Policy explicitly forbids following redirects, but the audit-trail bullet asked the agent to record the host 'post-redirect-resolution', which contradicted that rule and could lead agents to follow redirects unintentionally to determine what to log. Reword both call sites to refer to the host parsed from the URL the user supplied (no resolution implied): - Tier-3 interactive prompt: '...naming the host parsed from the URL explicitly...' - Recorded fields: 'The host parsed from that URL (no redirect following - see the rule above).' No behavior change; clarification only.

Copilot

Copilot's findings

Files reviewed: 8/9 changed files
Comments generated: 0 new

Copilot AI review requested due to automatic review settings June 5, 2026 16:35

Copilot started reviewing on behalf of mnriem June 5, 2026 16:35 View session

Copilot AI reviewed Jun 5, 2026

View reviewed changes

Comment thread extensions/bug/commands/speckit.bug.assess.md

Comment thread extensions/bug/commands/speckit.bug.fix.md

Comment thread extensions/bug/extension.yml

mnriem mentioned this pull request Jun 5, 2026

Add Post-Implementation Debugging and Fixing Workflow #442

Closed

mnriem requested a review from Copilot June 5, 2026 16:50

Copilot started reviewing on behalf of mnriem June 5, 2026 16:50 View session

Copilot AI reviewed Jun 5, 2026

View reviewed changes

Comment thread tests/extensions/bug/test_bug_extension.py Outdated

Comment thread tests/extensions/bug/test_bug_extension.py

mnriem requested a review from Copilot June 5, 2026 16:58

Copilot started reviewing on behalf of mnriem June 5, 2026 16:59 View session

Copilot AI reviewed Jun 5, 2026

View reviewed changes

Comment thread tests/extensions/bug/test_bug_extension.py

Comment thread extensions/bug/commands/speckit.bug.assess.md

Comment thread extensions/catalog.json

Comment thread extensions/bug/README.md Outdated

mnriem added 2 commits June 5, 2026 12:05

Copilot AI review requested due to automatic review settings June 5, 2026 17:08

Copilot started reviewing on behalf of mnriem June 5, 2026 17:08 View session

mnriem requested review from Copilot and removed request for Copilot June 5, 2026 17:21

Copilot started reviewing on behalf of mnriem June 5, 2026 17:21 View session

Copilot AI reviewed Jun 5, 2026

View reviewed changes

Comment thread extensions/bug/commands/speckit.bug.assess.md

mnriem requested a review from Copilot June 5, 2026 17:32

Copilot started reviewing on behalf of mnriem June 5, 2026 17:32 View session

Copilot AI reviewed Jun 5, 2026

View reviewed changes

mnriem merged commit 60302fe into github:main Jun 5, 2026
11 checks passed

mnriem deleted the feat/2870-bug-extension branch June 5, 2026 17:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(extensions): add bundled bug triage workflow extension#2871

feat(extensions): add bundled bug triage workflow extension#2871
mnriem merged 6 commits into
github:mainfrom
mnriem:feat/2870-bug-extension

mnriem commented Jun 5, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mnriem commented Jun 5, 2026

Summary

Design

Slug

Guardrails

Changes

Validation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Copilot's findings

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Copilot's findings

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Copilot's findings

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Copilot's findings

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants