🤖 fix: clarify best-of-n prompt guidance by ammar-agent · Pull Request #2949 · coder/mux

ammar-agent · 2026-03-14T16:29:58Z

Summary

Add explicit system-prompt guidance that a user request for best-of-n work should be interpreted as a request to use the task tool's n parameter with suitable sub-agents, and tighten the surrounding test guidance so we do not keep prompt-copy assertions around.

Background

The task tool description already explains how best-of-n spawning works, but the shared prelude did not directly tell the model how to map a plain-English "best of n" request onto that mechanism. This follow-up also removes tautological tests that only mirrored static prompt prose and adds a stronger AGENTS rule against that pattern.

Implementation

add a <best-of-n> section to the shared system prompt prelude in src/node/services/systemMessage.ts
regenerate docs/agents/system-prompt.mdx
remove tautological prelude string assertions from src/node/services/systemMessage.test.ts
strengthen the testing guidance in docs/AGENTS.md

Validation

bun test src/node/services/systemMessage.test.ts
make static-check

Risks

Low: the production behavior change is still limited to prompt guidance, and the rest of the diff removes brittle tests plus adds repo guidance.

Generated with mux • Model: openai:gpt-5.4 • Thinking: xhigh • Cost: n/a

ammar-agent · 2026-03-14T16:30:10Z

@codex review

chatgpt-codex-connector · 2026-03-14T16:32:59Z

Codex Review: Didn't find any major issues. 🎉

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Add a dedicated best-of-n section to the shared system prompt prelude so plain-English best-of-n requests map to the task tool's n parameter with suitable sub-agents. Also remove tautological prompt-copy assertions from systemMessage.test.ts and strengthen AGENTS guidance so tests focus on behavior instead of mirroring static strings. --- _Generated with `mux` • Model: `openai:gpt-5.4` • Thinking: `xhigh` • Cost: `3.12`_

ammar-agent · 2026-03-14T16:43:22Z

@codex review

chatgpt-codex-connector · 2026-03-14T16:45:57Z

Codex Review: Didn't find any major issues. Another round soon, please!

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

mintlify Bot deployed to staging - docs March 14, 2026 16:30 View deployment

ammar-agent force-pushed the fix/best-of-n-task-guidance branch from f9fd100 to 213cdcb Compare March 14, 2026 16:43

ammar-agent changed the title ~~🤖 fix: clarify best-of-n system prompt guidance~~ 🤖 fix: clarify best-of-n prompt guidance Mar 14, 2026

mintlify Bot deployed to staging - docs March 14, 2026 16:43 View deployment

ammario approved these changes Mar 14, 2026

View reviewed changes

ammario merged commit 44d8d9e into main Mar 14, 2026
24 checks passed

ammario deleted the fix/best-of-n-task-guidance branch March 14, 2026 19:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🤖 fix: clarify best-of-n prompt guidance#2949

🤖 fix: clarify best-of-n prompt guidance#2949
ammario merged 1 commit into
mainfrom
fix/best-of-n-task-guidance

ammar-agent commented Mar 14, 2026 •

edited

Loading

Uh oh!

ammar-agent commented Mar 14, 2026

Uh oh!

chatgpt-codex-connector Bot commented Mar 14, 2026

Uh oh!

ammar-agent commented Mar 14, 2026

Uh oh!

chatgpt-codex-connector Bot commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ammar-agent commented Mar 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Background

Implementation

Validation

Risks

Uh oh!

ammar-agent commented Mar 14, 2026

Uh oh!

chatgpt-codex-connector Bot commented Mar 14, 2026

Uh oh!

ammar-agent commented Mar 14, 2026

Uh oh!

chatgpt-codex-connector Bot commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ammar-agent commented Mar 14, 2026 •

edited

Loading