fix: prompt hardening — security, negative rules, tone (research-backed) by kienbui1995 · Pull Request #59 · kienbui1995/mc-code

kienbui1995 · 2026-04-13T22:11:32Z

Research-backed prompt improvements

Sources: Augment Code 11 techniques, Claude Code leak analysis, 134K-star leaked prompts repo.

New sections

Security: prompt injection guardrail, no untrusted execution
What NOT to Do: 7 negative rules (Augment: 'telling model what NOT to do is safe and effective')
Output Format: confidence level, risks/side effects

Reordered

Cost Awareness + Error Recovery moved to END of static prompt (Augment: 'models pay more attention to beginning and especially end')

All 8 new rules

✅ Prompt injection detection
✅ No write_file for small edits
✅ No destructive commands without confirmation
✅ No modify tests unless asked
✅ No install deps silently
✅ No repeat failed approach
✅ State confidence level
✅ Mention risks/side effects

274 tests, 0 fail.

Summary by CodeRabbit

Chores
- Enhanced system prompt guidelines with improved security safeguards, including prompt-injection handling and credential protection
- Refined command execution requirements and output formatting standards

…tion reorder Based on research from Augment Code (11 techniques), Claude Code leak, and leaked prompts repo (134K stars): 1. Security section: prompt injection detection, no untrusted execution, no credential exposure 2. What NOT to Do: 7 negative rules (no write for small edits, no guess, no destructive commands, no modify tests, no silent deps, no repeat fails) 3. Output Format enhanced: confidence level, risks/side effects 4. Section reorder: Cost Awareness + Error Recovery moved to END (models pay most attention to beginning + end of prompt) 274 tests, 0 fail.

coderabbitai · 2026-04-13T22:11:48Z

📝 Walkthrough

Walkthrough

The system prompt in the CLI's main function was extended with new Security, "What NOT to Do", and improved Output Format sections. Duplicate guidance was removed, consolidating overlapping instructions into a single comprehensive prompt.

Changes

Cohort / File(s)	Summary
System Prompt Enhancement `mc/crates/mc-cli/src/main.rs`	Extended `build_system_prompt` with Security section (prompt-injection handling, confirmation before executing untrusted commands, credential protection), What NOT to Do section (tool constraints, test/dependency handling, failure recovery), and refined Output Format requirements. Removed duplicate/overlapping Output Format block and redundant "Be concise" guidance.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~5 minutes

Possibly related PRs

fix: system prompt — all 30 tools + cost awareness + debug/browser guidance #58: Modifies the same system prompt in mc/crates/mc-cli/src/main.rs, updating prompt sections and guidance content for the assistant.

Poem

🐰 A rabbit's refrain on prompts made right:

Guidelines clearer, safer in sight,
Security badges and "do NOT" advice,
Duplicate whispers silenced—so nice!
The AI assistant now knows the way,
To serve with wisdom, come what may. 🌟

🚥 Pre-merge checks | ✅ 1 | ❌ 2

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Description check	⚠️ Warning	The description covers the motivation (research-backed improvements), key implementation details (8 new rules, section reordering), and test results, but lacks the Checklist section required by the template.	Add the Checklist section from the template with items for cargo fmt, cargo test, cargo clippy, warnings check, and test coverage confirmation.
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (1 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately summarizes the main change: adding security hardening, negative rules, and tone improvements to the system prompt with research backing.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix/prompt-deep-research

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist

Code Review

This pull request updates the system prompt in mc-cli by adding new sections for Security, What NOT to Do, and Output Format, while reorganizing existing instructions. A review comment suggests consolidating the What NOT to Do section to remove redundancies with other parts of the prompt, which would help reduce token usage and improve clarity.

gemini-code-assist · 2026-04-13T22:12:42Z

mc/crates/mc-cli/src/main.rs

+         ## What NOT to Do\n\
+         - Do NOT use `write_file` to make small edits — use `edit_file` instead.\n\
+         - Do NOT read entire large files — use offset/limit in `read_file`.\n\
+         - Do NOT guess when requirements are unclear — use `ask_user`.\n\
+         - Do NOT run destructive commands (rm -rf, drop table) without user confirmation.\n\
+         - Do NOT modify test files unless explicitly asked.\n\
+         - Do NOT install new dependencies without mentioning it first.\n\
+         - Do NOT repeat a failed approach — try a different strategy.\n\n\


The 'What NOT to Do' section introduces several rules that are already covered in other sections, leading to significant redundancy. For example:

Line 1877 is redundant with lines 1863 and 1893.

Line 1878 is redundant with line 1895.

Line 1879 is redundant with line 1871.

Line 1883 is redundant with line 1897.

While negative constraints are useful, repeating the same instructions multiple times across different sections increases token usage and can lead to instruction fatigue for the model. Consider consolidating these into a single, clear instruction per topic. For instance, you could move the unique negative constraints (like destructive commands or test file modifications) here and keep the tool-specific ones in 'Tool Usage Guidelines'.

coderabbitai

🧹 Nitpick comments (1)

mc/crates/mc-cli/src/main.rs (1)

1881-1881: Soften the absolute “no test edits” rule to avoid blocking required fixes

Line 1881 is currently absolute; this can prevent necessary test updates when behavior changes are implemented. Consider allowing test edits when strictly required, with explicit justification.

✏️ Proposed wording tweak

-         - Do NOT modify test files unless explicitly asked.\n\
+         - Do NOT modify test files unless explicitly asked; if test changes are strictly required for correctness, do so and explain why.\n\

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@mc/crates/mc-cli/src/main.rs` at line 1881, Update the hardline prohibition
string "- Do NOT modify test files unless explicitly asked.\n" in main.rs to a
softer message that allows test edits when necessary; change the text to
indicate test modifications are permitted only with explicit justification and a
brief note explaining why the change is required (e.g., "Do not modify tests
unless strictly necessary — if a test must be updated, include an explicit
justification and link to the relevant issue/PR"). Ensure you update the exact
string literal where it's defined so help output and any related help/usage text
reflect the new, permissive-but-justified policy.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@mc/crates/mc-cli/src/main.rs`:
- Line 1881: Update the hardline prohibition string "- Do NOT modify test files
unless explicitly asked.\n" in main.rs to a softer message that allows test
edits when necessary; change the text to indicate test modifications are
permitted only with explicit justification and a brief note explaining why the
change is required (e.g., "Do not modify tests unless strictly necessary — if a
test must be updated, include an explicit justification and link to the relevant
issue/PR"). Ensure you update the exact string literal where it's defined so
help output and any related help/usage text reflect the new,
permissive-but-justified policy.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: b43cc99f-7d57-4b65-b6df-cb96e40e6756

📥 Commits

Reviewing files that changed from the base of the PR and between 135d92e and 19b46af.

📒 Files selected for processing (1)

mc/crates/mc-cli/src/main.rs

gemini-code-assist bot reviewed Apr 13, 2026

View reviewed changes

coderabbitai bot reviewed Apr 13, 2026

View reviewed changes

kienbui1995 merged commit 1d3b87e into main Apr 14, 2026
9 checks passed

kienbui1995 deleted the fix/prompt-deep-research branch April 14, 2026 01:42

coderabbitai bot mentioned this pull request Apr 15, 2026

feat: model-tier prompt profiles — adapt per model capability #68

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: prompt hardening — security, negative rules, tone (research-backed)#59

fix: prompt hardening — security, negative rules, tone (research-backed)#59
kienbui1995 merged 1 commit intomainfrom
fix/prompt-deep-research

kienbui1995 commented Apr 13, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Apr 13, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

❌ Failed checks (2 warnings)

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Apr 13, 2026

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

kienbui1995 commented Apr 13, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Research-backed prompt improvements

New sections

Reordered

All 8 new rules

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

❌ Failed checks (2 warnings)

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

kienbui1995 commented Apr 13, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Apr 13, 2026 •

edited

Loading