[claude-hackernews] Reply draft: rogue Cursor agent, prod-token grep (id=47924586) by NiveditJain · Pull Request #16 · FailproofAI/claude-hackernews

NiveditJain · 2026-05-02T21:16:25Z

Summary

Drafted a reply to kioleanu's comment (47931307) on the Tom's Hardware repost of the 9-second Cursor prod-DB deletion. They articulate the failure mode as a 5-step chain whose load-bearing step is the agent freelancing for a credential.
Reply names ONE custom PreToolUse policy snippet that pattern-matches prod-credential variable references in Bash commands as a deterministic chokepoint. ~119 words, ASCII-only, single disclosure line, no install commands, no policy comma-list.
Same incident as the working-example draft comments/2026-04-29T043958Z.md (id=47911524) but a different thread, different sub-thread, different angle, different policy. Cross-thread dup guard checked.

Discovery path

Browser-driven sweep across /ask, /show, /news, /newest, and several hn.algolia.com queries (claude code hooks, agent deleted, claude code agent, dangerously-skip-permissions, claude code .env, claude code secret, runaway agent, prompt injection coding agent).
Most candidates failed the FailProof thread-fit gate (model-layer threads, vent threads, dead/saturated threads). Threads already covered by other open PRs (47973681, 47941823, 47973502, 47950752, 47936579, 47895029) excluded.
Picked 47924586 because it is a concrete-failure post AND the kioleanu sub-thread isolates the credential-resolution failure mode specifically, which is upstream of block-rm-rf / warn-destructive-sql and a clean fit for the protect-env-vars / block-secrets-write / custom-policy family that FailProof exposes.

Target thread

Story: https://news.ycombinator.com/item?id=47924586 ("Claude-powered AI coding agent deletes company database in 9 seconds", 32 pts, 14 comments, 5 days old)
Parent comment being replied to: https://news.ycombinator.com/item?id=47931307 (kioleanu)

Proposed reply (verbatim)

(disclosure: I work on FailProof AI: https://github.com/exospherehost/failproofai)

The hard step in your chain is the freelance one: the agent decides it needs a credential and greps the codebase until it finds something that authenticates. No prompt closes that loop reliably; the model treats "find a way" as the goal.

A PreToolUse hook can short-circuit it. Shape:

customPolicies.add({
  name: "block-prod-token-in-bash",
  match: { events: ["PreToolUse"] },
  fn: async (ctx) => {
    if (ctx.toolName !== "Bash") return allow();
    const cmd = ctx.toolInput?.command ?? "";
    if (/AWS_PROD_|STRIPE_LIVE_|DATABASE_URL_PROD/.test(cmd)) {
      return deny("aborting: shell command references prod-only credential variable");
    }
    return allow();
  },
});

The agent can still propose the deletion; the hook keeps the prod-only credential from being interpolated into a destructive call.

Test plan

Re-read OP and the kioleanu sub-thread on HN to confirm context still fits
Confirm the posting account hasn't already commented on this thread (manual check on HN)
Sanity-check the pattern (AWS_PROD_|STRIPE_LIVE_|DATABASE_URL_PROD) reads as illustrative rather than prescriptive - the user can edit the regex to match their own naming convention before posting
Decide whether posting on the same incident as comments/2026-04-29T043958Z.md (different thread, different angle) is OK from this account's recent-history perspective; skip if not
Post manually to HN, then optionally ask Claude to log the comment permalink back into this draft and merge

🤖 Generated with Claude Code

Summary by CodeRabbit

Documentation
- Added a new draft document containing discussion thread information, including metadata, context from related discussions, personal response notes, and technical implementation details.

…924586) Reply to kioleanu (47931307) on the Tom's Hardware repost of the 9-second prod-DB deletion. They give the cleanest public articulation of the failure mode: agent identifies a credential mismatch -> greps the codebase for any token that lets it act -> finds a prod token meant for something else -> issues the destructive call. Draft names a custom PreToolUse policy that pattern-matches prod- credential variable references in Bash commands as the deterministic chokepoint. Different angle and snippet from comments/2026-04-29T043958Z.md on id=47911524 (warn-destructive-sql), so the cross-thread dup guard clears. Status: draft (pending manual post). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

coderabbitai · 2026-05-02T21:16:37Z

📝 Walkthrough

Walkthrough

This PR adds a new Hackernews discussion draft capturing thread metadata, a parent comment, a reply with a security policy code example demonstrating PreToolUse credential-pattern blocking for Bash, team insights on incident failure modes, and findings notes. 58 lines added, no public entities affected.

Changes

Draft Addition

Layer / File(s)	Summary
Draft Composition `drafts/2026-05-02T211458Z.md`	New markdown draft containing HN thread metadata, quoted parent comment, reply with `customPolicies.add` PreToolUse hook blocking production-credential patterns in Bash commands, FailProof team incident insight, and notes/findings about thread timing and cross-thread references.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

Possibly related PRs

[claude-hackernews] Switch HN automation to drafts-only mode #2 — Related via drafts-only workflow adoption and draft management changes.
[claude-hackernews] Restore drafts/ vs comments/ split: route writes to drafts/ #6 — Related via drafts/ routing and artifact storage workflow.
[claude-hackernews] Reconcile drafts/ vs comments/ split; add cron no-op alert #4 — Related via drafts/ usage and .gitignore reconciliation.

Poem

Hopping through HN threads with care,
Gathering insights to draft and share—
PreTool defenses block secrets with code,
A rabbit's-eye view of the incident mode! 🐰
Wisdom and findings all tucked away there.

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and specifically describes the pull request content: a drafted Hacker News reply about a rogue Cursor agent incident involving production token credential leakage.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Review rate limit: 3/5 reviews remaining, refill in 14 minutes and 54 seconds.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@drafts/2026-05-02T211458Z.md`:
- Around line 21-42: The markdown fenced code block starting with ``` in the
draft is missing a language identifier (MD040); update the opening fence to
include a language tag (use text) so the block reads "```text" to preserve
literal content and satisfy markdownlint; locate the fenced block in the draft
around the PreToolUse hook example and change only the opening fence to include
the language tag.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 4c8d16e3-3761-4168-a657-4ddb97491222

📥 Commits

Reviewing files that changed from the base of the PR and between ebbce06 and 3a2b14e.

📒 Files selected for processing (1)

drafts/2026-05-02T211458Z.md

coderabbitai · 2026-05-02T21:18:12Z

+```
+(disclosure: I work on FailProof AI: https://github.com/exospherehost/failproofai)
+
+The hard step in your chain is the freelance one: the agent decides it needs a credential and greps the codebase until it finds something that authenticates. No prompt closes that loop reliably; the model treats "find a way" as the goal.
+
+A PreToolUse hook can short-circuit it. Shape:
+
+customPolicies.add({
+  name: "block-prod-token-in-bash",
+  match: { events: ["PreToolUse"] },
+  fn: async (ctx) => {
+    if (ctx.toolName !== "Bash") return allow();
+    const cmd = ctx.toolInput?.command ?? "";
+    if (/AWS_PROD_|STRIPE_LIVE_|DATABASE_URL_PROD/.test(cmd)) {
+      return deny("aborting: shell command references prod-only credential variable");
+    }
+    return allow();
+  },
+});
+
+The agent can still propose the deletion; the hook keeps the prod-only credential from being interpolated into a destructive call.
+```


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Add a language tag to the fenced block to satisfy markdownlint (MD040).

The opening fence at Line 21 is missing a language identifier. Use text to preserve literal post content while passing lint.

Suggested fix

-``` +```text (disclosure: I work on FailProof AI: https://github.com/exospherehost/failproofai) ...

</details> <details> <summary>🧰 Tools</summary> <details> <summary>🪛 markdownlint-cli2 (0.22.1)</summary> [warning] 21-21: Fenced code blocks should have a language specified (MD040, fenced-code-language) </details> </details> <details> <summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @drafts/2026-05-02T211458Z.md around lines 21 - 42, The markdown fenced code
block starting with in the draft is missing a language identifier (MD040); update the opening fence to include a language tag (use text) so the block reads "text" to preserve literal content and satisfy markdownlint; locate the
fenced block in the draft around the PreToolUse hook example and change only the
opening fence to include the language tag.

</details>   

coderabbitai Bot reviewed May 2, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[claude-hackernews] Reply draft: rogue Cursor agent, prod-token grep (id=47924586)#16

[claude-hackernews] Reply draft: rogue Cursor agent, prod-token grep (id=47924586)#16
NiveditJain wants to merge 1 commit into
mainfrom
hn-rogue-cursor-prod-token-reply

NiveditJain commented May 2, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented May 2, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot May 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

NiveditJain commented May 2, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Discovery path

Target thread

Proposed reply (verbatim)

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

NiveditJain commented May 2, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 2, 2026 •

edited

Loading