chore: Tweaks to Haiku prompt to improve repo selection. by sortafreel · Pull Request #60092 · PostHog/posthog

sortafreel · 2026-05-26T14:08:02Z

Problem

The Haiku gate was filtering out things it shouldn't — SDK crashes (the heuristic's "trace" substring tripped on "stack trace"), perf complaints about team-owned sites (the LLM treated them as ops/CDN issues), and "wrong data / missing events" reports (which are almost always tracking bugs in the team's own code, not PostHog).

Changes

Dropped "trace" from the heuristic and tightened the Haiku prompt with principled own-code-vs-PostHog-SaaS guidance
Plus an explicit "wrong data = tracking bug → needs_repo" exception; added 3 eval cases (posthog_product_hang, sdk_instrumentation_on_own_site, wrong_data_tracking_bug) as boundary regression guards.

How did you test this code?

👉 Stay up-to-date with PostHog coding conventions for a smoother review.

Publish to changelog?

Docs update

🤖 Agent context

sortafreel · 2026-05-26T14:08:16Z

chore: Tweaks to Haiku prompt to improve repo selection. #60092 👈 (View in Graphite)
master

This stack of pull requests is managed by Graphite. Learn more about stacking.

Copilot

Pull request overview

This PR tunes the Slack repo-selection classifier so Haiku is less likely to filter out tasks that should proceed to repository discovery, especially SDK/app crashes, team-owned site performance issues, and tracking-related wrong-data reports.

Changes:

Removes "trace" from the no-repo heuristic terms to avoid matching “stack trace.”
Expands the Haiku prompt with own-code vs PostHog SaaS guidance and a wrong-data tracking exception.
Updates the local Slack repo-selection eval with new/updated boundary cases.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
`products/slack_app/backend/api.py`	Adjusts heuristic terms and classifier prompt logic.
`posthog/temporal/ai/eval_slack_repo_selection.py`	Updates expected eval outcomes and adds regression cases for routing boundaries.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+        "the team's code → no_repo. Important exception: 'wrong data', 'missing events', or "
+        "'numbers look off' in PostHog usually means the team's tracking code is broken (wrong "
+        "event names, identification logic, SDK setup) — that's a code fix in their repo → "
+        "needs_repo. When in doubt, lean needs_repo=true — the discovery agent can still report "


greptile-apps · 2026-05-28T09:47:03Z

Prompt To Fix All With AI

Fix the following 1 code review issue. Work through them one at a time, proposing concise fixes.

---

### Issue 1 of 1
posthog/temporal/ai/eval_slack_repo_selection.py:169-192
These two cases are now `agent/found` outcomes but still sit under the `# --- Haiku gate short-circuits (heuristic + LLM) ---` section comment. Now that they represent the fixed (pass-through-to-agent) behaviour they would be easier to find alongside the other agent-path cases below the `# --- Agent path ---` marker.

```suggestion
    Case(
        name="posthog_product_hang",
```

_{Reviews (1): Last reviewed commit: "Merge branch 'master' into repo-selectio..." | Re-trigger Greptile}

greptile-apps · 2026-05-28T09:47:07Z

    Case(
        name="marketing_site",
-        description="Haiku LLM filters as ops/perf rather than code change.",
+        description="Perf complaint about a site the team likely owns code for — agent should route to docs/marketing repo.",
        text_template="@PostHog the docs site loads really slowly on mobile",
        thread_messages=[
            {"user": "tester", "text": "@PostHog the docs site loads really slowly on mobile"},
            {"user": "other", "text": "yeah I noticed the same on /docs/getting-started"},
        ],
-        expected_stage="haiku",
-        expected_outcome="no_repo",
-        note=(
-            "Ideally the agent would route this to a docs/marketing repo if connected. "
-            "Haiku LLM treats it as a perf/CDN question instead. Out of this PR's scope; "
-            "candidate for a follow-up Haiku-tuning PR backed by this eval."
-        ),
+        expected_stage="agent",
+        expected_outcome="found",
    ),
    Case(
        name="sdk_specific_trace_trip",
-        description="Haiku heuristic trips on 'trace' in 'stack trace'.",
+        description="App/SDK crash with stack trace — agent should route to the relevant SDK repo (e.g. posthog-ios).",
        text_template="@PostHog the iOS SDK is crashing on app launch after upgrade to 3.19",
        thread_messages=[
            {"user": "tester", "text": "@PostHog the iOS SDK is crashing on app launch after upgrade to 3.19"},
            {"user": "other", "text": "stack trace shows PostHogReplay.start() failing"},
        ],
+        expected_stage="agent",
+        expected_outcome="found",
+    ),
+    Case(
+        name="posthog_product_hang",


These two cases are now agent/found outcomes but still sit under the # --- Haiku gate short-circuits (heuristic + LLM) --- section comment. Now that they represent the fixed (pass-through-to-agent) behaviour they would be easier to find alongside the other agent-path cases below the # --- Agent path --- marker.

Suggested change

Case(

name="marketing_site",

description="Haiku LLM filters as ops/perf rather than code change.",

description="Perf complaint about a site the team likely owns code for — agent should route to docs/marketing repo.",

text_template="@PostHog the docs site loads really slowly on mobile",

thread_messages=[

{"user": "tester", "text": "@PostHog the docs site loads really slowly on mobile"},

{"user": "other", "text": "yeah I noticed the same on /docs/getting-started"},

],

expected_stage="haiku",

expected_outcome="no_repo",

note=(

"Ideally the agent would route this to a docs/marketing repo if connected. "

"Haiku LLM treats it as a perf/CDN question instead. Out of this PR's scope; "

"candidate for a follow-up Haiku-tuning PR backed by this eval."

),

expected_stage="agent",

expected_outcome="found",

),

Case(

name="sdk_specific_trace_trip",

description="Haiku heuristic trips on 'trace' in 'stack trace'.",

description="App/SDK crash with stack trace — agent should route to the relevant SDK repo (e.g. posthog-ios).",

text_template="@PostHog the iOS SDK is crashing on app launch after upgrade to 3.19",

thread_messages=[

{"user": "tester", "text": "@PostHog the iOS SDK is crashing on app launch after upgrade to 3.19"},

{"user": "other", "text": "stack trace shows PostHogReplay.start() failing"},

],

expected_stage="agent",

expected_outcome="found",

),

Case(

name="posthog_product_hang",

Case(

name="posthog_product_hang",

Prompt To Fix With AI

This is a comment left during a code review. Path: posthog/temporal/ai/eval_slack_repo_selection.py Line: 169-192 Comment: These two cases are now `agent/found` outcomes but still sit under the `# --- Haiku gate short-circuits (heuristic + LLM) ---` section comment. Now that they represent the fixed (pass-through-to-agent) behaviour they would be easier to find alongside the other agent-path cases below the `# --- Agent path ---` marker. ```suggestion Case( name="posthog_product_hang", ``` How can I resolve this? If you propose a fix, please make it concise.

Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!

deployment-status-posthog · 2026-05-29T09:52:47Z

Deploy status

Environment	Status	Deployed At	Workflow
dev	✅ Deployed	2026-05-29 09:52 UTC	Run
prod-us	✅ Deployed	2026-05-29 10:04 UTC	Run
prod-eu	✅ Deployed	2026-05-29 10:07 UTC	Run

## Problem - The Haiku gate was filtering out things it shouldn't — SDK crashes (the heuristic's `"trace"` substring tripped on `"stack trace"`), perf complaints about team-owned sites (the LLM treated them as ops/CDN issues), and "wrong data / missing events" reports (which are almost always tracking bugs in the team's own code, not PostHog).    ## Changes - Dropped `"trace"` from the heuristic and tightened the Haiku prompt with principled own-code-vs-PostHog-SaaS guidance - Plus an explicit "wrong data = tracking bug → needs_repo" exception; added 3 eval cases (`posthog_product_hang`, `sdk_instrumentation_on_own_site`, `wrong_data_tracking_bug`) as boundary regression guards.   ## How did you test this code?    👉 _Stay up-to-date with_ [_PostHog coding conventions_](https://posthog.com/docs/contribute/coding-conventions) _for a smoother review._ ## Publish to changelog?    ## Docs update  ## 🤖 Agent context

chore: Tweaks to Haiku prompt to improve repo selection.

47c14f7

sortafreel force-pushed the repo-selection-haiku-fixes branch from d0fb56e to 47c14f7 Compare May 26, 2026 14:08

sortafreel requested review from a team May 26, 2026 15:03

Merge branch 'master' into repo-selection-haiku-fixes

abfac11

sortafreel marked this pull request as ready for review May 28, 2026 09:43

Copilot AI review requested due to automatic review settings May 28, 2026 09:43

Copilot started reviewing on behalf of sortafreel May 28, 2026 09:43 View session

Merge branch 'master' into repo-selection-haiku-fixes

25f7d8b

Copilot AI reviewed May 28, 2026

View reviewed changes

greptile-apps Bot reviewed May 28, 2026

View reviewed changes

andrewm4894 approved these changes May 28, 2026

View reviewed changes

sortafreel merged commit 0073089 into master May 29, 2026
228 checks passed

sortafreel deleted the repo-selection-haiku-fixes branch May 29, 2026 09:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: Tweaks to Haiku prompt to improve repo selection.#60092

chore: Tweaks to Haiku prompt to improve repo selection.#60092
sortafreel merged 3 commits into
masterfrom
repo-selection-haiku-fixes

sortafreel commented May 26, 2026 •

edited

Loading

Uh oh!

sortafreel commented May 26, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

greptile-apps Bot commented May 28, 2026

Uh oh!

greptile-apps Bot May 28, 2026

Uh oh!

Uh oh!

deployment-status-posthog Bot commented May 29, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sortafreel commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Changes

How did you test this code?

Publish to changelog?

Docs update

🤖 Agent context

Uh oh!

sortafreel commented May 26, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

greptile-apps Bot commented May 28, 2026

Uh oh!

greptile-apps Bot May 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

deployment-status-posthog Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploy status

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sortafreel commented May 26, 2026 •

edited

Loading

deployment-status-posthog Bot commented May 29, 2026 •

edited

Loading