feat(signals): Agentic report generation (repo discovery -> signals research -> actionability -> priority -> summary) by sortafreel · Pull Request #51408 · PostHog/posthog

sortafreel · 2026-03-17T22:55:22Z

Problem

Signal reports were summarized by a single LLM call (title + summary) and then judged for actionability, with no ability to investigate the underlying codebase or production data, or to verify signal claims, before deciding what to do.

How to make it work

Ensure you can run local sandboxes (see products/tasks/backend/temporal/process_task/SETUP_GUIDE.md)
Ensure the signals-agentic-report-generation feature flag is enabled for the team (or make signals_agentic_report_gate_activity return True when testing)
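
For local testing, the gate mentioned above could be stubbed roughly like this. It is a minimal sketch, not the code in this PR: only the flag key comes from the description, while `agentic_report_gate`, the `flag_lookup` callable, and the `FORCE_AGENTIC_REPORTS` environment variable are hypothetical names used for illustration.

```python
import os
from typing import Callable

# Flag key as named in the PR description.
FLAG_KEY = "signals-agentic-report-generation"


def agentic_report_gate(team_id: int, flag_lookup: Callable[[str, int], bool]) -> bool:
    """Return True when the agentic flow should run for this team.

    Hypothetical stand-in for signals_agentic_report_gate_activity; the real
    activity's signature is not shown on this page.
    """
    # Assumed local-testing override, so no feature flag service is required.
    if os.environ.get("FORCE_AGENTIC_REPORTS") == "1":
        return True
    # Otherwise defer to whatever flag evaluation the caller supplies.
    return flag_lookup(FLAG_KEY, team_id)
```

With the override unset, the result simply mirrors the flag lookup for the team.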

Changes

Add an agentic report research flow behind a feature flag: after the safety judge passes, a sandbox agent clones the relevant repo, investigates each signal against the actual code/data (via MCP tools + gh CLI), and produces per-signal findings plus a per-report actionability assessment, priority, and title/summary
Add a repository selection step: it picks the most relevant repo from the team's GitHub integrations (single repo: used directly; N repos: a sandbox agent explores the candidates)
Support re-promotion of ready reports: when enough new signals accumulate, the workflow re-runs, reuses the previous repo selection, and lightly validates previous findings instead of re-researching from scratch
Add local testing harnesses (analyze_report, select_repo, parse_sandbox_log) for exercising the flow against synthetic signals; these are intended to be reworked into evals
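
The repository selection rule and the reuse on re-promotion can be sketched as below. This is an illustration under stated assumptions, not the PR's implementation: `RepoSelection`, `select_repository`, and `explore_with_agent` are hypothetical names, and both the zero-repo behavior and the "previous choice must still be a candidate" check are assumptions the description does not cover.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass(frozen=True)
class RepoSelection:
    repository: Optional[str]  # e.g. "org/name"; None when nothing is connected
    used_agent: bool  # True when a sandbox agent had to explore the candidates


def select_repository(
    candidates: Sequence[str],
    explore_with_agent: Callable[[Sequence[str]], str],
    previous: Optional[RepoSelection] = None,
) -> RepoSelection:
    # Re-promotion: reuse the earlier choice instead of selecting again
    # (assumed to apply only while that repo is still connected).
    if previous is not None and previous.repository in candidates:
        return previous
    if not candidates:
        # Assumption: the description does not say what happens with zero repos.
        return RepoSelection(repository=None, used_agent=False)
    if len(candidates) == 1:
        # Single repo: use it directly, no agent run needed.
        return RepoSelection(repository=candidates[0], used_agent=False)
    # N repos: delegate to the (expensive) sandbox agent exploration.
    chosen = explore_with_agent(candidates)
    if chosen not in candidates:
        raise ValueError(f"Agent picked an unknown repository: {chosen!r}")
    return RepoSelection(repository=chosen, used_agent=True)
```

The point of the single-repo shortcut is cost: the agent only runs when there is a real choice to make.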

How did you test this code?

Repo selection

Report generation

Publish to changelog?

Docs update


products/signals/backend/temporal/grouping.py

sortafreel · 2026-03-21T06:30:31Z

products/tasks/backend/temporal/process_task/activities/send_followup_to_sandbox.py

    try:
        task_run = TaskRun.objects.select_related("task__created_by").get(id=input.run_id)
    except TaskRun.DoesNotExist:
+        error_msg = "Task run not found"


@tatoalo @joshsny I pinged you for review because I changed follow-ups (they now raise on failure) and added a generic custom_prompt_multi_turn_runner.py. I don't expect you to check the whole thing :)
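
The "raise when failing" behavior amounts to something like the following. This is a simplified, framework-free sketch: `FollowupError`, `send_followup`, and the in-memory `runs` dict are hypothetical stand-ins for the Temporal activity and the `TaskRun` lookup, and only the error message is taken from the diff.

```python
class FollowupError(RuntimeError):
    """Raised when a follow-up message cannot be delivered (hypothetical name)."""


def send_followup(run_id: str, runs: dict[str, dict], message: str) -> None:
    """Append a follow-up message to a run, failing loudly if the run is missing."""
    run = runs.get(run_id)
    if run is None:
        # Raise right away: returning quietly would leave the caller
        # waiting for a reply until its own timeout expires.
        raise FollowupError("Task run not found")
    run.setdefault("messages", []).append(message)
```

Raising lets the calling workflow surface the failure immediately instead of discovering it through a timeout.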

Twixes · 2026-03-24T17:08:12Z

feat(signals): Agentic report generation (repo discovery -> signals research -> actionability -> priority -> summary) #51408: 2 dependent PRs (#52141, #52623) 👈
master

This stack of pull requests is managed by Graphite. Learn more about stacking.

Twixes

This doesn't upset me

Twixes · 2026-03-25T01:22:03Z

products/signals/backend/report_generation/research.py

+    @field_validator("explanation")
+    @classmethod
+    def explanation_must_not_be_empty(cls, v: str) -> str:
+        if not v.strip():
+            raise ValueError("Explanation must not be empty")
+        return v


Looks like a complex way of doing min_length=1

Twixes · 2026-03-27T14:44:20Z

products/signals/backend/temporal/agentic/select_repository.py

+    membership = (
+        OrganizationMembership.objects.select_related("user")
+        .filter(organization=team.organization)
+        .order_by("id")
+        .first()
+    )


Well, this will be completely false; we should have a "system" user instead

Twixes · 2026-03-27T14:49:08Z

products/signals/backend/management/commands/select_repo.py

Please do test commands in a stacked PR in the future, as this really inflates the mental overhead of the PR vs. just its core functionality

Twixes · 2026-03-27T14:53:42Z

products/signals/backend/report_generation/select_repo.py

+# Small public repo to copy into the sandbox to init. Could be removed later when it's possible to create sandboxes without repos.
+REPO_SELECTION_DUMMY_REPOSITORY = "PostHog/.github"


This is really dumb and a waste of time, but I get that the sandbox doesn't yet allow not cloning a repo

sortafreel added 30 commits March 12, 2026 18:04

feat: Extract run-sandbox-with-a-custom-prompt

4832137

fix: Remove semaphore.

b73c3cd

Merge branch 'master' into make-sandbox-agent-run-custom-prompt

096fa57

fix: Reuse Task.create_and_run

f1ea2b2

feat: Custom prompt test command.

783578e

feat: Improve the JSON extraction.

e4262c5

chore: More logging.

47322c2

chore: Rename class for clarity.

99e45fa

chore: Rename for clarity.

c447acd

chore: Comment.

8a2bab9

fix: Ensure to pick owner user.

53bd8d2

fix: Avoid printing logs by default.

73e4e11

fix: Expose branch through create_and_run. Remove excessive branch check.

07a7564

fix: Add retries for log reads.

da1a627

fix: Ensure the last message is not empty.

27b6976

paranoid: Store the state of messages even if timeout hits.

71df80c

feat: JSON extraction tests.

620e5c1

fix: Remove excessive logs.

d9e4001

fix: Mypy.

65fec7a

feat: Base research.

c33b3fc

feat: Multi-turn research.

66ded98

Merge branch 'master' into signals/report-generation-agent

41f893e

chore: Reuse code.

8ad2274

fix: Reuse code.

38ba521

feat: Split research into actionability/priority to access them separately.

0d5cc26

feat: Clean up prompt.

2b81932

chore: Make title/summary optional.

0a4fee4

chore: Tiny dedup safeguard.

7871a0c

fix: Parse logs.

012e97b

chore: Helper script to read logs.

4571ea0

sortafreel added 19 commits March 20, 2026 14:13

feat: Repo selection test.

6ab903e

fix: Propagate error.

fbe693b

fix: Typo.

bdb0a8e

fix: Strip.

5bd1051

fix: Avoid validation errors.

ad22bdd

chore: Revert increase.

c3ea4d7

chore: Nvm, return it back.

55be1e1

fix: Avoid reading logs from the previous turn.

c3552bf

fix: Fail follow-up instead of waiting for timeout.

85c5d4d

fix: Ensure we use proper signal id.

2f6e0be

fix: Standardize cases.

f39aa35

fix: Add heartbeat to the report.

d9241d2

fix: Cover errors.

7fa02e6

fix: Repo selection.

8491fe7

fix: Pagination.

3edc8f0

fix: Test.

c7ebf6d

fix: Add dependency.

4c63558

fix: Logs.

4a9694d

Merge branch 'master' into signals/report-generation-agent

f05d669

graphite-app bot reviewed Mar 20, 2026

products/signals/backend/temporal/grouping.py

sortafreel added 2 commits March 20, 2026 16:07

chore: Tests.

e0c7b6e

fix: Tests.

f8450b9

sortafreel requested review from a team, joshsny and tatoalo March 21, 2026 06:26

Twixes mentioned this pull request Mar 24, 2026

fix(tasks): Simplify repository selector to reuse existing components #52141

Open

Twixes approved these changes Mar 27, 2026

Twixes mentioned this pull request Mar 27, 2026

feat: suggested reviewers and signal report improvements #52623

Draft

sortafreel wants to merge 111 commits into master from signals/report-generation-agent

3 participants
