Skip to content

Repo Pulse v2: deterministic pre-fetch + disable integrity filter#16421

Open
joperezr wants to merge 2 commits intomainfrom
repo-pulse-v2
Open

Repo Pulse v2: deterministic pre-fetch + disable integrity filter#16421
joperezr wants to merge 2 commits intomainfrom
repo-pulse-v2

Conversation

@joperezr
Copy link
Copy Markdown
Member

@joperezr joperezr commented Apr 24, 2026

Description

Rework the Repo Pulse daily workflow so every PR and issue in the window reliably shows up in the pinned report (#16404). The first production run was missing items that the team expected to see.

What's changing

  • Deterministic data collection. All five sections' data is now gathered in a pre-agent-steps block using gh api --paginate before the agent starts, on the same runner/workspace. The agent just reads the resulting JSON files and renders the report — it no longer enumerates GitHub itself, so coverage is no longer model-dependent.
  • Smaller, schema-tight input to the agent. The pre-step extracts only the fields the report actually uses (number, title, author, labels, timestamps, URL). Titles are normalized (whitespace-trimmed, HTML comments stripped) before being handed to the agent.
  • Filter out quarantined-test / failing-test from the Activity Highlights section per team feedback — label churn on those surfaces isn't meaningful "attention going somewhere new" signal. Filed Issues is unchanged (new issues with those labels are still legit new work).
  • Precomputed "See all" URLs are passed through meta.json so the agent doesn't have to build GitHub search queries itself.

Validation

  • gh aw compile clean (0 errors, 1 unrelated "fuzzy schedule" advisory we kept intentionally for 08:00 PST). actions-lock.json unchanged vs main.
  • bash -n on the extracted pre-agent-steps script passes.
  • End-to-end jq test of the field-extraction filter on sample items produces the expected output.
  • Final verification is a workflow_dispatch run after merge to confirm the report reflects the expected items.

FYI @davidfowl · reviewer @radical

Fixes # (issue)

Checklist

  • Is this feature complete?
    • Yes. Ready to ship.
    • No. Follow-up changes expected.
  • Are you including unit tests for the changes and scenario tests if relevant?
    • Yes
    • No
  • Did you add public API?
    • Yes
      • If yes, did you have an API Review for it?
        • Yes
        • No
      • Did you add <remarks /> and <code /> elements on your triple slash comments?
        • Yes
        • No
    • No
  • Does the change make any security assumptions or guarantees?
    • Yes
      • If yes, have you done a threat model and had a security review?
        • Yes
        • No
    • No
  • Does the change require an update in our Aspire docs?

@joperezr joperezr requested review from Copilot and radical April 24, 2026 03:11
@joperezr
Copy link
Copy Markdown
Member Author

FYI @davidfowl — this is the v2 fix for the Repo Pulse missing-PRs issue surfaced in the Teams thread.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 24, 2026

🚀 Dogfood this PR with:

⚠️ WARNING: Do not do this without first carefully reviewing the code of this PR to satisfy yourself it is safe.

curl -fsSL https://raw.githubusercontent.com/microsoft/aspire/main/eng/scripts/get-aspire-cli-pr.sh | bash -s -- 16421

Or

  • Run remotely in PowerShell:
iex "& { $(irm https://raw.githubusercontent.com/microsoft/aspire/main/eng/scripts/get-aspire-cli-pr.ps1) } 16421"

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Reworks the Repo Pulse agentic workflow to make daily reporting deterministic by pre-fetching/sanitizing GitHub data before the agent runs, and disables the GitHub MCP integrity filter since the agent should no longer ingest untrusted GitHub content directly.

Changes:

  • Adds pre-agent-steps that fetch Repo Pulse datasets via gh api --paginate, sanitizes fields (notably titles), and writes a local .repo-pulse/ JSON bundle for the agent to render.
  • Disables the GitHub MCP integrity filter (min-integrity: none) and restricts MCP toolsets to repos (agent is instructed not to do any GitHub search/list calls).
  • Updates the locked workflow to include the new prefetch step and updated MCP guard configuration.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 6 comments.

File Description
.github/workflows/repo-pulse.md Adds deterministic prefetch + sanitization and updates the agent instructions/guardrails to consume only .repo-pulse/*.json.
.github/workflows/repo-pulse.lock.yml Regenerates the locked workflow to run the prefetch script and apply the updated MCP configuration.

Comment thread .github/workflows/repo-pulse.md Outdated
Comment thread .github/workflows/repo-pulse.lock.yml Outdated
Comment thread .github/workflows/repo-pulse.md
Comment thread .github/workflows/repo-pulse.md
Comment thread .github/workflows/repo-pulse.md Outdated
Comment thread .github/workflows/repo-pulse.md
Rework the Repo Pulse workflow so every PR and issue in the 3-day window
reliably shows up in the pinned report (#16404). The first production run
was missing items that the team expected to see; enumeration depended on
the model's pagination behaviour.

Changes:

1. Deterministic data collection in a `pre-agent-steps` block using
   `gh api --paginate`. All five sections' data is collected into JSON
   files under `.repo-pulse/` before the agent starts, on the same
   runner/workspace. The agent reads those files and renders the report;
   it no longer enumerates GitHub itself.

2. Smaller, schema-tight input to the agent: only the fields the report
   actually uses (number, title, author, labels, timestamps, URL).
   Titles are normalized before being handed to the agent.

3. Filter out `quarantined-test` and `failing-test` labels from the
   Activity Highlights section per team feedback. Filed Issues is
   unchanged (new issues with those labels are still legit new work).

4. Precomputed `See all'' URLs are passed through `meta.json` so the
   agent doesn't have to build GitHub search queries itself.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Comment thread .github/workflows/repo-pulse.md Outdated
Comment thread .github/workflows/repo-pulse.md
Addresses review feedback on PR #16421 around daily-job reliability:

- Each section's fetch now runs independently: a single gh api failure no longer aborts the entire report. Failing sections write an empty array and surface a warning; remaining sections still render.

- The search/issues API can set incomplete_results: true when the backend times out. We now check this signal across all paginated responses and record a warning rather than silently publishing a partial slice as if it were complete.

- Warnings are accumulated into meta.data_quality_warnings (array of strings). The agent prompt now instructs the renderer to emit a banner at the top of the pinned issue when that array is non-empty, so readers can see at a glance that the dashboard may be partial.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@github-actions
Copy link
Copy Markdown
Contributor

🎬 CLI E2E Test Recordings — 75 recordings uploaded (commit 935715e)

View all recordings
Status Test Recording
AddPackageInteractiveWhileAppHostRunningDetached ▶️ View Recording
AddPackageWhileAppHostRunningDetached ▶️ View Recording
AgentCommands_AllHelpOutputs_AreCorrect ▶️ View Recording
AgentInitCommand_DefaultSelection_InstallsSkillOnly ▶️ View Recording
AgentInitCommand_MigratesDeprecatedConfig ▶️ View Recording
AspireAddPackageVersionToDirectoryPackagesProps ▶️ View Recording
AspireUpdateRemovesAppHostPackageVersionFromDirectoryPackagesProps ▶️ View Recording
Banner_DisplayedOnFirstRun ▶️ View Recording
Banner_DisplayedWithExplicitFlag ▶️ View Recording
Banner_NotDisplayedWithNoLogoFlag ▶️ View Recording
CertificatesClean_RemovesCertificates ▶️ View Recording
CertificatesTrust_WithNoCert_CreatesAndTrustsCertificate ▶️ View Recording
CertificatesTrust_WithUntrustedCert_TrustsCertificate ▶️ View Recording
ConfigSetGet_CreatesNestedJsonFormat ▶️ View Recording
CreateAndRunAspireStarterProject ▶️ View Recording
CreateAndRunAspireStarterProjectWithBundle ▶️ View Recording
CreateAndRunEmptyAppHostProject ▶️ View Recording
CreateAndRunJavaEmptyAppHostProject ▶️ View Recording
CreateAndRunJsReactProject ▶️ View Recording
CreateAndRunPythonReactProject ▶️ View Recording
CreateAndRunTypeScriptEmptyAppHostProject ▶️ View Recording
CreateAndRunTypeScriptStarterProject ▶️ View Recording
CreateJavaAppHostWithViteApp ▶️ View Recording
CreateTypeScriptAppHostWithViteApp_UsesConfiguredToolchain ▶️ View Recording
DashboardRunWithOtelTracesReturnsNoTraces ▶️ View Recording
DeployK8sBasicApiService ▶️ View Recording
DeployK8sWithGarnet ▶️ View Recording
DeployK8sWithMongoDB ▶️ View Recording
DeployK8sWithMySql ▶️ View Recording
DeployK8sWithPostgres ▶️ View Recording
DeployK8sWithRabbitMQ ▶️ View Recording
DeployK8sWithRedis ▶️ View Recording
DeployK8sWithSqlServer ▶️ View Recording
DeployK8sWithValkey ▶️ View Recording
DeployTypeScriptAppToKubernetes ▶️ View Recording
DescribeCommandResolvesReplicaNames ▶️ View Recording
DescribeCommandShowsRunningResources ▶️ View Recording
DetachFormatJsonProducesValidJson ▶️ View Recording
DetachFormatJsonProducesValidJsonWhenRestartingExistingInstance ▶️ View Recording
DoListStepsShowsPipelineSteps ▶️ View Recording
DocsCommand_RendersInteractiveMarkdownFromLocalSource ▶️ View Recording
DoctorCommand_DetectsDeprecatedAgentConfig ▶️ View Recording
DoctorCommand_TypeScriptAppHostReportsMissingConfiguredToolchain ▶️ View Recording
DoctorCommand_WithSslCertDir_ShowsTrusted ▶️ View Recording
DoctorCommand_WithoutSslCertDir_ShowsPartiallyTrusted ▶️ View Recording
GlobalMigration_HandlesCommentsAndTrailingCommas ▶️ View Recording
GlobalMigration_HandlesMalformedLegacyJson ▶️ View Recording
GlobalMigration_PreservesAllValueTypes ▶️ View Recording
GlobalMigration_SkipsWhenNewConfigExists ▶️ View Recording
GlobalSettings_MigratedFromLegacyFormat ▶️ View Recording
InitTypeScriptAppHost_AugmentsExistingViteRepoAtRoot ▶️ View Recording
InvalidAppHostPathWithComments_IsHealedOnRun ▶️ View Recording
LegacySettingsMigration_AdjustsRelativeAppHostPath ▶️ View Recording
LogsCommandShowsResourceLogs ▶️ View Recording
OtelLogsReturnsStructuredLogsFromStarterAppCore ▶️ View Recording
PsCommandListsRunningAppHost ▶️ View Recording
PsFormatJsonOutputsOnlyJsonToStdout ▶️ View Recording
PublishWithConfigureEnvFileUpdatesEnvOutput ▶️ View Recording
PublishWithDockerComposeServiceCallbackSucceeds ▶️ View Recording
PublishWithoutOutputPathUsesAppHostDirectoryDefault ▶️ View Recording
RestoreGeneratesSdkFiles ▶️ View Recording
RestoreGeneratesSdkFiles_WithConfiguredToolchain ▶️ View Recording
RestoreRefreshesGeneratedSdkAfterAddingIntegration ▶️ View Recording
RestoreSupportsConfigOnlyHelperPackageAndCrossPackageTypes ▶️ View Recording
RunFromParentDirectory_UsesExistingConfigNearAppHost ▶️ View Recording
SecretCrudOnDotNetAppHost ▶️ View Recording
SecretCrudOnTypeScriptAppHost ▶️ View Recording
StagingChannel_ConfigureAndVerifySettings_ThenSwitchChannels ▶️ View Recording
StartAndWaitForTypeScriptSqlServerAppHostWithNativeAssets ▶️ View Recording
StopAllAppHostsFromAppHostDirectory ▶️ View Recording
StopAllAppHostsFromUnrelatedDirectory ▶️ View Recording
StopNonInteractiveMultipleAppHostsShowsError ▶️ View Recording
StopNonInteractiveSingleAppHost ▶️ View Recording
StopWithNoRunningAppHostExitsSuccessfully ▶️ View Recording
UnAwaitedChainsCompileWithAutoResolvePromises ▶️ View Recording

📹 Recordings uploaded automatically from CI run #24911440426

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants