feat(daily-semgrep-scan): add semgrep_output_format A/B experiment by Copilot · Pull Request #32802 · github/gh-aw

Copilot · 2026-05-17T12:11:49Z

Instruments the daily-semgrep-scan workflow with a 3-variant A/B experiment to test whether output structure affects code scanning alert creation rate and completeness.

Frontmatter changes

Added experiments.semgrep_output_format block with variants bullet_list, structured_sections, prose
Weighted 34/33/33, proportion_test analysis, 30-run minimum, guardrail run_success_rate >= 0.85
Dropped the unsupported direction: min field; expressed threshold as ">=0.85" (consistent with other workflows)

Prompt changes

Extended the single-line prompt with {{#if}} conditional blocks (single-quoted comparisons per gh-aw convention):

Scan the repository for SQL injection vulnerabilities using Semgrep.

{{#if experiments.semgrep_output_format == 'bullet_list' }}
Report each finding as a flat bullet point in this format:
- **[SEVERITY]** `<file>:<line>` — Rule: `<rule_id>` — <message>

Create one code scanning alert per finding.
{{/if}}
{{#if experiments.semgrep_output_format == 'structured_sections' }}
Structure your findings report with:
1. A summary table: | Severity | Count |
2. Sections grouped by severity (Critical, High, Medium, Low), then by rule ID
3. For each finding: file path, line number, rule, and recommended fix

Create one code scanning alert per finding.
{{/if}}
{{#if experiments.semgrep_output_format == 'prose' }}
Write a narrative security assessment describing the vulnerability patterns found. Embed
specific findings (file, line, rule) within the prose. Conclude with a prioritized
remediation list.

Create one code scanning alert per finding.
{{/if}}

Lock file

Recompiled daily-semgrep-scan.lock.yml — compiles clean (one expected "experimental feature" advisory).

…workflow Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot

Pull request overview

Adds an A/B/n experiment to the daily-semgrep-scan agentic workflow to test whether Semgrep findings output structure affects code scanning alert creation and report completeness.

Changes:

Introduces experiments.semgrep_output_format (3 variants, weighted 34/33/33) in the workflow markdown frontmatter and adds variant-specific prompt sections.
Recompiles the lock workflow to restore/pick experiment assignments, persist experiment state to a dedicated git branch, and thread the chosen variant into prompt interpolation/execution.

Show a summary per file

File	Description
.github/workflows/daily-semgrep-scan.md	Defines the `semgrep_output_format` experiment and adds conditional prompt blocks per variant.
.github/workflows/daily-semgrep-scan.lock.yml	Compiled workflow wiring for experiment state restore/pick, artifact handling, and pushing state back to git.

Copilot's findings

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comments suppressed due to low confidence (2)

.github/workflows/daily-semgrep-scan.md:51

Same issue as above: using single quotes in the equality check prevents the condition from being evaluated as a comparison by the current template engine, so the block will be kept for all runs. Use double quotes for the RHS so the experiment gating works.

{{#if experiments.semgrep_output_format == 'structured_sections' }}

.github/workflows/daily-semgrep-scan.md:59

Same issue as above: this {{#if}} comparison uses single quotes, but the template condition evaluator only supports comparisons against double-quoted strings. As written, this block will always render. Change to double quotes to ensure only the selected variant’s section is included.

{{#if experiments.semgrep_output_format == 'prose' }}

Files reviewed: 2/2 changed files
Comments generated: 2


 Scan the repository for SQL injection vulnerabilities using Semgrep.

+{{#if experiments.semgrep_output_format == 'bullet_list' }}



+{{#if experiments.semgrep_output_format == 'bullet_list' }}
+Report each finding as a flat bullet point in this format:
+- **[SEVERITY]** `<file>:<line>` — Rule: `<rule_id>` — <message>


Initial plan

cb6fcbe

Copilot AI assigned Copilot and pelikhan May 17, 2026

Copilot started work on behalf of pelikhan May 17, 2026 12:11 View session

Copilot AI linked an issue May 17, 2026 that may be closed by this pull request

[ab-advisor] Experiment campaign for daily-semgrep-scan: A/B test output_format #32795

Closed

7 tasks

feat: add semgrep_output_format A/B experiment to daily-semgrep-scan …

5c7a9e1

…workflow Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Experiment campaign for daily-semgrep-scan output format~~ feat(daily-semgrep-scan): add semgrep_output_format A/B experiment May 17, 2026

Copilot finished work on behalf of pelikhan May 17, 2026 12:21

Copilot AI requested a review from pelikhan May 17, 2026 12:21

pelikhan marked this pull request as ready for review May 17, 2026 12:22

Copilot AI review requested due to automatic review settings May 17, 2026 12:22

pelikhan merged commit 7757f81 into main May 17, 2026

pelikhan deleted the copilot/experiment-campaign-output-format branch May 17, 2026 12:22

Copilot started reviewing on behalf of pelikhan May 17, 2026 12:23 View session

Copilot AI reviewed May 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(daily-semgrep-scan): add semgrep_output_format A/B experiment#32802

feat(daily-semgrep-scan): add semgrep_output_format A/B experiment#32802
pelikhan merged 2 commits into
mainfrom
copilot/experiment-campaign-output-format

Copilot AI commented May 17, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		Scan the repository for SQL injection vulnerabilities using Semgrep.

		{{#if experiments.semgrep_output_format == 'bullet_list' }}

Conversation

Copilot AI commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Frontmatter changes

Prompt changes

Lock file

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented May 17, 2026 •

edited

Loading