Agent Persona Exploration - 2026-03-19 #21704

2026-03-19T01:33:40Z

github-actions[bot]
bot Mar 19, 2026

This report documents a systematic evaluation of the developer.instructions agent (the local proxy for the agentic-workflows custom agent) across 6 representative automation scenarios drawn from 5 software worker personas.

Persona Overview

Agent tested: developer.instructions (agentic-workflows dispatcher)
Scenarios tested: 6 (from 5 personas: Backend Engineer, Frontend Developer, DevOps Engineer, QA Tester, Product Manager)
Average quality score: 4.9 / 5.0
Run: §23275168075

Key Findings

🏆 Consistently high quality — every scenario scored 4.8–5.0; no scenario fell below "very good"
🔐 Security posture is strong — lockdown: true, safe-outputs-only writes, and strict: true were applied universally across all scenarios
🎯 Trigger selection is nuanced — the agent correctly chose workflow_run over pull_request when CI artifacts are needed (QA coverage, DevOps incidents), with clear reasoning documented
📤 noop enforcement — every scenario included a mandatory noop fallback path, preventing silent failures
💡 Bonus patterns — responses routinely added non-asked-for value: cost estimates, concurrency configs, timezone guidance, fetch-depth: 0, and label-based branch protection

Top Patterns Observed

Trigger hierarchy: workflow_run → pull_request (with path filters) → schedule + workflow_dispatch → push: tags
Toolset specificity: Always uses toolsets: [repos, pull_requests] (not default) — minimizes blast radius
Security trinity: lockdown: true + sandbox.agent: awf + strict: true applied universally
hide-older-comments: true on all PR comment outputs — prevents comment spam on re-runs
Data staging pattern: Use steps: block to pre-fetch artifacts before agent starts (S7 QA scenario)

View Top Scoring Responses (5.0/5.0)

S1 — Backend Schema Review (Backend Engineer)

Perfect trigger: pull_request with path filters (**.sql, */migrations/*.py)
6-step analysis pipeline with a rubric (DANGER/WARNING/SAFE classification table)
Label-based branch protection integration (migration-review: danger blocks merges)
submit-pull-request-review + add-labels safe-outputs for formal review workflow

S5 — Deployment Incident Responder (DevOps Engineer)

workflow_run trigger with if: $\{\{ github.event.workflow_run.conclusion == 'failure' }}
cache-memory used as a growing RCA pattern database (learns from past failures)
close-older-issues: true prevents duplicate incident backlog on recurring failures
expires: 7d creates an SLA forcing function

S10 — Release Notes Drafter (Product Manager)

push: tags: v[0-9]+.[0-9]+.[0-9]+ with fetch-depth: 0 for full git history
Handles both merge-commit repos (git log --merges) and squash-merge repos (date-range API fallback)
Explicitly prevents shell injection: export CURRENT_TAG="$\{\{ github.ref_name }}" then \$\{CURRENT_TAG} in commands
workflow_dispatch inputs for manual backfills with previous_tag override

View Areas for Improvement (minor issues only)

S3 — Visual Regression (Frontend Developer) — score 4.8

⚠️ Script injection risk via PR branch package.json scripts is noted but mitigation could be stronger (e.g., explicitly recommend --ignore-scripts flag or using a locked lockfile)
Suggestion: add a steps: block to run npm ci --ignore-scripts before handing off to the agent

S7 — QA Coverage (QA Tester) — score 4.8

The steps: pre-fetch pattern is excellent, but the artifact naming convention (coverage-report) is hardcoded — should provide guidance on making this configurable
Security section is briefer than other scenarios; could explicitly warn about coverage XML being untrusted input

S9 — Weekly Feature Digest (Product Manager) — score 4.8

The prompt uses gh CLI in bash tool for queries, but the gh CLI may not be pre-authenticated in all environments — the GitHub MCP toolset is preferable for reads and was also included, creating slight redundancy
Suggestion: use GitHub MCP exclusively for read operations, reserve bash for date math and jq filtering only

Recommendations

Document workflow_run vs pull_request decision tree — The agent correctly chooses between these, but the reasoning should be codified as a reusable pattern in the create-agentic-workflow.md prompt to help new workflow authors understand the tradeoff
Add a steps: pre-fetch pattern to the prompt library — The artifact-staging pattern (S7: download CI artifacts before agent starts) is non-obvious but essential for coverage/artifact workflows; it deserves a named example in the docs
Clarify gh CLI auth in bash tool context — Several responses used gh CLI commands in bash; the prompt should explicitly note that gh is authenticated via GITHUB_TOKEN in the steps: context but may behave differently in the agent's bash context vs. the GitHub MCP

References:

§23275168075

AI generated by Agent Persona Explorer · history

pelikhan · 2026-03-19T01:41:43Z

pelikhan
Mar 19, 2026
Maintainer

/plan

1 reply

github-actions[bot] bot Mar 19, 2026
Author

🚀 Plan Command has started processing this discussion comment

2026-03-19T03:33:05Z

github-actions[bot]
bot Mar 19, 2026
Author

🚀 Beep boop! The smoke test agent has landed in this discussion like a confused robot at a tea party! 🤖☕

Tests ran, bits were toggled, and the Copilot engine valiantly compiled Go code while the world slept. Everything passed (mostly)! The build succeeded, Playwright confirmed GitHub still exists (phew!), and the web-fetch tool reports that the internet continues to internet. 🌐

Now back to my regularly scheduled automation...

📰 BREAKING: Report filed by Smoke Copilot · ◷

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent Persona Exploration - 2026-03-19 #21704

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Agent Persona Exploration - 2026-03-19 #21704

Uh oh!

github-actions[bot] bot Mar 19, 2026

Persona Overview

Key Findings

Top Patterns Observed

Recommendations

Replies: 2 comments · 1 reply

Uh oh!

pelikhan Mar 19, 2026 Maintainer

Uh oh!

github-actions[bot] bot Mar 19, 2026 Author

Uh oh!

github-actions[bot] bot Mar 19, 2026 Author

github-actions[bot]
bot Mar 19, 2026

Replies: 2 comments 1 reply

pelikhan
Mar 19, 2026
Maintainer

github-actions[bot] bot Mar 19, 2026
Author

github-actions[bot]
bot Mar 19, 2026
Author