[experiments] Daily Experiment Report — 2026-05-27 #35162
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it expired on 2026-05-30T09:11:35.780Z.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
🧪 Daily Experiment Report — 2026-05-27
22 active experiments tracked across 21 workflows. 8 experiments have outcome data available for statistical analysis — all are still collecting data (no experiment reached statistical significance today). No promotions or guardrail failures detected.
⚡ Quick Stats
caveman·smoke-copilot.mdDoes caveman mode affect smoke test success rate?
📈 View Detailed Statistics
Sample Sizes & Progress
noyesRecommendation: ABANDON — No significant difference detected with all variants at 30+ runs
subagent_model·smoke-copilot.mdDoes sub-agent model size affect test success and duration?
📈 View Detailed Statistics
Sample Sizes & Progress
largesmallChart not available
Recommendation: EXTEND — Collecting data: not all variants reached 30 runs
sub_agent_strategy·smoke-gemini.mdDoes using sub-agents improve Gemini smoke test success?
📈 View Detailed Statistics
Sample Sizes & Progress
single_agentsub_agentsRecommendation: EXTEND — Collecting data: not all variants reached 30 runs
sub_agent_decomposition·smoke-pi.mdDoes parallel sub-agent decomposition improve Pi calculation quality?
📈 View Detailed Statistics
Sample Sizes & Progress
single_agentparallel_sub_agentsRecommendation: EXTEND — Collecting data: not all variants reached 20 runs
prompt_style·ci-coach.mdH0: no change in PR rate. H1: concise prompt reduces tokens ≥25% without quality loss
📈 View Detailed Statistics
Sample Sizes & Progress
detailedconciseChart not available
Recommendation: EXTEND — Collecting data: not all variants reached 20 runs
output_format·daily-issues-report.md(not specified)
📈 View Detailed Statistics
Sample Sizes & Progress
collapsibleinlineChart not available
Recommendation: EXTEND — Collecting data: not all variants reached 30 runs
prompt_style·daily-community-attribution.md(not specified)
📈 View Detailed Statistics
Sample Sizes & Progress
conciseverboseRecommendation: EXTEND — Collecting data: not all variants reached 20 runs
prompt_style·daily-astrostylelite-markdown-spellcheck.md(not specified)
📈 View Detailed Statistics
Sample Sizes & Progress
concisedetailedChart not available
Recommendation: EXTEND — Collecting data: not all variants reached 30 runs
📊 Summary — All Experiments
View Full Experiments Table
prompt_compressionagentperformanceanalyzersub_agent_strategyagentpersonaexplorerprompt_stylecicoachdetail_leveldailyarchitecturediagramprompt_styledailyastrostylelitemarkdownspellcheckoutput_formatdailycodemetricsprompt_styledailycommunityattributionoutput_formatdailycompilerqualityreasoning_depthdailyfactoutput_formatdailyissuesreportprompt_styledailynewsreasoning_depthdailysecurityredteamsemgrep_output_formatdailysemgrepscanoutput_formatdeepreporttool_verbositygpcleanprompt_styleissuearboristsub_agent_strategysmokeantigravitycavemansmokecopilotsubagent_modelsmokecopilotsub_agent_strategysmokegeminisub_agent_decompositionsmokepitone_styletypistWarning
Firewall blocked 1 domain
The following domain was blocked by the firewall during workflow execution:
proxy.golang.orgSee Network Configuration for more information.
Beta Was this translation helpful? Give feedback.
All reactions