[experiments] Daily Experiment Report — 2026-06-08 #37786
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by daily-experiment-report. A newer discussion is available at Discussion #38076. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
🧪 Daily Experiment Report — 2026-06-08
Analysed 32 experiments across 30 workflows. No experiments cleared all guardrails with p < 0.05. 15 ABANDON (guardrail/no-effect), 17 EXTEND (collecting data).
⚡ Quick Stats
smoke-copilot/subagent_model❌ ABANDONsmalllargeRationale: No significant effect detected (p = 0.0582, all variants reached min_samples).
smoke-gemini/sub_agent_strategy❌ ABANDONsingle_agentsub_agentssingle_agent.success_rate= 0.83 (need >=0.95)Rationale: Guardrail failure: success_rate >=0.95 (got 0.83 for
single_agent)smoke-antigravity/sub_agent_strategy❌ ABANDONsingle_agentsub_agentssub_agents.success_rate= 0.88 (need >=0.95)Rationale: Guardrail failure: success_rate >=0.95 (got 0.88 for
sub_agents)daily-astrostylelite-markdown-spellcheck/prompt_style🟡 EXTENDconcisedetailedRationale: Still collecting data: concise=16/30, detailed=20/30
agent-performance-analyzer/prompt_compression❌ ABANDONverbosecavemancaveman.run_success_rate= 0.67 (need >=0.90)Rationale: Guardrail failure: run_success_rate >=0.90 (got 0.67 for
caveman)📋 All 32 Experiments Summary
agent-performance-analyzerprompt_compressionci-coachprompt_styledaily-architecture-diagramdetail_leveldaily-cache-strategy-analyzermodel_sizedaily-caveman-optimizermodel_sizedaily-compiler-qualityoutput_formatdaily-doc-healermodel_sizedaily-doc-updatermodel_sizedaily-factreasoning_depthdaily-function-namermodel_sizedaily-security-red-teamreasoning_depthsmoke-antigravitysub_agent_strategysmoke-copilotcavemansmoke-copilotsubagent_modelsmoke-geminisub_agent_strategyagent-persona-explorersub_agent_strategyaw-failure-investigatortone_variantdaily-agentrx-trace-optimizersub_agent_strategydaily-astrostylelite-markdown-spellcheckprompt_styledaily-code-metricsoutput_formatdaily-community-attributionprompt_styledaily-issues-reportoutput_formatdaily-newsprompt_styledaily-semgrep-scansemgrep_output_formatdeep-reportoutput_formatdependabot-campaignsummary_detaildependabot-go-checkerprompt_stylegpcleantool_verbosityissue-arboristprompt_stylesmoke-copilot-aoai-apikeycavemansmoke-copilot-aoai-apikeysubagent_modeltypisttone_styleReferences: §27127971961
Warning
Firewall blocked 1 domain
The following domain was blocked by the firewall during workflow execution:
proxy.golang.orgSee Network Configuration for more information.
Beta Was this translation helpful? Give feedback.
All reactions