[experiments] Daily Experiment Report — 2026-06-07 #37519
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by daily-experiment-report. A newer discussion is available at Discussion #37786. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
🧪 Daily Experiment Report — 2026-06-07
33 experiments across 31 workflows. 4 ready · 1 significant · ✅ 0 · 🟡 27 · ❌ 6
⚡ Quick Stats
📊 Featured Experiments
caveman—smoke-copilot🟢 READYno: 60 runs, 36% succ ·yes: 61 runs, 38% succp = 0.919 · ❌ ABANDON — No significant effect detected (p=0.9193) with sufficient samples.
📈 Charts
subagent_model—smoke-copilot🟢 READYlarge: 50 runs, 53% succ ·small: 51 runs, 20% succp = 0.058 · ❌ ABANDON — No significant effect detected (p=0.0582) with sufficient samples.
📈 Charts
reasoning_depth—daily-security-red-team🟡 COLLECTINGiterative: 15 runs, 100% succ ·single_pass: 11 runs, 73% succp = 0.032* · ❌ ABANDON — Guardrail metric violation detected.
📈 Charts
output_format—daily-issues-report🟡 COLLECTINGcollapsible: 14 runs, 0% succ ·inline: 17 runs, 0% succp = N/A · 🟡 EXTEND — Still collecting data — min_samples (30) not yet reached for all variants.
📈 Charts
output_format—deep-report🟡 COLLECTINGexecutive_brief: 9 runs, 100% succ ·full_briefing: 9 runs, 100% succ ·annotated_brief: 6 runs, 83% succp = 0.205 · 🟡 EXTEND — Still collecting data — min_samples (15) not yet reached for all variants.
📈 Charts
📋 All Experiments
View All 33 Experiments
cavemansmoke-copilotno(60),yes(61)subagent_modelsmoke-copilotlarge(50),small(51)sub_agent_strategysmoke-geminisub_agents(54),single_agent(32)sub_agent_decompositionsmoke-pisingle_agent(34),parallel_sub_agents(42)sub_agent_strategysmoke-antigravitysingle_agent(27),sub_agents(30)prompt_styledaily-astrostylelite-markdown-spellcheckdetailed(20),concise(15)prompt_styledaily-community-attributionconcise(16),verbose(17)output_formatdaily-issues-reportcollapsible(14),inline(17)prompt_styledaily-newsdetailed(17),concise(10)tone_variantaw-failure-investigatorclinical(10),narrative(10),assertive(6)reasoning_depthdaily-security-red-teamiterative(15),single_pass(11)output_formatdeep-reportexecutive_brief(9),full_briefing(9),annotated_brief(6)output_formatdaily-compiler-qualitydetailed(9),concise(13)output_formatdaily-code-metricsexecutive_summary(11),full_detail(10)semgrep_output_formatdaily-semgrep-scanstructured_sections(9),prose(6),bullet_list(6)prompt_compressionagent-performance-analyzercaveman(11),verbose(7)sub_agent_strategyagent-persona-explorerbatch(13),per_scenario(5)reasoning_depthdaily-factsingle_pass(11),multi_candidate(6)prompt_styleci-coachconcise(6),detailed(10)tool_verbositygpcleanfull_bash(5),minimal_toolset(9)tone_styletypistconversational(5),formal(4)sub_agent_strategydaily-agentrx-trace-optimizersub_agents(1),single_agent(3)timeout_settingNonedefault(2),relaxed(1),tight(1)prompt_styledependabot-go-checkerstep_by_step(1),concise(2),detailed(1)prompt_styleissue-arboristconcise(2),detailed(2)model_sizedaily-doc-healersmall-agent(1),agent(2)cavemansmoke-copilot-aoai-apikeyyes(2),no(1)subagent_modelsmoke-copilot-aoai-apikeysmall(2),large(1)model_sizedaily-cache-strategy-analyzersmall-agent(1),agent(1)model_sizedaily-caveman-optimizersmall-agent(2)model_sizedaily-doc-updatersmall-agent(2)model_sizedaily-function-nameragent(2)detail_leveldaily-architecture-diagrambrief(1)Warning
Firewall blocked 2 domains
The following domains were blocked by the firewall during workflow execution:
proxy.golang.orgreleaseassets.githubusercontent.comSee Network Configuration for more information.
Beta Was this translation helpful? Give feedback.
All reactions