You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
🧪 Daily Experiment Report — 2026-06-17
41 experiments · 38 workflows · 1,396 runs · 🟢 READY: 8 · 🟡 EXTEND: 33 · ❌ ABANDON: 0
🟢 Ready for Analysis
caveman·smoke-copilot· 164 runs · balance: ✅ (χ2=0.000, p=1.0000)Recommendation: READY_FOR_ANALYSIS — All variants reached minimum samples and allocation appears balanced
subagent_model·smoke-copilot· 144 runs · balance: ✅ (χ2=0.000, p=1.0000)Recommendation: READY_FOR_ANALYSIS — All variants reached minimum samples and allocation appears balanced
sub_agent_strategy·smoke-gemini· 127 runs · balance:Recommendation: READY_FOR_ANALYSIS — All variants reached minimum samples; allocation appears imbalanced but analysis can proceed with caution
sub_agent_strategy·smoke-antigravity· 98 runs · balance: ✅ (χ2=0.041, p=0.8399)Recommendation: READY_FOR_ANALYSIS — All variants reached minimum samples and allocation appears balanced
caveman·smoke-copilot-aoai-apikey· 46 runs · balance: ✅ (χ2=0.000, p=1.0000)Recommendation: READY_FOR_ANALYSIS — All variants reached minimum samples and allocation appears balanced
subagent_model·smoke-copilot-aoai-apikey· 46 runs · balance: ✅ (χ2=0.000, p=1.0000)Recommendation: READY_FOR_ANALYSIS — All variants reached minimum samples and allocation appears balanced
sub_agent_decomposition·smoke-pi· 76 runs · balance: ✅ (χ2=0.842, p=0.3588)Recommendation: READY_FOR_ANALYSIS — All variants reached minimum samples and allocation appears balanced
prompt_style·daily-community-attribution· 43 runs · balance: ✅ (χ2=0.023, p=0.8788)Recommendation: READY_FOR_ANALYSIS — All variants reached minimum samples and allocation appears balanced
🟡 EXTEND — Experiments with Tracking Issues
tone_variant·aw-failure-investigator·#36105· 66 runsSlowest:
assertive██░░░░ 17/50 (34%)output_format·daily-issues-report·#30573· 42 runsSlowest:
inline████░░ 20/30 (66%)reasoning_depth·daily-security-red-team·#31673· 36 runsSlowest:
single_pass███░░░ 17/30 (56%)prompt_style·daily-news·#31190· 34 runsSlowest:
concise███░░░ 13/30 (43%)output_format·daily-compiler-quality·#32390· 32 runsSlowest:
detailed████░░ 13/20 (65%)semgrep_output_format·daily-semgrep-scan·#32795· 31 runsSlowest:
bullet_list██░░░░ 10/30 (33%)output_format·daily-code-metrics·#1· 31 runsSlowest:
executive_summary████░░ 15/20 (75%)prompt_compression·agent-performance-analyzer·#33280· 28 runsSlowest:
verbose█████░ 11/14 (78%)reasoning_depth·daily-fact·#31324· 24 runsSlowest:
multi_candidate█░░░░░ 7/30 (23%)prompt_style·ci-coach·#32335· 23 runsSlowest:
concise███░░░ 9/20 (45%)tone_style·typist·#34032· 16 runsSlowest:
formal████░░ 7/10 (70%)summary_detail·dependabot-campaign·#37533· 10 runsSlowest:
brief█░░░░░ 5/30 (16%)prompt_style·issue-arborist·#30015· 4 runsSlowest:
concise░░░░░░ 2/30 (6%)detail_level·daily-architecture-diagram·#31926· 3 runsSlowest:
comprehensive░░░░░░ 0/10 (0%)sub_agent_strategy·architecture-guardian·#39062· 2 runsSlowest:
sub_agents░░░░░░ 1/30 (3%)prefetch_strategy·weekly-blog-post-writer·#38590· 1 runsSlowest:
eager░░░░░░ 0/10 (0%)caveman_mode·dataflow-pr-discussion-dataset·#37102· 1 runsSlowest:
no░░░░░░ 0/10 (0%)All 33 EXTEND experiments (compact)
tone_variantaw-failure-investiga#36105prompt_styledaily-astrostyleliteoutput_formatdaily-issues-report#30573cavemansmoke-copilot-aoai-esubagent_modelsmoke-copilot-aoai-ereasoning_depthdaily-security-red-t#31673prompt_styledaily-news#31190output_formatdaily-compiler-quali#32390semgrep_output_formatdaily-semgrep-scan#32795output_formatdaily-code-metrics#1output_formatdeep-reportsub_agent_strategyagent-persona-explorprompt_compressionagent-performance-an#33280tool_verbositygpcleanreasoning_depthdaily-fact#31324prompt_styleci-coach#32335tone_styletypist#34032model_sizedaily-doc-healermodel_sizedaily-caveman-optimisub_agent_strategydaily-agentrx-trace-model_sizedaily-cache-strategymodel_sizedaily-function-namermodel_sizedaily-doc-updatersummary_detaildependabot-campaign#37533output_formatcopilot-agent-analysprompt_styledependabot-go-checkelog_fetch_strategydaily-safe-output-optimeout_settingdailysubagentoptimizprompt_styleissue-arborist#30015detail_leveldaily-architecture-d#31926sub_agent_strategyarchitecture-guardia#39062prefetch_strategyweekly-blog-post-wri#38590caveman_modedataflow-pr-discussi#37102Warning
Firewall blocked 1 domain
The following domain was blocked by the firewall during workflow execution:
proxy.golang.orgSee Network Configuration for more information.
Beta Was this translation helpful? Give feedback.
All reactions