[experiments] Daily Experiment Report — 2026-05-05 #30314
Closed
This discussion has been marked as outdated by daily-experiment-report. A newer discussion is available at Discussion #30799.
🧪 Daily Experiment Report — 2026-05-05
4 experiments analysed across 4 workflows. No experiments have reached statistical readiness — all are in the data-collection phase. Recommendation for all active experiments: EXTEND.
📊 Overview
Experiment 1: daily-community-attribution · prompt_style
Experiment 2: daily-astrostylelite-markdown-spellcheck · prompt_style
Experiment 3: smoke-copilot · caveman
Experiment 4: issue-arborist · prompt_style
🔍 Infrastructure Finding
A bug was identified in experiment state persistence. In push_experiment_state.cjs, when a new orphan branch is created for the first time, checkoutOrCreateBranch() returns an empty string "" as baseRef. This causes pushSignedCommits to compute git rev-list ""..HEAD, which yields zero commits. The function then logs "no new commits to push via GraphQL" and returns without error, even though the state was never pushed to the remote branch. Subsequent runs all start from empty state rather than accumulating history.
Impact: Per-variant cumulative counts and run histories are not being persisted. Each run picks variants randomly instead of using the balanced least-count strategy over the full history.
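The empty range is consistent with how git parses revision ranges: an omitted side of A..B defaults to HEAD, so ""..HEAD is read as HEAD..HEAD, which never contains commits. One possible guard is sketched below. It is an illustration only, not the code in push_signed_commits.cjs: the helper name buildRevListArgs and the example ref origin/experiments are hypothetical.

```javascript
// Hypothetical helper, NOT the actual implementation in push_signed_commits.cjs.
// Builds the argument list for `git rev-list` that enumerates commits to push.
function buildRevListArgs(baseRef) {
  // An empty or missing baseRef is assumed to mean the branch is brand new
  // (for example a freshly created orphan branch), so there is no base to
  // diff against.
  if (typeof baseRef !== "string" || baseRef.trim() === "") {
    // List every commit reachable from HEAD instead of building "..HEAD",
    // which git resolves to HEAD..HEAD and which is therefore always empty.
    return ["rev-list", "HEAD"];
  }
  // Normal incremental case: commits on HEAD that are not on baseRef.
  return ["rev-list", `${baseRef}..HEAD`];
}

// Example usage; "origin/experiments" is a made-up ref name.
console.log(buildRevListArgs("").join(" "));
console.log(buildRevListArgs("origin/experiments").join(" "));
```

With a guard of this kind, the first push to a new orphan branch would enumerate the initial commit rather than silently finding nothing. Whether the real fix belongs in pushSignedCommits or in checkoutOrCreateBranch() is left open here.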
Tracking: See push_signed_commits.cjs line 138 and push_experiment_state.cjs for the orphan branch path.
📋 Summary Table
*smoke-copilot count includes all non-skipped runs; per-variant split is random (state not persisted)
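For contrast with the random split noted above, a balanced least-count selection over persisted counts could look roughly like the following. This is a minimal sketch under assumptions: the function name pickLeastCountVariant, the shape of the counts object, and the variant names are all hypothetical and not taken from the workflow code.

```javascript
// Hypothetical sketch of a least-count variant picker.
// `variants` is the list of variant names; `counts` maps a variant name to
// its cumulative run count (assumed shape of the persisted state).
function pickLeastCountVariant(variants, counts, random = Math.random) {
  if (variants.length === 0) {
    throw new Error("at least one variant is required");
  }
  // Variants missing from the persisted state are treated as having zero runs.
  const countOf = (v) => counts[v] ?? 0;
  const min = Math.min(...variants.map(countOf));
  const candidates = variants.filter((v) => countOf(v) === min);
  // Break ties randomly so no variant is systematically favoured.
  return candidates[Math.floor(random() * candidates.length)];
}

// Example usage with made-up variant names and counts.
console.log(pickLeastCountVariant(["control", "treatment"], { control: 3, treatment: 1 }));
```

When the state is empty on every run, as described in the infrastructure finding, all variants tie at zero and this selection degrades to a uniform random pick.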
References: §25366113777
Warning
Firewall blocked 3 domains
The following domains were blocked by the firewall during workflow execution:
productionresultssa13.blob.core.windows.net
productionresultssa14.blob.core.windows.net
proxy.golang.org
See Network Configuration for more information.