Nightly 2026-05-08 — 1 productive cycle, +0 code-driven goals (already 100%), closed ONE-TIME #7 probe registry #265
Closed
boshu2 wants to merge 2 commits into
The curator's staleness probes were a hardcoded chain of `if probeX(...);
ok` calls inside `probeDreamPacketStaleness`. Adding a new false-positive
title shape required editing that chain plus writing a Go function — two
edits, two review surfaces, easy to forget the second.
Introduce `dreamProbeShape{ID, Probe}` with a registry slice
`dreamProbeShapes`. `probeDreamPacketStaleness` now walks the registry
and returns the first conclusive match. Adding a new shape is a registry
append: write the probe func (or a thin closure adapter for the existing
`extract*` helpers), append `{ID, Probe}` to the slice. The existing
shape functions are unchanged so all probe-shape tests continue to pass
verbatim.
Registers the four shipped shapes:
- target-files-exist
- repo-ref-tokens
- skill-line-limit-claim
- add-to-skill-claim
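The registry walk described above can be sketched roughly as follows. This is a minimal illustration, not the code in `overnight_packets.go`: the probe signature (a title in, a reason and a conclusive flag out), the helper name `probeStaleness`, and the placeholder probe bodies are all assumptions made for the example. Only `dreamProbeShape`, `dreamProbeShapes`, the `ID`/`Probe` fields, and the shape IDs come from the commit message.

```go
package main

import (
	"fmt"
	"strings"
)

// dreamProbeShape pairs a stable ID with a probe function.
// The probe signature here is an assumption for illustration.
type dreamProbeShape struct {
	ID    string
	Probe func(title string) (reason string, conclusive bool)
}

// dreamProbeShapes is walked in order, so the order decides which
// reason surfaces first when several shapes would match.
var dreamProbeShapes = []dreamProbeShape{
	{ID: "target-files-exist", Probe: func(title string) (string, bool) {
		// Placeholder logic; the real probe is not shown in the source.
		return "target files already exist", strings.Contains(title, "create file")
	}},
	{ID: "repo-ref-tokens", Probe: func(title string) (string, bool) {
		// Placeholder logic; the real probe is not shown in the source.
		return "referenced tokens found in repo", strings.Contains(title, "missing ref")
	}},
}

// probeStaleness (hypothetical name) returns the first conclusive match,
// or ok=false when no registered shape is conclusive.
func probeStaleness(title string) (id, reason string, ok bool) {
	for _, shape := range dreamProbeShapes {
		if r, conclusive := shape.Probe(title); conclusive {
			return shape.ID, r, true
		}
	}
	return "", "", false
}

func main() {
	id, reason, ok := probeStaleness("missing ref to foo")
	fmt.Println(id, reason, ok)
}
```

Because the walk stops at the first conclusive probe, keeping the slice in the same order as the old `if` chain is what keeps the surfaced reasons unchanged.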
New test `TestDreamProbeShapes_RegistryIDsAreUniqueAndStable` guards the
registry contract: every shape has a non-empty ID, IDs are unique, the
canonical four are present.
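The contract that test guards could be checked along these lines. This is a sketch under stated assumptions: the `shape` type and the `checkRegistry` helper are hypothetical names invented for the example, and the real test body is not shown in the source.

```go
package main

import "fmt"

// shape is a hypothetical, trimmed-down registry entry (ID only).
type shape struct{ ID string }

// checkRegistry reports the first contract violation it finds:
// an empty ID, a duplicate ID, or a missing canonical ID.
func checkRegistry(shapes []shape, canonical []string) error {
	seen := make(map[string]bool, len(shapes))
	for i, s := range shapes {
		if s.ID == "" {
			return fmt.Errorf("shape %d has an empty ID", i)
		}
		if seen[s.ID] {
			return fmt.Errorf("duplicate shape ID %q", s.ID)
		}
		seen[s.ID] = true
	}
	for _, id := range canonical {
		if !seen[id] {
			return fmt.Errorf("canonical shape %q is missing", id)
		}
	}
	return nil
}

func main() {
	canonical := []string{
		"target-files-exist", "repo-ref-tokens",
		"skill-line-limit-claim", "add-to-skill-claim",
	}
	registry := []shape{
		{ID: "target-files-exist"}, {ID: "repo-ref-tokens"},
		{ID: "skill-line-limit-claim"}, {ID: "add-to-skill-claim"},
	}
	fmt.Println(checkRegistry(registry, canonical))
}
```

Pinning the canonical IDs in the check means a rename or accidental removal fails loudly instead of silently changing which shapes are probed.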
Behaviour-preserving. Order in the registry matches the prior `if`
chain, so degraded-finding reasons surface in the same order. Tests:
`go test ./cmd/ao` 29.2s, all probe-shape tests pass.
Closes the curator side of one-time-fix #7 from the nightly routine
prompt: "Promote dream-curator probe shapes to a registry table; emit-
side probing must consult it." Emit-side path (`probeDreamPacket` ->
`probeDreamPacketStaleness`) now consults `dreamProbeShapes` directly.
This is cycle 1 of nightly/2026-05-08. Includes the run's tracked
audit-truth artefacts under `.agents/nightly/2026-05-08/` and bootstraps
`.agents/evolve/cycle-history.jsonl` + `.agents/evolve/session-state.json`
that compound across nightlies.
https://claude.ai/code/session_01Gq1ohV1YWC55EUdycQEbfX
Captures the post-cycle-1 fitness snapshot (21 pass / 1 skip / 0 fail — code-driven 100%, runtime-artifact 2/2) and resolves the pending commit SHA in cycle-history.jsonl to 3df3689. Audit-truth bookkeeping for the nightly digest; no production code change. https://claude.ai/code/session_01Gq1ohV1YWC55EUdycQEbfX
boshu2 added a commit that referenced this pull request May 11, 2026
Cherry-pick the code-only part of PR #265, excluding stale .agents nightly runtime artifacts.
Owner, Author
Code-only portion cherry-picked to main in 635ca9d. Closing this stale nightly PR rather than merging the repo-root .agents runtime artifacts.
TL;DR
- `flywheel-compounding` under f-2026-04-30-002 dormancy precondition).
- `compile-freshness`, `compile-no-oscillation`; both flipped because Dream re-ran during this routine and rewrote `.agents/overnight/latest/defrag/latest.json`, which the goal scripts now fall back to. Per goals.yaml `runtime-artifact` tag these are excluded from the headline).
- `main` at prelude.
- `bats` not installed in canonical VM image fails `pre-push-bats-suite` and `release-gate-contract-tests` with exit 127; `cursor-runtime-smoke` fails for similar reasons). Verified by running `pre-push-gate.sh --fast` at HEAD == origin/main before any cycle began. None introduced by this branch.
Cycle summary
`cli/cmd/ao/overnight_packets.go`: dream-curator probe-shape registry table (`3df3689`)
The headline is flat because the repo entered the run at code-driven 100%. The cycle closes one of the prompt's named ONE-TIME REPO FIXES (#7: "Promote dream-curator probe shapes to a registry table; emit-side probing must consult it"). PR #252's body explicitly listed this as deferred; it has now landed.
Cycle 1 — dream-curator probe-shape registry
Refactor `probeDreamPacketStaleness` from a hardcoded chain of `if probeX(...); ok` calls into a registry walk over `dreamProbeShapes []dreamProbeShape{ID, Probe}`. Adding a new false-positive title shape used to require two coordinated edits (write a new Go function AND extend the `if` chain), which created drift; now it is a single registry append.
Registers the four shipped shapes:
- `target-files-exist`
- `repo-ref-tokens`
- `skill-line-limit-claim`
- `add-to-skill-claim`
Behaviour-preserving: order in the registry matches the prior `if` chain, so degraded-finding `reason` strings surface in the same order. New test `TestDreamProbeShapes_RegistryIDsAreUniqueAndStable` guards the registry contract (every shape has a non-empty unique ID; canonical four are present). All existing probe-shape tests pass verbatim.
End-to-end verification: `cd cli && go test ./cmd/ao` 30.9s green; `ao overnight start --max-iterations 1 --warn-only` re-run against this branch synthesizes 0 morning packets against the dormant corpus, same as the baseline, and the curator's probe pipeline still walks the registered shapes (no behavior delta).
Fitness delta (per goal)
The two `runtime-artifact` flips (`compile-freshness`, `compile-no-oscillation`) are a Dream side effect (Dream wrote `.agents/overnight/latest/defrag/latest.json` and the goal scripts read that fallback path); they do not contribute to the headline delta per the prompt's runtime-artifact tabulation rule.
Stale-audit accounting
`497dcdd` audit-truth bookkeeping commit is a follow-up to cycle 1, not a cycle of its own; cycle accounting follows the productive-cycle definition exactly).
Auto-reverts and transient flakes
`go-cli-tests` truncation observed in the very first prelude measure as in PR #252 (Nightly 2026-05-07, 3 productive cycles, +0 code-driven goals (already 100%), 2 audit-truth regressions fixed); warmed cache via `go test -count=1 ./internal/...` before snapshotting baseline. The persisted baseline-goals.json shows `go-cli-tests` PASS).
Quarantined goals (monitored, no fitness contribution)
- `quarantined: true` in goals.yaml.
- `flywheel-compounding` (w=3, long-cycle, corpus-state) remains in SKIP state under the f-2026-04-30-002 precondition: the corpus is fully dormant (σ=0 ρ=0, 0 citations, 0 artifacts, 0 learnings created over the last 7 days). Not re-attempted; will not be re-attempted on dormant corpus.
Heavy-goal attempt history
No heavy goal (weight ≥ 5) was failing in code-driven space at the start of this run, so the heavy-goal rung of the selection ladder was empty. Time-pressure escalation did not fire because no unblocked heavy failing goals exist. The prompt's rule states "the escalation does NOT fire if every heavy goal is in the blocker matrix; in that case drop to ladder step 2", and an empty heavy-failing set leads to the same outcome. No `attempts.jsonl` entries authored this run.
Dream probe-results summary
- `mine-findings`, 0 from `external-watchlist`.
- `close_loop_promoted=0`, `harvest_promoted=7` (auto-promotion only, none routed to packets). Corpus dormancy continues to suppress Dream's actionable output, which is the steady state since the f-2026-04-30-002 SKIP precondition.
The probe-shape registry refactor (cycle 1) does not change probe behavior; it only changes how new shapes are added. The next time a recurring stale shape is observed in production, the work to suppress it is a registry append rather than a coordinated function-write + chain-edit.
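A registry append with a thin closure adapter might look like the sketch below. Everything specific in it is hypothetical: the `extractIssueRef` helper stands in for an `extract*`-style function, the `closed-issue-ref` shape ID is invented, and the probe signature is assumed rather than taken from the repository.

```go
package main

import (
	"fmt"
	"regexp"
)

// dreamProbeShape mirrors the {ID, Probe} pair named in the PR;
// the probe signature is an assumption for illustration.
type dreamProbeShape struct {
	ID    string
	Probe func(title string) (reason string, conclusive bool)
}

var issueRef = regexp.MustCompile(`#(\d+)`)

// extractIssueRef is a hypothetical extract*-style helper: it returns
// the first issue number mentioned in the title, if there is one.
func extractIssueRef(title string) (string, bool) {
	m := issueRef.FindStringSubmatch(title)
	if m == nil {
		return "", false
	}
	return m[1], true
}

// Adding a shape is one new slice entry. The closure adapts the
// helper's return values to the probe signature.
var dreamProbeShapes = []dreamProbeShape{
	// ...existing shapes would be listed here...
	{
		ID: "closed-issue-ref", // hypothetical shape ID
		Probe: func(title string) (string, bool) {
			n, ok := extractIssueRef(title)
			if !ok {
				return "", false
			}
			return "title references issue #" + n, true
		},
	},
}

func main() {
	last := dreamProbeShapes[len(dreamProbeShapes)-1]
	reason, ok := last.Probe("Fix flaky test from #42")
	fmt.Println(last.ID, reason, ok)
}
```

The adapter keeps the existing helper untouched, which is the property the PR relies on to leave the current probe-shape tests passing.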
Open-PR blocker matrix
`{"open_prs": 0, "blocked_goals": [], "matrix": []}`
No nightly or triage PRs open against `main` at prelude. Persisted to `.agents/nightly/2026-05-08/blocker-matrix.json`.
Inherited-from-main CI failures
Verified by running `scripts/pre-push-gate.sh --fast` at HEAD == origin/main (before any cycle commit). All three failures are environment-related and not introduced by this branch. Persisted to `.agents/nightly/2026-05-08/main-ci-baseline.json`:
- `agentops-core.pre-push-gate-governance / pre-push-bats-suite`: exit 127 (`bats` not installed in canonical VM image)
- `agentops-core.release-security-gates / release-gate-contract-tests`: exit 127 (`bats` not installed in canonical VM image)
- `agentops-core.runtime-smoke-matrix / cursor-runtime-smoke`: exit 1 (cursor runtime smoke harness fails in canonical VM image)
The `bats` installation is one of the prompt's "ONE-TIME REPO FIXES" (#4 / #4-adjacent; a VM image change, not a per-PR fix).
Findings
- `.agents/findings/f-*.md`.
- `.agents/findings/`; the registry remains empty across nightlies because no run has yet authored to it. This is mechanically correct given corpus dormancy: Dream's `mine-findings` and `external-watchlist` generators both report `candidate_count=0` this run).
ONE-TIME REPO FIX status (from the routine prompt):
- `.gitignore` exceptions for persisted JSON paths
- `flywheel-compounding` returns SKIP)
- `compile-freshness` / `compile-no-oscillation` as `runtime-artifact` in goals.yaml
- `bd` in canonical VM image
- `WORKTREE_DISPOSITION_ALLOW=nightly/*` env support
- `affects_files:` per goal in goals.yaml
- `goals-affects-files.yaml`)
- `.agents/findings/registry.jsonl`
- 7/11 numbered findings
bd
bd unavailable in this VM image. Selection ladder step 3 (open ready beads) skipped per spec (single-line note in the prompt's bd-unavailable handling).
Previous-run audit (anchor: `60911ff`, Nightly 2026-05-07 #252 merge commit)
The branch `origin/nightly/2026-05-07` was deleted post-merge, so the audit anchors on the merge commit instead. Used the persisted, tracked `baseline-goals.json` from `.agents/nightly/2026-05-07/` for the fitness comparison (this is the first nightly after the gitignore force-track fix in which a real anchor JSON exists in main).
- `.agents/.gitignore` reconcile), Wave 1A-D: factory-claim-ledger reconciliation (soc-e4ulx) #264 (factory-claim-ledger reconciliation Wave 1A-D).
Full audit: `.agents/nightly/2026-05-08/previous-run-audit.md`.
Commits
- `3df3689` refactor(dream): promote curator probe shapes to a registry table
- `497dcdd` chore(nightly): persist final-goals.json and cycle-history commit SHA
Test plan
- `cd cli && go build ./...` clean
- `cd cli && go vet ./...` clean
- `cd cli && go test -count=1 ./...` green (cmd/ao 29.2s, all internal/ packages cached green)
- `cd cli && env -u AGENTOPS_RPI_RUNTIME go run ./cmd/ao autodev validate --file ../PROGRAM.md --json` valid
- `bash skills/heal-skill/scripts/heal.sh --strict` → "All clean. No findings."
- `WORKTREE_DISPOSITION_ALLOW='nightly/*' bash scripts/check-worktree-disposition.sh` → PASS (verifies ONE-TIME #5 still works)
- `ao goals measure --json` → code-driven 100% (19/19 + 1 skip), runtime-artifact 2/2 (final)
- `ao overnight start --max-iterations 1 --warn-only` re-run against this branch: same 0-morning-packet behavior as the baseline (probe registry is behaviour-preserving)
- `pre-push-bats-suite`, `release-gate-contract-tests`, `cursor-runtime-smoke`); see "Inherited-from-main CI failures" above. No new red signals expected.
https://claude.ai/code/session_01Gq1ohV1YWC55EUdycQEbfX
Generated by Claude Code