oversee: triage task queue — close 16 duplicates and superseded tasks #252
Merged
…one)
Queue before: 72 pending + 9 wontfix-in-active-dir
Queue after: 65 pending + 0 wontfix (all converted to done for archiving)
Merged into primary tasks (5 closures):
- #175 -> #174: both add tests to TestAuthFailureDetection, same PR
- #163 -> #162: both are scoring module tests from PR #158 review, same PR
- #124 -> #122: both validate doc snapshot consistency, same PR scope
- #196 -> #173: both add entries to PROMPT_GUARD_FILES in lib-agent.sh
- #180 -> #179: both touch _is_valid_eval_file() in pick-role.py, same PR
Closed as obsolete (1):
- #78: references non-existent "evolve.md Step 8" and the multi-agent review panel replaced by unified review in PR #107
Closed as low-value (1):
- #230: _DELEGATION_ROLE_MAP covers all 8 current agent types; new agent types require major framework work making the map update obvious
Converted wontfix -> done for archiving (9):
- #77, #80, #107, #111, #115, #119, #127, #129, #134
All had wontfix status with rationale already documented; changed to done so daemon's archive_done_tasks() housekeeping removes them
Summary
First OVERSEE delegation in the v2 brain era. Addresses task #225 (queue growing) and #226 (brain never uses oversee).
Queue: 72 pending -> 65 pending (9 wontfix-in-active-dir -> done for archiving)
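The wontfix -> done conversion can be sketched as follows. This is a minimal sketch under stated assumptions, not the change actually made in this PR: it assumes each task file carries a single `status: <value>` line, and the names `normalize_status`, `STATUS_LINE`, and `VALID_STATUSES` are hypothetical. The set of valid statuses is the one the PR description attributes to the task guide.

```python
import re

# Valid statuses as listed in this PR's description of the task guide.
VALID_STATUSES = {"pending", "in-progress", "done", "blocked"}

# Assumption: a task file contains a line of the form "status: <value>".
STATUS_LINE = re.compile(r"^status:[ \t]*(\S+)[ \t]*$", re.MULTILINE)


def normalize_status(text: str) -> str:
    """Rewrite `status: wontfix` to `status: done`; reject other unknown values."""
    match = STATUS_LINE.search(text)
    if match is None:
        raise ValueError("no status line found")
    status = match.group(1)
    if status == "wontfix":
        # Replace only the status value, leaving the rest of the file untouched.
        return text[:match.start(1)] + "done" + text[match.end(1):]
    if status not in VALID_STATUSES:
        raise ValueError(f"unknown status: {status}")
    return text


# Hypothetical task file content, for illustration only.
example = "id: 0077\nstatus: wontfix\nrationale: documented\n"
print(normalize_status(example))
```

Rewriting only the status value keeps the documented rationale in place, so the archived task still records why it was not fixed.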
Tasks Closed from Pending (7)
Merged into primary tasks (5)
- #175 -> #174: both add tests to `TestAuthFailureDetection` in `test_nightshift.py`. Same PR scope: the malformed-JSON test (#175) is a natural addition alongside the non-result-event test (#174).
- #163 -> #162: both are scoring module tests from the PR #158 review. The `_extract_cycle_fixes` empty-fixes test (#163) belongs with the mixed accepted+rejected run test (#162).
- #124 -> #122: both validate doc snapshot consistency (`validate-docs.sh` pass), same PR.
- #196 -> #173: both add entries to `PROMPT_GUARD_FILES` in `lib-agent.sh`. One-line additions to the same array.
- #180 -> #179: both touch `_is_valid_eval_file()` in `pick-role.py`, same PR #170 origin. CRLF test + stderr warning belong together.
Closed as obsolete (1)
- #78: references non-existent "evolve.md Step 8" (`evolve/SKILL.md` has 6 steps). Also references the multi-agent review panel (PR #63), which was replaced by unified review (PR #107). The concept is valid but the task instructions target non-existent infrastructure.
Closed as low-value (1)
- #230: `_DELEGATION_ROLE_MAP` already covers all 8 current agent types. New agent types require major framework changes where updating a 10-line dict is trivially obvious. Checklist/test overhead exceeds value.
Wontfix -> Done (9 tasks)
Tasks 0077, 0080, 0107, 0111, 0115, 0119, 0127, 0129, 0134 already had `status: wontfix` with documented rationale. Converted to `status: done` so the daemon's `archive_done_tasks()` housekeeping removes them from the active directory. `wontfix` is not a valid status per the task guide (only `pending`, `in-progress`, `done`, `blocked`).
Verification
- `make check` passes (1164 tests)
Tasks NOT Closed (kept as genuine work)
Reviewed all 72 pending tasks. The following were examined for possible closure and kept as genuine work:
- `|| true` silently swallows errors
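The entry above is truncated in the source, so the code it targets is not known. As a general illustration only, the following sketch shows why `|| true` hides failures: the shell reports the exit status of `true` instead of the failed command. It assumes a POSIX `sh` is available on the machine running it.

```python
import subprocess

# `false` exits with a nonzero status, so the caller can see the failure.
failing = subprocess.run(["sh", "-c", "false"])

# With `|| true` appended, the shell runs `true` after the failure and
# reports its exit status (0), so the failure is invisible to the caller.
masked = subprocess.run(["sh", "-c", "false || true"])

print("without '|| true':", failing.returncode)
print("with '|| true':", masked.returncode)
```

Any caller that checks only the exit status will treat the masked command as a success.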