spawner: auto-replace completed Tasks on re-discovery by axon-agent[bot] · Pull Request #415 · kelos-dev/kelos

axon-agent · 2026-02-23T00:20:29Z

🤖 Axon Agent @gjkim42

Summary

Fix the spawner's re-work gap: when a work item is re-discovered (e.g., axon/needs-input label removed), automatically delete the old completed/failed Task and create a fresh one
Eliminates the need for /reset-worker or manual kubectl delete task to re-trigger work on the same issue
Adds 3 new unit tests covering succeeded, failed, and active task re-discovery scenarios

Problem

The current spawner deduplication logic at cmd/axon-spawner/main.go:188 treats any existing Task — whether Running, Succeeded, or Failed — as "already handled":

if !existingTasks[taskName] {
    newItems = append(newItems, item)
}

This creates a re-work bottleneck in the label-based feedback loop used by axon-workers:

Issue gets actor/axon label → spawner creates Task/axon-workers-42
Agent completes work, adds axon/needs-input label
Human reviews, removes axon/needs-input → issue reappears in discovery
Bug: Spawner sees existing Task/axon-workers-42 (Succeeded) and skips it
Issue sits in limbo until TTL (1 hour) deletes the old Task

The /reset-worker GitHub Actions workflow (.github/workflows/reset-axon-worker.yaml) exists specifically to work around this — it requires GKE auth and kubectl access to delete the Task. Issue #369 proposes another workaround for CI-failure retriggers. Both are symptoms of this root cause.

Solution

Change the spawner's deduplication logic to check the Task phase. When a re-discovered item's Task is in a terminal phase (Succeeded/Failed), delete it and treat the item as new:

if phase == TaskPhaseSucceeded || phase == TaskPhaseFailed {
    cl.Delete(ctx, existing)
    newItems = append(newItems, item)
}

Active Tasks (Running/Pending/Waiting) are never affected.

Impact on self-development

Before: Remove axon/needs-input → wait up to 1 hour for TTL, or use /reset-worker
After: Remove axon/needs-input → spawner picks it up on next poll (1 minute for axon-workers)

The /reset-worker workflow remains useful for force-resetting active tasks, but is no longer needed for the common re-work case.

Test plan

3 new unit tests added:
- TestRunCycleWithSource_RediscoveredCompletedTaskIsReplaced — succeeded task is replaced
- TestRunCycleWithSource_RediscoveredFailedTaskIsReplaced — failed task is replaced
- TestRunCycleWithSource_ActiveTaskNotReplaced — running task is NOT replaced
All 25 existing tests pass (some adjusted for new semantics)
go build ./cmd/axon-spawner/ succeeds
Deploy to test cluster and verify re-work loop works end-to-end

Related issues

Partially addresses Workflow: Self-development cron spawners lack resilience and quality controls #287 (problem 1: cron dedup prevents retry after failure)
Root cause behind Workflow: Auto-retrigger axon-workers on CI failure for agent-created PRs #369 (auto-retrigger on CI failure workaround)
Complements Multi run feature #400 (multi-run — this fix is at spawner level, not agent level)

🤖 Generated with Claude Code

Summary by cubic

Auto-replace completed or failed Tasks when their work items are rediscovered, so re-queued issues (e.g., label removed/re-added) are processed on the next poll without using /reset-worker. Active tasks are not touched.

Bug Fixes
- Dedup logic checks Task phase and deletes terminal-phase Tasks on rediscovery before creating a fresh Task.
- Added unit tests covering succeeded, failed, and active scenarios.

^{Written for commit 28ab2a0. Summary will update on new commits.}

When a work item reappears in discovery results (e.g., a label was removed and re-added to re-queue an issue), the spawner now automatically deletes the old completed/failed Task and creates a fresh one. Previously, the spawner skipped any item whose Task already existed regardless of phase, requiring the /reset-worker GitHub Actions workflow or manual Task deletion to re-trigger work. This closes the re-work loop natively: remove the excludeLabel from an issue, and the spawner will pick it up on the next poll cycle without any external intervention. Active (Running/Pending/Waiting) Tasks are never affected — only Tasks in terminal phases (Succeeded/Failed) are replaced. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cubic-dev-ai

No issues found across 2 files

gjkim42 · 2026-02-23T13:09:42Z

Actually, preventing a new task with the same name from being generated is a feature.
TTLSecondsAfterFinished is there to control the minimum interval duration for the same issue.

/reset-worker

gjkim42 · 2026-02-23T13:25:24Z

not to auto-replace completed tasks is intentional in order to have a cooldown for the same issue or PR.
I'll close this PR.

axon-agent bot added generated-by-kelos ok-to-test labels Feb 23, 2026

github-actions bot added needs-priority needs-kind Indicates an issue or PR lacks a kind/* label needs-triage needs-actor labels Feb 23, 2026

cubic-dev-ai bot reviewed Feb 23, 2026

View reviewed changes

gjkim42 closed this Feb 23, 2026

gjkim42 deleted the axon-fake-strategist-20260223-0000 branch February 23, 2026 13:25

kelos-bot bot mentioned this pull request Mar 1, 2026

Workflow: Add kelos-retrospective TaskSpawner for continuous PR outcome analysis and prompt improvement #513

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spawner: auto-replace completed Tasks on re-discovery#415

spawner: auto-replace completed Tasks on re-discovery#415
axon-agent[bot] wants to merge 1 commit intomainfrom
axon-fake-strategist-20260223-0000

axon-agent bot commented Feb 23, 2026 •

edited by cubic-dev-ai bot

Loading

Uh oh!

cubic-dev-ai bot left a comment

Uh oh!

gjkim42 commented Feb 23, 2026

Uh oh!

gjkim42 commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

axon-agent bot commented Feb 23, 2026 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem

Solution

Impact on self-development

Test plan

Related issues

Summary by cubic

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

gjkim42 commented Feb 23, 2026

Uh oh!

gjkim42 commented Feb 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

axon-agent bot commented Feb 23, 2026 •

edited by cubic-dev-ai bot

Loading