feat: [ENG-2499] Daemon handlers (list paginated, get, delete, bulk, … by bao-byterover · Pull Request #568 · campfirein/byterover-cli

bao-byterover · 2026-04-27T13:47:02Z

No description provided.

…clear) + task:deleted broadcast

github-actions · 2026-04-27T13:47:20Z

Claude finished @bao-byterover's task in 4m 19s —— View job

PR Review — feat: [ENG-2499] Daemon handlers (list paginated, get, delete, bulk, clear)

Summary

Overall score: 3 / 5

Good structural work — the DIP-injected getTaskHistoryStore factory, graceful degradation when the store is absent, and the broadcast-on-delete pattern are all sound. The test suite is the highlight: real FileTaskHistoryStore on a tmpdir with full lifecycle coverage, including stale-recovery, store-error fallback, and broadcast verification.

That said there are two correctness bugs, one performance issue, and two smaller concerns that should be addressed before merge.

Bugs (must fix)

#	Location	Issue
1	`task-router.ts:1149` + `task-router.ts:142`	`type` filter is silently dropped — `data.type` is never destructured, never passed to `store.list()`, and `matchesListFilters` has no `typeFilter` parameter. The field is declared in the schema and the interface but has zero effect at runtime. See inline comments for the three-point fix.
2	`task-router.ts:1157–1161`	`store.list()` called without `limit` — all persisted entries are loaded from disk before the page slice is applied. `ITaskHistoryStore.list` already accepts `limit`; pass it through to avoid O(N) reads.

Issues (should fix)

#	Location	Issue
3	`task-router.ts:1190`	Cursor tie-breaking — `nextCursor = paged.at(-1)?.createdAt` is not unique. Two tasks sharing the same millisecond timestamp straddle the page boundary; the second is silently excluded on the next fetch. Use `createdAt - 1` as the cursor or document the known gap.
4	`task-events.ts:84`	`error.name` promoted to required — a type-level breaking change. Serialised entries written before this commit (no `name` field) will now be structurally non-conformant. Either add a normalisation step in `FileTaskHistoryStore.getById` or keep the field optional and default at call sites.

Nitpicks / minor

#	Location	Note
5	`task-router.ts:1055–1059`	`deletedCount` in `handleTaskDeleteBulk` counts every `success: true`, including "not found" idempotent deletes. The semantics would be clearer as "actually removed" count.
6	`task-router.ts:1016–1022`	`status !== undefined` guard is dead code — `TaskInfo.status` is always initialised to `'created'` on construction.
7	`task-router.ts:154–221`	`synthesizeEntryFromTaskInfo` intentionally mirrors `TaskHistoryHook` logic; the comment flags the drift risk. A `task-history-entry-builder.ts` extraction (mentioned in the comment) would eliminate this class of skew — worth scheduling soon.

What's good

Clean DIP via getTaskHistoryStore?: (projectPath) => ITaskHistoryStore — pre-M2.09 tests are fully unaffected.
broadcastTaskDeleted correctly skips when projectPath is undefined; broadcast-on-clear correctly dedupes in-memory + store IDs with a Set.
handleTaskDelete refusal for non-terminal in-memory tasks prevents deleting a running agent's task out from under it.
Test file is production-quality: real disk I/O, merge-wins verification, stale-recovery override, store-error fallback, idempotency, and broadcast assertions all covered.

github-actions · 2026-04-27T13:50:56Z


-    const items: TaskListItem[] = []
+    const limit = data.limit ?? DEFAULT_TASK_LIST_LIMIT
+    const {before, status: statusFilter} = data


bug: type filter is silently dropped

data.type is never destructured here, so the newly-added type?: string[] filter has no effect even though it is declared in the schema and the interface.

matchesListFilters also has no typeFilter parameter, and store.list() is called without type. You need all three:

Suggested change

const {before, status: statusFilter} = data

const limit = data.limit ?? DEFAULT_TASK_LIST_LIMIT

const {before, status: statusFilter, type: typeFilter} = data

Then pass typeFilter into matchesListFilters and into the store.list() call (type: typeFilter).

github-actions · 2026-04-27T13:50:59Z


+function matchesListFilters(
+  item: TaskListItem,
+  filters: {before?: number; projectFilter: string; statusFilter?: TaskListItemStatus[]},


bug: type filter missing from matchesListFilters

The signature has no typeFilter parameter, so even after handleTaskList is fixed to extract data.type, the per-item check will not apply it to in-memory entries.

Suggested change

filters: {before?: number; projectFilter: string; statusFilter?: TaskListItemStatus[]},

function matchesListFilters(

item: TaskListItem,

filters: {before?: number; projectFilter: string; statusFilter?: TaskListItemStatus[]; typeFilter?: string[]},

): boolean {

Add a corresponding guard in the body:

if (filters.typeFilter && filters.typeFilter.length > 0 && !filters.typeFilter.includes(item.type)) return false

github-actions · 2026-04-27T13:51:04Z

+        persisted = await store.list({
+          projectPath: projectFilter,
+          ...(before === undefined ? {} : {before}),
+          ...(persistedStatuses === undefined ? {} : {status: persistedStatuses}),
+        })


performance: store.list() is called without a limit, loading unbounded data

The limit is only applied after merging all persisted entries with all in-memory entries. For a project with thousands of history entries this eagerly reads everything from disk before discarding most of it.

Pass a ceiling to the store so it can short-circuit early. A safe upper bound is limit + in-memory count, but even just limit is better than nothing:

Suggested change

persisted = await store.list({

projectPath: projectFilter,

...(before === undefined ? {} : {before}),

...(persistedStatuses === undefined ? {} : {status: persistedStatuses}),

})

persisted = await store.list({

projectPath: projectFilter,

...(before === undefined ? {} : {before}),

...(persistedStatuses === undefined ? {} : {status: persistedStatuses}),

...(typeFilter === undefined ? {} : {type: typeFilter}),

limit,

})

(ITaskHistoryStore.list already accepts limit.)

github-actions · 2026-04-27T13:51:08Z

+    const items = [...merged.values()].sort((a, b) => b.createdAt - a.createdAt)
+    const paged = items.slice(0, limit)
+    const hasMore = items.length > paged.length
+    const nextCursor = hasMore && paged.length > 0 ? paged.at(-1)?.createdAt : undefined


issue: cursor is not unique — same-createdAt entries will be skipped across pages

nextCursor is the raw createdAt of the last item on the page. If a client uses this as before on the next request, any entry with the exact same timestamp as the cursor is excluded (item.createdAt >= before drops it). In practice two tasks can share the same millisecond timestamp (especially in tests or rapid automation), causing silent data loss across pages.

Consider using nextCursor = paged.at(-1)!.createdAt - 1 so the boundary is exclusive by one millisecond, or document the known limitation clearly.

github-actions · 2026-04-27T13:51:13Z

+    let deletedCount = 0
+    for (const taskId of data.taskIds) {
+      // eslint-disable-next-line no-await-in-loop -- per-id projectPath resolution is sequential
+      const result = await this.handleTaskDelete({taskId}, clientId)
+      if (result.success) deletedCount++


issue: deletedCount overcounts — idempotent "not found" deletes report success: true

handleTaskDelete always returns {success: true} for a missing task (idempotent by design). That means deletedCount here is really "number of taskIds that were not refused due to non-terminal status", not "number of entries actually removed". A caller passing 10 already-deleted IDs gets deletedCount: 10.

If the contract is "how many were actually live", count wasInMemory || wasLive instead, or return a richer response from handleTaskDelete (e.g. {deleted: true, success: true} vs {deleted: false, success: true}).

github-actions · 2026-04-27T13:51:16Z

+    // Refusal: non-terminal in-memory tasks must not be deleted out from under the agent.
+    const liveTask = this.tasks.get(taskId)
+    if (liveTask !== undefined) {
+      const {status} = liveTask
+      if (status !== undefined && !TERMINAL_STATUSES.has(status)) {
+        return {error: `cannot delete task in status '${status}'`, success: false}
+      }


nitpick: status !== undefined guard is dead code

TaskInfo.status defaults to 'created' at construction (line 850), so it is always defined for any task in this.tasks. The outer guard if (liveTask !== undefined) ensures a task exists, so status is always a TaskListItemStatus here.

Removing the !== undefined check makes the intent clearer and the TypeScript types more accurate:

Suggested change

// Refusal: non-terminal in-memory tasks must not be deleted out from under the agent.

const liveTask = this.tasks.get(taskId)

if (liveTask !== undefined) {

const {status} = liveTask

if (status !== undefined && !TERMINAL_STATUSES.has(status)) {

return {error: `cannot delete task in status '${status}'`, success: false}

}

if (liveTask !== undefined) {

const {status} = liveTask

if (!TERMINAL_STATUSES.has(status ?? 'created')) {

return {error: `cannot delete task in status '${status}'`, success: false}

}

}

github-actions · 2026-04-27T13:51:21Z

    code?: string
    message: string
-    name?: string
+    name: string


issue: making error.name required is a breaking wire-format change

Any TaskListItem serialised before this commit (on-disk history entries, cached API responses, existing clients) that omits the error.name field will now fail TypeScript strict-mode checks on the reading side. The Zod schema in schemas.ts (TaskListItemStatusSchema) does not enforce error.name, so runtime parsing won't catch the mismatch — only TypeScript will.

If name must be required, add a migration step (e.g. normalise to 'UnknownError' on read in FileTaskHistoryStore.getById). Otherwise revert to name?: string and use a non-null assertion or default at call sites that actually need it.

* feat: [ENG-2486] Extend transport schemas with provider/model (#554) * feat: [ENG-2487] TaskRouter stamps provider + model on task:create (#555) * feat: [ENG-2488] Wire resolveActiveProvider in daemon bootstrap (#556) * feat: [ENG-2491] Domain entity + store interface (Level 2 schema) (#559) * feat: [ENG-2492] FileTaskHistoryStore: Index + Data API (basic save/get/list) * feat: [ENG-2498] Transport schemas (pagination, get, delete events) (#560) * feat: [ENG-2495] FileTaskHistoryStore: Delete + Clear API (#561) * feat: [ENG-2494] FileTaskHistoryStore: Stale recovery on read (#562) * feat: [ENG-2496] Daemon llmservice accumulator + real-time TaskHistoryHook (#565) * feat: [ENG-2497] Daemon bootstrap wiring + per-project store factory + startup audit (#567) * feat: [ENG-2499] Daemon handlers (list paginated, get, delete, bulk, clear) + task:deleted broadcast (#568) * feat: [ENG-2493] FileTaskHistoryStore: Prune + Compaction (#569) * Feat/eng 2515 (#575) * feat: [ENG-2515] Task history persistence — correctness fixes before WebUI ships * fix: [ENG-2515] retention design + WebUI provider/model payload Two correctness fixes surfaced while preparing the M2.10–M2.13 WebUI handover: 1. Disable age-based task history retention by default. Age prune (30 days) was a design mistake: it silently deletes user tasks even when the count cap (1000) has plenty of room left. Task history is a business artifact for audit/review, not a log — count-based rotation is the correct retention policy. Set the default to 0; deployments that want time-based eviction can still opt in via the constructor option. 2. Expose provider/model on the WebUI task:created subscriber. Daemon stamps both fields on every task:create (M1) but the local TaskCreatedPayload omitted them, so the chip M1.04 builds on would have no source. Extend the payload type and forward both fields into upsertStatus. * Feat/eng 2515 (#576) * feat: [ENG-2515] Task history persistence — correctness fixes before WebUI ships * fix: [ENG-2515] retention design + WebUI provider/model payload Two correctness fixes surfaced while preparing the M2.10–M2.13 WebUI handover: 1. Disable age-based task history retention by default. Age prune (30 days) was a design mistake: it silently deletes user tasks even when the count cap (1000) has plenty of room left. Task history is a business artifact for audit/review, not a log — count-based rotation is the correct retention policy. Set the default to 0; deployments that want time-based eviction can still opt in via the constructor option. 2. Expose provider/model on the WebUI task:created subscriber. Daemon stamps both fields on every task:create (M1) but the local TaskCreatedPayload omitted them, so the chip M1.04 builds on would have no source. Extend the payload type and forward both fields into upsertStatus. * fix: [ENG-2515] eliminate task-history test flake — close cache-invalidation race and add deterministic flush API Two pathologies surfaced as ~13% flake on local and similar on CI: 1. Cache-invalidation race in production code. A firePrune pass that started before a save() could finish AFTER the save's indexCache = undefined, overwriting the invalidation with a snapshot missing the just-saved row. Symptom: list(), clear(), deleteMany() returning N-1 entries after N awaited saves. Fix: bump indexEpoch on every index write (save / tombstone / recovery / compaction); doReadIndexDedup samples the epoch at the start and skips its setCache step if the epoch advanced. Also clear indexDedupInFlight on every write so a list() cannot pick up a pre-write in-flight promise. 2. Timing-based polling in tests. waitForPruneToSettle polled at 80 x 4ms with a 4-stable-check exit, which under CI parallel-worker load missed the setTimeout(0) macrotask window. Fix: expose flushPendingOperations() — each firePrune wraps its setTimeout in a promise assigned to pruneChain; the public method awaits the chain (loops because pruneRequested re-extends it) and drains operationLock. Test callers replace polling with this single await store.flushPendingOperations(). Plus: tempDir entropy upgraded from Math.random() to randomUUID() in two test files to remove a residual collision risk in CI parallel runs. Verification: a new race-regression test (file-task-history-store-cache-race.test.ts) reproduces the race at ~27% rate on production code and drops to 0% post-fix; running each affected test file 30x consecutively yields 120/120 pass; full suite npm test reports 7103 passing, 0 failing. * fix: [ENG-2515] FileTaskHistoryStore — chunk tombstone appends under 4 KB to keep concurrent saves from corrupting JSON lines Large clear() / deleteMany() batches previously issued a single appendFile of all tombstones joined. POSIX guarantees nothing about regular-file write atomicity (PIPE_BUF applies to FIFOs/sockets only). On filesystems that don't serialize appends per inode, an unlocked concurrent save() can interleave content into the middle of a multi-line tombstone write, corrupting a JSON line. The line is then silently skipped by IndexLineSchema.safeParse, so the tombstone is "missing" from the dedup map even though the data file was already unlinked → list() returns a ghost row whose getById() returns undefined. Fix: introduce chunkLinesByBytes (exported for direct unit-test) and issue tombstone appends in chunks under MAX_APPEND_CHUNK_BYTES (3.5 KB). Each individual appendFile call stays under the 4 KB page boundary so the kernel page-cache write path treats it as a single sector-level write on common filesystems. Chunks are sequential under the existing withOperationLock so concurrent saves can only land BETWEEN chunks, never WITHIN one. Tests: chunker unit test covers boundary + oversized-line + 200-tombstone realistic case; integration test covers (a) clear() with 200 entries removing all tombstones + data files, (b) every appendFile chunk well-formed JSON post-clear, (c) clear() interleaved with 50 concurrent fresh saves keeping the index parseable. * refactor: [ENG-2515] move TaskHistoryEntry into shared/ + runtime layer-isolation test The persisted-entry shape lived in src/server/core/domain/entities/. The WebUI Tasks tab needs the same TS type (TaskGetResponse, stored detail), which forced a webui→server import — inverting the boundary the ESLint rule on tui/ already enforces. Move the TS shape (TaskHistoryEntry, TaskHistoryEntryBase, TaskErrorData, TaskHistoryStatus, TASK_HISTORY_SCHEMA_VERSION) into src/shared/transport/events/task-events.ts. The Zod schema stays in src/server/core/domain/entities/task-history-entry.ts and now carries `satisfies z.ZodType<TaskHistoryEntry>`, so any drift between the runtime schema and the type is a typecheck error. The entity file re-exports the type so existing server-side imports continue to compile unchanged. test/unit/shared/layer-isolation.test.ts: runtime walk over src/shared/ asserting no `from '...server/...'` (or other higher layers) appears in any .ts/.tsx file. Catches regressions even if a future ESLint rule is bypassed. Incidental: src/server/infra/process/task-router.ts picks up formatting churn from the prettier/eslint-fix pass on the same change set (line wrapping only, no logic change). * feat: [ENG-2490] add provider chip column to WebUI task list * feat: [ENG-2500] wire WebUI delete + bulk delete + clear-completed mutations * feat: [ENG-2501] render task detail from disk via task:get for cold tasks * feat: [ENG-2503] dim styling + tooltip for daemon-interrupted task entries * feat: [ENG-2490] cover (provider, '') edge case in formatProviderModel * feat: [ENG-2500] keep selection on bulk-delete failure + cover Delete failed fallback * feat: [ENG-2501] gate task:get on live status + pin enabled + infinite staleTime * feat: [ENG-2503] tighten isInterrupted status type + cover precedence * feat: [ENG-2503] use design-system Tooltip for interrupted-row hint * feat: [ENG-2539] M2.16 task:list server-side filter/search + numbered… (#594) * feat: [ENG-2539] M2.16 task:list server-side filter/search + numbered pagination Schema: drop cursor (before/limit/nextCursor), add page/pageSize/total/pageCount + searchText, provider[], model[], createdAfter/Before, minDuration/maxDurationMs. Response adds counts (Model A: matches current filter scope), availableProviders, availableModels (history-derived, exclude pivots so dropdowns dont shrink). Handler: 2-pass search. Pass-1 over index summary (content + error.message). Pass-2 lazy-cracks data file via store.getById for completed tasks where full result text might match. In-memory tasks read task.result directly (no I/O). Race-safe: getById errors swallowed with log-once-per-query. Store: ITaskHistoryStore.list rename after/before -> createdAfter/createdBefore, add provider/model coarse pre-filter, drop limit (handler paginates). Tests: 5 new test files updated. New cases cover all 7 filter dims, AND combination, pass-2 happy path (full result match past byte 1500), pass-2 in-memory path, file-race swallow, counts Model A invariant counts.all === total, availableProviders/Models exclude pivots + symmetric empty-string guard. * fix: [ENG-2539] M2.16 PR review fixes Address all 8 inline comments from PR #594: - Docs: counts is Model A post-filter, counts.all === total invariant (was incorrectly described as pre-status-filter) - Docs: page is echoed back as-sent without upper-bound clamp against pageCount; documented contract on JSDoc - Interface: document why provider/model/status options remain on ITaskHistoryStore.list (direct-caller use; handler bypasses pivots) - mapBounded signature: explicit fn returns R | undefined - handleTaskList: refactor 2-pass to single-pass over candidatesNoSearch; derive merged inline; pass-2 reuses the map (was 2xN, now 1xN) - Comment: clarify pass2Filter spread auto-inherits new non-pivot dims - Schemas: bind via satisfies z.ZodType<X> to task-events.ts interfaces; re-export TaskListRequest/Response/Counts/AvailableModel from shared so the interface is single source of truth (compile-time drift prevention) - Tests: add maxDurationMs upper-bound + no-store fallback coverage * feat: [ENG-2502] paginate task list with useInfiniteQuery + Load more table row * feat: [ENG-2502] keyboard-accessible Load more row + indent describe blocks * feat: [ENG-2547] M2.17 WebUI numbered pagination + Filter dropdown + active filter tags * feat: [ENG-2547] address bot review: memo serverStatus + closeTask, dedup useGetProviders + isDurationPreset, memo tags * feat: [ENG-2610] M2.18 pagination polish + default page size 20 + friendly provider names * feat: [ENG-2610] memoize providerNames against providersResponse, not derived array --------- Co-authored-by: ncnthien <nhatthien185@gmail.com>

feat: [ENG-2499] Daemon handlers (list paginated, get, delete, bulk, …

7e0bda2

…clear) + task:deleted broadcast

bao-byterover requested a review from leehpham as a code owner April 27, 2026 13:47

bao-byterover merged commit 488bab6 into proj/persis-task-history Apr 27, 2026
3 checks passed

bao-byterover deleted the feat/ENG-2499 branch April 27, 2026 13:47

github-actions Bot reviewed Apr 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: [ENG-2499] Daemon handlers (list paginated, get, delete, bulk, …#568

feat: [ENG-2499] Daemon handlers (list paginated, get, delete, bulk, …#568
bao-byterover merged 1 commit into
proj/persis-task-historyfrom
feat/ENG-2499

bao-byterover commented Apr 27, 2026

Uh oh!

github-actions Bot commented Apr 27, 2026 •

edited

Loading

Uh oh!

Uh oh!

github-actions Bot Apr 27, 2026

Uh oh!

github-actions Bot Apr 27, 2026

Uh oh!

github-actions Bot Apr 27, 2026

Uh oh!

github-actions Bot Apr 27, 2026

Uh oh!

github-actions Bot Apr 27, 2026

Uh oh!

github-actions Bot Apr 27, 2026

Uh oh!

github-actions Bot Apr 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	const {before, status: statusFilter} = data
	const limit = data.limit ?? DEFAULT_TASK_LIST_LIMIT
	const {before, status: statusFilter, type: typeFilter} = data

Conversation

bao-byterover commented Apr 27, 2026

Uh oh!

github-actions Bot commented Apr 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review — feat: [ENG-2499] Daemon handlers (list paginated, get, delete, bulk, clear)

Summary

Bugs (must fix)

Issues (should fix)

Nitpicks / minor

What's good

Uh oh!

Uh oh!

github-actions Bot Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Apr 27, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions Bot commented Apr 27, 2026 •

edited

Loading