feat: memory system optimizations by royosherove · Pull Request #2 · inceptionstack/roundhouse

royosherove · 2026-05-07T11:22:43Z

Changes

Skip file re-read when only read-only tools used (read/grep/find/ls/glob)
Flush model: Haiku 4.5 (faster, matches quality for structured writes)
compactWithModel: keeps Haiku for BOTH flush + compact (was restoring to Opus between steps)
Live progress display: Telegram message edits in-place during compact
Persistent timing log: ~/.roundhouse/logs/compact-timing.jsonl
READ_ONLY_TOOLS + CompactResult/CompactTiming: proper exports in types.ts

Results

Total compact time: 390s → 197s (2x faster)
222 tests passing
All automated code reviews + architect review passed

Fix #1: Cache MemorySnapshot and fileSet in PreparedTurn. In finalizeMemoryForTurn, skip the disk re-read entirely when no tools were used during the turn (turnUsedTools=false). When tools ran, reuse the cached fileSet (avoids re-resolving paths) but still re-read file contents to detect modifications. handleStreaming() now returns { usedTools } flag. Flush model: Add config.compact.flushModel option and AgentAdapter.promptWithModel() method. When configured, memory flush turns use a cheaper/faster model (e.g. Sonnet) instead of the expensive conversation model. Pi adapter implements via session.setModel() with automatic restore after flush. All 217 tests pass, no new type errors.

Flush turns are structured tasks (read context → write markdown) that don't need frontier reasoning. Sonnet responds 3-5x faster than Opus for these turns. Default: amazon-bedrock/us.anthropic.claude-sonnet-4-6-20250514-v1:0 Set flushModel: null in config to fall back to conversation model.

P2: Replace session.setModel() with in-memory agent.state.model swap. Avoids persisting flush model to settings.json or session log. Crash-safe: if process dies mid-flush, settings remain on user's configured model. P3: Revert complement/unknown mode pre-read. finalizeMemoryForTurn skips digest checks for complement mode, so reading files was wasted I/O.

- Always restore agentState.model in finally (even if undefined) - Only set turnUsedTools=true for write/edit/bash/multi_edit tools (read/grep/ls/find are read-only — no need to re-hash memory files)

Unknown/extension tools now trigger memory re-read (safe default). Only known read-only tools (read, grep, find, ls, glob) skip re-read. Addresses codex review: custom extension tools that modify memory files would have been missed by the previous write-tool allowlist.

- Telegram messages now show flush/compact/total time + model used - Timing persisted to ~/.roundhouse/logs/compact-timing.jsonl (JSONL) - Both manual /compact and auto-compact show timing

- READ_ONLY_TOOLS: shared constant in memory/types.ts (not buried in handleStreaming) - CompactTiming + CompactResult: proper types in types.ts (SRP) - lifecycle.ts imports CompactResult instead of inline type - gateway.ts imports READ_ONLY_TOOLS + CompactResult from types - Tests: READ_ONLY_TOOLS membership, CompactResult with/without timing

Before: flush used Haiku (238s) but compact restored to Opus (152s) = 390s total After: both flush AND compact use Haiku via new compactWithModel adapter method - Added compactWithModel?(threadId, modelId) to AgentAdapter interface - Implemented in pi.ts: in-memory model swap around session.compact() - lifecycle.ts: prefers compactWithModel when available, falls back to compact()

…-place) - New telegram-progress.ts: createProgressMessage() → editable message handle - Compact shows: 'Flushing memory...' → 'Compacting... (flush took Xs)' → final result - Single message edited in-place (no spam of multiple messages) - Works for both manual /compact and auto-compact - Falls back gracefully for non-Telegram adapters (no-op updates)

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 78189ad064

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-07T11:25:40Z

+    // Non-Telegram: just post once, updates are no-ops
+    await thread.post(initialText);
+    return { update: async () => {} };


Keep progress updates for non-Telegram threads

Return a real update() implementation for non-Telegram adapters instead of a no-op. After this commit, gateway flows like /compact and auto-compaction call progress.update(...) for success/failure messages, so non-Telegram users now only see the initial “Saving memory...” post and never see completion or error status. This regresses observability of long-running operations outside Telegram.

Useful? React with 👍 / 👎.

…31) (#133) * refactor(session-repair): beautify per code review (7 findings) Addresses all 7 findings from the post-v0.5.30 maintainability review. Pure refactor \u2014 zero behavior change. 536 tests passing (+2 new for the divergent-change fix). #1 + #7 \u2014 DRY: extract matchesErrorPatterns + fix divergent change isContextOverflowError and isToolPairingError shared ~80% of structure (top-level .message, cause-chain walk, stringify-gate, circular try/catch) but with subtle drift: only one walked the cause chain. Extracted matchesErrorPatterns(err, patterns, { stringifyGate }) shared helper. Both classifiers now ~7 lines and BOTH walk cause chains. Added 2 tests proving isToolPairingError now classifies wrapped/Bedrock-nested tool-pairing errors (regression guard for the divergent-change fix). Also extracted looksLikeValidationError() shared gate so the two classifiers don't repeat the 4xx/ValidationException check. #2 \u2014 long method: extracted buildTrimmedEntries(entries, cutIdx) from softResetSessionFile. The orchestration is now linear: validate \u2192 cut \u2192 build \u2192 repair \u2192 write \u2192 report. Each step at the same abstraction level. #3 \u2014 long catch with nested try: extracted attemptSoftResetRecovery() in lifecycle.ts. flushMemoryThenCompact catch block went from ~60 lines with nested try/catch to ~25 lines, linear: classify \u2192 recover \u2192 log \u2192 persist. #4 \u2014 magic number: MAX_CAUSE_CHAIN_DEPTH = 5 named constant. #5 \u2014 file >400 lines: split src/agents/shared/session-repair.ts (574 lines, two domains) into: - session-repair.ts (81 lines) \u2014 public surface, tool-pair repair, re-exports for backward compat - session-soft-reset.ts (120 lines) \u2014 softResetSessionFile + types - error-classifiers.ts (75 lines) \u2014 matchesErrorPatterns + isContextOverflowError + isToolPairingError + MAX_CAUSE_CHAIN_DEPTH - session-repair-internal.ts (239 lines) \u2014 shared filesystem + in-memory repair primitives All public exports preserved via re-exports (Feathers seam pattern). Existing callers compile unchanged. #6 \u2014 naming: introduced RepairResult { entries, report } named type for the previously-anonymous repairEntriesInMemory return shape. Better IDE hover/autocomplete; no inline destructuring noise at call sites. Verification: npm test \u2014 536 passing (was 534, +2 for cause-chain regression tests) wc -l src/agents/shared/session-repair.ts \u2014 81 (was 574) All public API surfaces preserved. * chore: bump to v0.5.31 + CHANGELOG --------- Co-authored-by: Roy Osherove <575051+royosherove@users.noreply.github.com>

Loki FastStart added 9 commits May 7, 2026 09:38

fix: restore model unconditionally, track only file-modifying tools

d089aa8

- Always restore agentState.model in finally (even if undefined) - Only set turnUsedTools=true for write/edit/bash/multi_edit tools (read/grep/ls/find are read-only — no need to re-hash memory files)

feat: add timing info to compact messages + persistent timing log

5b07d9f

- Telegram messages now show flush/compact/total time + model used - Timing persisted to ~/.roundhouse/logs/compact-timing.jsonl (JSONL) - Both manual /compact and auto-compact show timing

royosherove merged commit df958b9 into main May 7, 2026

royosherove deleted the feat/memory-optimization branch May 7, 2026 11:22

chatgpt-codex-connector Bot reviewed May 7, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: memory system optimizations#2

feat: memory system optimizations#2
royosherove merged 9 commits into
mainfrom
feat/memory-optimization

royosherove commented May 7, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

royosherove commented May 7, 2026

Changes

Results

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 7, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant