v0.6.1
Patch release: token-savings tuning, output-cache DRY refactor, and stat-accounting fixes.
Changed
- Token-savings tuning across the hint, compaction, and output surfaces. Three internal improvement sweeps tightened the text Token-Goat injects into the conversation: shorter session read-hints and bash / grep / web dedup hints, leaner PreCompact manifest framing, a more compact post-compaction recovery hint, terser
token-goat mapoutput framing, and budgeted git-history and project-memory injections. The CLAUDE.md / SKILL.md / AGENTS.md directive blocks written bytoken-goat installwere condensed without dropping any guidance. The result is the same hints for fewer tokens. - Command
--jsonoutput is now compact single-line JSON.stats,map,config,bash-output,web-output,bash-history,web-history,compact-hint, and the surgical-read commands emit--jsonwith no indentation whitespace. JSON written to disk (settings.json and config files) stays pretty-printed for human editing. bash-outputandweb-outputrecall now default to a smart head-and-tail view for large cached outputs, with--fullto retrieve the whole thing.- DRY pass on the output-cache layer.
bash_cacheandweb_cachewere near-parallel implementations; their shared pieces (the cache-filename pattern, session-id sanitization, JSON-sidecar loading, and LRU disk-cap eviction) now live in onecache_commonmodule. No user-visible behavior change. Regression tests were added across the token-savings, stat-accounting, and cache surfaces.
Fixed
compact_recoverystat accounting. The post-compaction recovery hint recorded no injection overhead and was bucketed under theothersource instead ofcompact. It now records acompact_recovery_overheadrow consistent with thesession_hint,diff_hint, andbash_dedup_hintsiblings, and bothcompact_recoverykinds map to thecompactsource bucket.bash-outputandweb-outputrecalls were credited no savings. Retrieving a cached output instead of re-running a command, or a cached response instead of re-fetching a URL, now records abash_output_recallorweb_output_recallstat. This closes a measurement gap where thousands of cache hits showed zero tokens saved.