release: v0.2.5818 — MCP stability + issue-44 + security + correctness by justrach · Pull Request #497 · justrach/codedb

justrach · 2026-05-24T09:31:58Z

Summary

MCP server no longer dies on client disconnect (SIGPIPE + broken stdout detection)
Issue-44: stale snapshot content now visible to search after working-tree changes
.env-local and .env_production no longer bypass the sensitive path filter
Broken error recovery in re-index fixed + BM25 NaN safety
Monolithic tests.zig split into 8 independent test binaries
Per-tier search breakdown telemetry (OTEL-style spans)

Merge strategy

Merge commit — do not squash.

🤖 Generated with Claude Code

…roken stdout Three root causes fixed: - No SIGPIPE handler: writing to a broken pipe killed the process outright - writeResult/writeError/writeRequest silently swallowed write failures (catch return), leaving the main loop unaware the client disconnected - Main loop had no exit path for write failures or watchdog shutdown signal Added cio.ignoreSigpipe() via sigaction at startup. Added stdout_broken atomic flag set on any stdout write failure. Main loop now checks both stdout_broken and a shutdown flag (passed from the watchdog thread) each iteration. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…ing tree changes loadSnapshotFast correctly detected stale files (disk mtime > snapshot mtime) but re-indexed them with indexFileOutlineOnly, which skips the word index and trigram index. Since searchContent relies on these indices, updated content was invisible to all search tiers. Changed to indexFile (full_index=true) so stale files get complete indexing. Also fixed isSensitivePath to block .env-* and .env_* variants (not just .env.*) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

isSensitivePath only matched .env and .env.X (dot-delimited) but missed .env-local, .env_production and similar hyphen/underscore variants. Extended the check to also treat '-' and '_' as delimiters after .env prefix. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Two correctness fixes: 1. commitParsedFileOwnedOutline: prior_content was hardcoded to null, so the errdefer on trigram indexing failure always removed the word index entry instead of restoring the prior content. Now fetches prior content before overwriting. 2. avgDocLength returns 1.0 when total_tokens=0 (prevents Inf in BM25 normalization). Sort comparator in rerankAndFinalize treats NaN scores as 0 to prevent unstable ordering. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The 11,621-line tests.zig (467 tests) compiled as a single binary that pegged CPU. Split into 8 independent binaries by domain: test-core 21 tests (store, agent, config, edit) test-explore 95 tests (explorer, word index, dep-graph, git, threads) test-index 141 tests (trigram, bloom, regex, disk, sparse ngram, perf) test-parser 46 tests (PHP, Go, Ruby, Swift, C, HCL, R, Dart, Python, TS) test-search 47 tests (BM25, rerank, callers) test-snapshot 16 tests (snapshot read/write/corruption) test-mcp 57 tests (MCP protocol, bundle, nuke, update, telemetry) test-query 44 tests (query pipeline, fuzzy, glob) Each can be run individually (zig build test-core) or all at once (zig build test). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…s fixes Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Instruments searchContent with nanoTimestamp() around each of the 7 search tiers plus rerank. Stores a SearchBreakdown struct on the Explorer after every search, emitted as a search_breakdown telemetry event for codedb_search/codedb_find/codedb_word calls. Fields: tier0_ns through tier5_ns, rerank_ns, tier_reached, candidate_count, result_count. ~160ns overhead per search (8 clock_gettime calls on Apple Silicon). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions · 2026-05-24T09:34:34Z

Benchmark Regression Report

Thresholds: 10.00% and 50,000 ns absolute delta

NOISE means the percentage threshold was exceeded, but the absolute delta was too small to fail CI.

Tool	Base (ns)	Head (ns)	Delta	Abs Delta (ns)	Status
`codedb_bundle`	578435	566215	-2.11%	-12220	OK
`codedb_changes`	58283	61673	+5.82%	+3390	OK
`codedb_deps`	10001	10335	+3.34%	+334	OK
`codedb_edit`	7260	7748	+6.72%	+488	OK
`codedb_find`	67127	66253	-1.30%	-874	OK
`codedb_hot`	108485	113933	+5.02%	+5448	OK
`codedb_outline`	339657	348217	+2.52%	+8560	OK
`codedb_read`	106734	119420	+11.89%	+12686	NOISE
`codedb_search`	163660	172208	+5.22%	+8548	OK
`codedb_snapshot`	312878	332344	+6.22%	+19466	OK
`codedb_status`	14246	14944	+4.90%	+698	OK
`codedb_symbol`	66354	65216	-1.72%	-1138	OK
`codedb_tree`	68522	84508	+23.33%	+15986	NOISE
`codedb_word`	93887	97692	+4.05%	+3805	OK

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 577785a5b8

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-24T09:35:47Z

+    if (std.mem.eql(u8, name, "codedb_search") or std.mem.eql(u8, name, "codedb_find") or std.mem.eql(u8, name, "codedb_word")) {
+        telem.recordSearchBreakdown(explorer.last_search_breakdown);


Record breakdown only for real tiered content searches

recordSearchBreakdown is emitted for codedb_find and codedb_word as well as failed codedb_search calls, but those paths do not update explorer.last_search_breakdown, so this logs stale data from a previous request. In practice, a successful codedb_search followed by codedb_find will attribute the old tier timings to the filename search, which corrupts per-tool telemetry and any downstream analysis based on these events.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-24T09:35:47Z

+            breakdown.tier_reached = if (breakdown.tier5_ns > 0 and result_list.items.len > 0) 7
+                else if (breakdown.tier4_ns > 0 and result_list.items.len > 0) 6


Derive tier_reached from actual executed search tier

tier_reached is computed from tier5_ns > 0, but tier5_ns is always positive because timing is recorded even when Tier 5 is skipped. Any non-early-return search with results therefore reports tier 7, even if matches were found in earlier tiers, making the new per-tier telemetry misleading for performance/debugging decisions.

Useful? React with 👍 / 👎.

justrach and others added 7 commits May 24, 2026 15:18

release: v0.2.5818 — MCP stability + issue-44 + security + correctnes…

2ca64ac

…s fixes Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

justrach changed the title ~~release: merge v0.2.5808–v0.2.5818 back to main~~ release: v0.2.5818 — MCP stability + issue-44 + security + correctness May 24, 2026

chatgpt-codex-connector Bot reviewed May 24, 2026

View reviewed changes

justrach merged commit 577785a into main May 25, 2026
1 check passed

justrach deleted the release/v0.2.5818 branch May 25, 2026 16:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

release: v0.2.5818 — MCP stability + issue-44 + security + correctness#497

release: v0.2.5818 — MCP stability + issue-44 + security + correctness#497
justrach merged 7 commits into
mainfrom
release/v0.2.5818

justrach commented May 24, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 24, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 24, 2026

Uh oh!

chatgpt-codex-connector Bot May 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		if (std.mem.eql(u8, name, "codedb_search") or std.mem.eql(u8, name, "codedb_find") or std.mem.eql(u8, name, "codedb_word")) {
		telem.recordSearchBreakdown(explorer.last_search_breakdown);

		breakdown.tier_reached = if (breakdown.tier5_ns > 0 and result_list.items.len > 0) 7
		else if (breakdown.tier4_ns > 0 and result_list.items.len > 0) 6

Conversation

justrach commented May 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Merge strategy

Uh oh!

github-actions Bot commented May 24, 2026

Benchmark Regression Report

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 24, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 24, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

justrach commented May 24, 2026 •

edited

Loading