feat(util): allowlisted env reader (B5) + test(integration): cross-family handoff matrix (B4) + 08-pi-mono comparison by omerakben · Pull Request #26 · omerakben/code-oz

omerakben · 2026-05-11T02:54:17Z

Summary

Doc + targeted-code subset of the pi-mono comparison borrow set landed under the Codex accept-with-modifications verdict (thread 019e12f0). Four commits + merge from origin/main:

B4 — cross-family handoff matrix offline test asserts rule-2 invariant (reviewer family != builder family) over 12 directional pairs across claude/codex/gemini/xai, using the buildProviderRegistry({ providerOverride: 'fake' }) family-aliasing seam.
B5 — allowlisted env reader (src/util/env.ts) with Bun /proc/self/environ fallback. Reads only declared keys, never the whole environment as a Map (rule-13 privacy-by-default).
MR-1 — CLAUDE.md status-line refresh to v0.17.0-alpha.0 / M16 / 3108 tests.
08-pi-mono comparison set — COMPARISON.md, CODEX_BRIEFING.md, CODEX_RESPONSE.md, SYNTHESIS.md capturing the seven accepted modifications (B2 split, B3 hardening, B5 allowlist, B6 reframe, B7 deferral, B8 downgrade, S1 deferral) and demand-gated R1 annotation.

Merged origin/main on top to absorb 11 PRs that landed earlier today (#12-#22). Conflicts resolved: CLAUDE.md status line (kept this branch's pi-mono-era line; rule-1 expansion and rule-16 persona paragraph from main preserved); docs/comparison/README.md (new file from main, appended 08-pi-mono row, removed pi-mono from backlog).

Verification

bun test -> 3196 pass / 2 skip / 0 fail (was 3108 before merge; +88 from the new B4 matrix + B5 fallback tests plus the union of origin/main's new suites).
Live xAI integration skipped as expected (gated behind CODE_OZ_LIVE_PROVIDER_TESTS=xai + CODE_OZ_LIVE_XAI_MODEL).

Test plan

bun test passes offline
No new lint or type-check regressions from the merge (full test suite green)
CLAUDE.md keeps rule-1 expansion + rule-16 persona paragraph from main
docs/comparison/README.md has the 08-pi-mono row and removes pi-mono from the backlog list

…odifications) COMPARISON.md, CODEX_BRIEFING.md, CODEX_RESPONSE.md, SYNTHESIS.md. Codex thread 019e12f0; verdict accept-with-modifications. Locked borrow set: B1 (renamed requestedModel/responseId), B2a/B2b split, B3 hardened to observer-only + wrapper-owned redactor, B4 12-pair offline matrix, B5 allowlisted env reader (not whole-env Map), B6 reframed as code-oz-original typed ProviderDiagnostic, B7 deferred behind compiled-binary keepalive test, B8 downgraded to model lifecycle guard. S1 catalog -> docs only; R1-R5 unchanged with R1 annotated demand-gated for meta-providers.

…tests (MR-1) Closes the block-push miss surfaced by Codex during the pi-mono comparison. Line 9 had drifted four milestones behind: it still described v0.13.0-alpha.0 / PE-1 closed / 1983 tests. Actual state is M16 production CLI completion (per-task cursor, dispatch build/verify/review under M14 + M15 authorities), W3-lite tarballs shipped at v0.14.0-alpha.0, and the package.json version is 0.17.0-alpha.0. Test count now reflects the bun test baseline (3108 pass / 1 skip). PE-1 trust-boundary discipline and live xAI gating language preserved.

…ck (B5) Closes pi-mono borrow B5 with the synthesis MR-3 hardening (locked in docs/comparison/08-pi-mono/SYNTHESIS.md). Pi-mono's getProcEnv parses the entire /proc/self/environ into a Map and caches it; that violates rule 13 by capturing every env var indefinitely. This implementation: - readEnv(allowedKeys: readonly string[]) -> Record<string,string> - Reads process.env first. - On Linux + Bun + Object.keys(process.env).length === 0 (the oven-sh/bun#27802 empty-env signal), falls back to a one-shot, allowlist-filtered read of /proc/self/environ for ONLY the requested keys. - No module-level cache. - No log/serialize of unrelated vars. - Never throws on /proc read failure. 7 Bun tests cover the API + the platform-skip fallback path.

…nt (B4) Closes pi-mono borrow B4 (synthesis LD-2). Code-oz already runs cross-family handoff in production at every M14 Reviewer panel call and M10 Debate runtime round, but had no explicit test for the round-trip. A future change to artifact projection, the M11 capability-routed wrapper, or the M14 panel fan-out could break the property silently. Implementation: - tests/cross-family-handoff.test.ts uses a tiny in-test FamilyAlias wrapper around a single shared FakeProvider so each of {claude, codex, gemini, xai} reports its own (id, family) without modifying src/providers/fake.ts. - test.each parametrizes over the 12 directional family pairs (4 ids x 3 not-self). - For each pair: invoke source -> serialize assistant content + tool_calls to an explicit artifact path -> invoke target with only that path in req.files. Assert source family != target family, byte-identical content survives, agent_invoked carries the source provider id, target manifest contains only the passed-in path (rule 13 + rule 18). Test count: 3108 -> 3127 (12 new test.each cases + 7 from B5 util-env tests; full suite passes 3127 / 1 skip / 0 fail).

# Conflicts: # CLAUDE.md

coderabbitai · 2026-05-11T02:54:23Z

Warning

Rate limit exceeded

@omerakben has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 49 minutes and 27 seconds before requesting another review.

You’ve run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: ace75b5b-7e69-48f5-b998-f90813de1664

📥 Commits

Reviewing files that changed from the base of the PR and between 075af4a and b1a85d0.

📒 Files selected for processing (9)

CLAUDE.md
docs/comparison/08-pi-mono/CODEX_BRIEFING.md
docs/comparison/08-pi-mono/CODEX_RESPONSE.md
docs/comparison/08-pi-mono/COMPARISON.md
docs/comparison/08-pi-mono/SYNTHESIS.md
docs/comparison/README.md
src/util/env.ts
tests/cross-family-handoff.test.ts
tests/util-env.test.ts

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/pi-mono-borrows

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist

Code Review

This pull request adds a detailed comparison between code-oz and pi-mono, including synthesis documentation for future feature 'borrows'. It also introduces a new environment variable utility with a Linux-specific fallback for Bun and a cross-family handoff integration test. Feedback identifies an outdated test count in CLAUDE.md and suggests a more efficient single-pass implementation for reading the Linux environment buffer.

gemini-code-assist · 2026-05-11T02:56:01Z

 `code-oz` is a standalone Bun + TypeScript CLI that boots an adaptive multi-agent software-company simulation over a hybrid phase-graph + agentic sub-orchestration spine. Hard SDLC gates between phases (file-based, schema-validated). Cross-family adversarial review. Non-technical-user intent elicitation at the front. Multi-provider via `IAgentProvider` (Claude / Codex / Gemini SDKs reading CLI OAuth tokens).

-Status: **v0.17.0-alpha.0 — M16 closed.** Production CLI completion (per-task cursor, dispatch infra, milestone-level e2e through the binary): `code-oz run`, `approve`, and `doctor` are wired end-to-end across DEFINE → REVIEW; full `resume` command remains M17. 3108 offline tests pass (+402 across M16); live xAI gated behind `CODE_OZ_LIVE_PROVIDER_TESTS=xai` + `CODE_OZ_LIVE_XAI_MODEL=<grok-variant>`. M16 R0/R1/R2 closed (8 production bugs caught by C12 e2e + 4 by Codex R1; per-commit cross-model peer review pattern validated for shared infra). Latest tag pushed: `v0.17.0-alpha.0` (2026-05-10).
+Status: **v0.17.0-alpha.0 — M16 closed and pushed.** Production CLI completion: per-task cursor (`bun run dev run`), build/verify/review dispatch under M14 Reviewer panel + M15 debate-policy scheduler authorities, exit-code discipline, prod seams, phase-locks, validators. M13 (role-cost policy under `budgets.global`) shipped alongside W3-lite tarballs at v0.14.0-alpha.0 (Mach-O `code-oz` darwin tarball; smoke 5/5; ralph-loop overnight closure 2026-05-02). PE-1 (xAI direct HTTP, OpenAI-compatible subset, strict request-body allowlist; trust-boundary discipline in `docs/references/provider-contract.md` § "Auth model — subprocess delegation + API-key transmission (v0.1)") shipped at v0.13.0-alpha.0; review trail in `docs/research/CODEX_REVIEW_PE1.md`. **3108 offline tests pass** (1 skip is the live xAI integration gated behind `CODE_OZ_LIVE_PROVIDER_TESTS=xai` + `CODE_OZ_LIVE_XAI_MODEL=<grok-variant>`). PE-2 demand-gated; M17+ pending milestone planning.


The test count in the status line appears to be outdated. The PR description indicates that the test suite now has 3196 passing tests (up from 3108 after merging main), but this line still references 3108.

gemini-code-assist · 2026-05-11T02:56:01Z

+function readAllowedProcEnv(allowedKeys: readonly string[]): Record<string, string> {
+  let environ: Buffer
+  try {
+    environ = readFileSync(PROC_SELF_ENVIRON)
+  } catch {
+    return {}
+  }
+
+  const values: Record<string, string> = {}
+  const seen = new Set<string>()
+  for (const key of allowedKeys) {
+    if (seen.has(key)) {
+      continue
+    }
+    seen.add(key)
+
+    const value = findProcEnvValue(environ, key)
+    if (value !== undefined) {
+      values[key] = value
+    }
+  }
+  return values
+}
+
+function findProcEnvValue(environ: Buffer, key: string): string | undefined {
+  if (key === '' || key.includes('=')) {
+    return undefined
+  }
+
+  const needle = Buffer.from(`${key}=`, 'utf8')
+  let offset = 0
+
+  while (offset < environ.length) {
+    const match = environ.indexOf(needle, offset)
+    if (match === -1) {
+      return undefined
+    }
+
+    if (match === 0 || environ[match - 1] === 0) {
+      const valueStart = match + needle.length
+      const nul = environ.indexOf(0, valueStart)
+      const valueEnd = nul === -1 ? environ.length : nul
+      return environ.subarray(valueStart, valueEnd).toString('utf8')
+    }
+
+    offset = match + 1
+  }
+
+  return undefined
+}


The current implementation of readAllowedProcEnv is inefficient as it performs a full buffer scan for every key in allowedKeys (O(K * E)). A single-pass approach iterating through the null-terminated records in /proc/self/environ is more efficient and simplifies the logic by removing the need for findProcEnvValue.

function readAllowedProcEnv(allowedKeys: readonly string[]): Record<string, string> { let environ: Buffer try { environ = readFileSync(PROC_SELF_ENVIRON) } catch { return {} } const allowedSet = new Set(allowedKeys) const values: Record<string, string> = {} let start = 0 while (start < environ.length) { const end = environ.indexOf(0, start) const limit = end === -1 ? environ.length : end const entry = environ.subarray(start, limit) const eqIdx = entry.indexOf(61) // '=' if (eqIdx !== -1) { const key = entry.subarray(0, eqIdx).toString('utf8') if (allowedSet.has(key)) { values[key] = entry.subarray(eqIdx + 1).toString('utf8') } } if (end === -1) break start = end + 1 } return values }

Copilot

Pull request overview

Adds an allowlisted environment-variable reader (with a Linux Bun /proc/self/environ fallback) and expands offline test coverage around cross-family REVIEW handoffs, alongside documentation updates capturing the “08 pi-mono” comparison packet and updating project status/index docs.

Changes:

Introduce readEnv(allowedKeys) in src/util/env.ts, including a Bun-on-Linux /proc/self/environ fallback that only decodes allowlisted keys.
Add offline tests: env allowlist behavior and a 12-direction cross-family assistant-message handoff matrix using FakeProvider.
Update comparison index/docs for the new “08 pi-mono” session and refresh CLAUDE.md status text.

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
tests/util-env.test.ts	New unit tests for allowlisted env reads via `readEnv`.
tests/cross-family-handoff.test.ts	New offline matrix test asserting cross-family REVIEW handoff invariants + byte-preserving file manifest handoff.
src/util/env.ts	New allowlisted env reader with `/proc/self/environ` fallback under Bun/Linux empty-env condition.
docs/comparison/README.md	Adds row for 08 pi-mono and removes pi-mono from backlog list.
docs/comparison/08-pi-mono/SYNTHESIS.md	Adds the pi-mono synthesis/decision record.
docs/comparison/08-pi-mono/COMPARISON.md	Adds the pi-mono comparison writeup (initial analysis).
docs/comparison/08-pi-mono/CODEX_BRIEFING.md	Adds the debate prompt captured for Codex.
docs/comparison/08-pi-mono/CODEX_RESPONSE.md	Adds the verbatim Codex response captured for the session.
CLAUDE.md	Updates the project status line/details.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+function shouldReadProcEnv(): boolean {
+  return process.platform === 'linux'
+    && process.versions?.bun !== undefined
+    && Object.keys(process.env).length === 0
+}


+        const fake = new FakeProvider({ strict: true })
+        const aliases = makeAliases(fake)
+        const registry = new ProviderRegistry({ providers: aliases })
+        const aliasById = new Map(aliases.map((alias) => [alias.id, alias]))
+


+---
+template: pi-mono
+template-path: ~/Projects/agents/templates/pi-mono
+template-status: audited (CLAUDE.md influence library row "pi-mono → Streaming event model + multi-provider abstraction")
+session: 2026-05-10


+## What ships from this comparison (acceptance gates per item)
+
+Nothing ships from the comparison itself — this is research, not implementation. Locked decisions become inputs to:
+
+- **Next M13 follow-up commit** picks up B2a + the `cacheSessionId` privacy invariant test.


 `code-oz` is a standalone Bun + TypeScript CLI that boots an adaptive multi-agent software-company simulation over a hybrid phase-graph + agentic sub-orchestration spine. Hard SDLC gates between phases (file-based, schema-validated). Cross-family adversarial review. Non-technical-user intent elicitation at the front. Multi-provider via `IAgentProvider` (Claude / Codex / Gemini SDKs reading CLI OAuth tokens).

-Status: **v0.17.0-alpha.0 — M16 closed.** Production CLI completion (per-task cursor, dispatch infra, milestone-level e2e through the binary): `code-oz run`, `approve`, and `doctor` are wired end-to-end across DEFINE → REVIEW; full `resume` command remains M17. 3108 offline tests pass (+402 across M16); live xAI gated behind `CODE_OZ_LIVE_PROVIDER_TESTS=xai` + `CODE_OZ_LIVE_XAI_MODEL=<grok-variant>`. M16 R0/R1/R2 closed (8 production bugs caught by C12 e2e + 4 by Codex R1; per-commit cross-model peer review pattern validated for shared infra). Latest tag pushed: `v0.17.0-alpha.0` (2026-05-10).
+Status: **v0.17.0-alpha.0 — M16 closed and pushed.** Production CLI completion: per-task cursor (`bun run dev run`), build/verify/review dispatch under M14 Reviewer panel + M15 debate-policy scheduler authorities, exit-code discipline, prod seams, phase-locks, validators. M13 (role-cost policy under `budgets.global`) shipped alongside W3-lite tarballs at v0.14.0-alpha.0 (Mach-O `code-oz` darwin tarball; smoke 5/5; ralph-loop overnight closure 2026-05-02). PE-1 (xAI direct HTTP, OpenAI-compatible subset, strict request-body allowlist; trust-boundary discipline in `docs/references/provider-contract.md` § "Auth model — subprocess delegation + API-key transmission (v0.1)") shipped at v0.13.0-alpha.0; review trail in `docs/research/CODEX_REVIEW_PE1.md`. **3108 offline tests pass** (1 skip is the live xAI integration gated behind `CODE_OZ_LIVE_PROVIDER_TESTS=xai` + `CODE_OZ_LIVE_XAI_MODEL=<grok-variant>`). PE-2 demand-gated; M17+ pending milestone planning.


…rows

omerakben added 5 commits May 10, 2026 14:28

Merge remote-tracking branch 'origin/main' into feat/pi-mono-borrows

b1a85d0

# Conflicts: # CLAUDE.md

Copilot AI review requested due to automatic review settings May 11, 2026 02:54

Copilot started reviewing on behalf of omerakben May 11, 2026 02:54 View session

gemini-code-assist Bot reviewed May 11, 2026

View reviewed changes

omerakben merged commit bf5dc5a into main May 11, 2026
3 checks passed

Copilot AI reviewed May 11, 2026

View reviewed changes

omerakben added a commit that referenced this pull request May 11, 2026

merge(main): integrate PR #26 pi-mono; combine status lines + README …

aa168ca

…rows

omerakben deleted the feat/pi-mono-borrows branch May 30, 2026 03:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(util): allowlisted env reader (B5) + test(integration): cross-family handoff matrix (B4) + 08-pi-mono comparison#26

feat(util): allowlisted env reader (B5) + test(integration): cross-family handoff matrix (B4) + 08-pi-mono comparison#26
omerakben merged 5 commits into
mainfrom
feat/pi-mono-borrows

omerakben commented May 11, 2026

Uh oh!

coderabbitai Bot commented May 11, 2026

Rate limit exceeded

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 11, 2026

Uh oh!

gemini-code-assist Bot May 11, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

omerakben commented May 11, 2026

Summary

Verification

Test plan

Uh oh!

coderabbitai Bot commented May 11, 2026

Rate limit exceeded

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 11, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants