fix(131): address P1 review findings (tool-call, Dockerfile, fly.toml, CI) by andreiships-bot · Pull Request #4 · andreiships/opencode

andreiships-bot · 2026-02-21T14:32:59Z

Summary

Retroactive fixes for P1 findings from post-merge review of PRs #1, #2, #3.

tool-call.ts: Agent.defaultAgent() returns a name string, not Agent.Info — now resolved with Agent.get() before passing to ToolRegistry.tools(); agent.id on a string is undefined, fixed to pass agentName; use agent's configured model instead of hardcoded opencode/default
Dockerfile: Quote $(find ...) + add existence check; remove irrelevant BUN_RUNTIME_TRANSPILER_CACHE_PATH (compiled binary doesn't use Bun's transpiler)
fly.toml: min_machines_running: 0 → 1 to avoid cold starts on interactive sessions
smoke test: Replace hardcoded /tmp/ paths with mktemp tmpdir + EXIT trap for parallel-CI safety
build-push.yml: Gate :latest on stable tags only; add provenance: false; explicit platforms: linux/amd64

Test coverage

The tool-call.ts bugs were type-level, not behavioral:

No tool reads Tool.Context.agent, so agent: undefined causes no runtime failure
ToolRegistry.tools() receiving a string instead of Agent.Info also fails silently at runtime

Existing coverage that protects against regressions:

typecheck.yml — catches the Agent.Info/string type mismatch (would have blocked the original PRs if they had gone through PR flow)
packages/opencode/test/server/tool-call.test.ts — 4 integration tests covering 404, 400, unknown tool, and successful glob execution (added in PR [131-1] feat: add POST /session/:id/tool/call direct tool execution endpoint #1)

No new tests added — the existing test file is sufficient and bun typecheck is the right guard for this category of type-correctness fix.

PR [131-1] feat: add POST /session/:id/tool/call direct tool execution endpoint #1 inline comments: FINDING-1 (Agent.Info type mismatch), FINDING-2 (unused session model)
PR [131-2] feat: Dockerfile (Bun), fly.toml, and integration smoke test #2 inline comments: FINDING-1 (unquoted find), FINDING-2 (BUN env var), FINDING-3 (cold starts), FINDING-4 (tmp collisions)
PR [131-3] feat: CI pipeline — build and push opencode Docker image on tag #3 inline comments: FINDING-1 (:latest on pre-releases), FINDING-2 (provenance), FINDING-3 (platforms)

Test plan

bun typecheck passes (catches the Agent.Info fix)
bun test passes in packages/opencode (existing 4 tests)
Tag a v0.0.1-rc1 → verify :latest is NOT pushed
Tag a v0.0.1 → verify :latest IS pushed

Fixes issues found during retroactive review of PRs #1, #2, #3. tool-call.ts: - Agent.defaultAgent() returns a name string, not Agent.Info — resolve with Agent.get() before passing to ToolRegistry.tools() - Fix agent: agent.id (undefined on string) → agent: agentName - Use agent's configured model instead of hardcoded opencode/default Dockerfile: - Quote $(find ...) and add existence check to prevent cryptic failures when binary is missing - Remove BUN_RUNTIME_TRANSPILER_CACHE_PATH env var — irrelevant for a compiled native binary that does not use Bun's transpiler fly.toml: - min_machines_running: 0 → 1 to avoid cold starts on interactive sessions scripts/ci/test-opencode-integration.sh: - Replace hardcoded /tmp/ paths with mktemp tmpdir + EXIT trap to avoid collisions in parallel CI runs .github/workflows/build-push.yml: - Gate :latest push on stable tags only (no pre-release suffix like -rc1) - Add provenance: false to prevent OCI attestation manifests from breaking fly deploy digest resolution - Explicitly set platforms: linux/amd64 to avoid silent arch mismatches

github-actions · 2026-02-21T14:33:09Z

Thanks for your contribution!

This PR doesn't have a linked issue. All PRs must reference an existing issue.

Please:

Open an issue describing the bug/feature (if one doesn't exist)
Add Fixes #<number> or Closes #<number> to this PR description

See CONTRIBUTING.md for details.

andreiships-bot · 2026-02-21T14:45:51Z

andreiships-bot

Claude Review

See inline comments for details.

All blacksmith-* runner labels replaced: - blacksmith-4vcpu-ubuntu-2404 → ubicloud-standard-2 - blacksmith-8vcpu-ubuntu-2404-arm → ubicloud-standard-8-arm - blacksmith-4vcpu-ubuntu-2404-arm → ubicloud-standard-2-arm - blacksmith-4vcpu-windows-2025 → windows-latest (no Ubicloud Windows runner)

andreiships-bot · 2026-02-21T18:03:33Z

Claude Single-Pass Review

Summary

This PR is a well-scoped retroactive fix addressing P1 findings from post-merge reviews of PRs #1-3, plus a CI runner migration from Blacksmith to Ubicloud. The substantive logic fixes in tool-call.ts, Dockerfile, fly.toml, and the smoke test are all correct and improve reliability. No new issues introduced.

Findings

[FINDING-1] nit: P2 | packages/opencode/src/server/routes/tool-call.ts:491 | Agent.get() returns Info | undefined (lookup by key in a Record), so agentInfo can be undefined. The optional chain agentInfo?.model correctly handles the model fallback, and ToolRegistry.tools(modelCtx, agentInfo) receives undefined when the agent name is not in the registry. If ToolRegistry.tools has a non-null assertion on its second argument this silently degrades. The PR author acknowledged this is currently safe at runtime, but a defensive check if (!agentInfo) throw new NotFoundError(...) would make the failure explicit.

[FINDING-2] nit: P2 | .github/workflows/build-push.yml:50 | When steps.version.outputs.latest_tag is empty (pre-release tag), the tags block becomes a two-line string where the second line is a blank value. Docker Build Push action trims blank tags, so this works, but it's worth noting the implicit dependency on that trimming behavior. An explicit if: condition on the :latest tag push step would be clearer than relying on blank-line filtering in the multi-line string.

[FINDING-3] nit: P2 | .github/workflows/publish.yml:287 | Windows target now uses windows-latest (GitHub-hosted) instead of blacksmith-4vcpu-windows-2025. This means Windows CI builds will accrue GitHub Actions minutes. The commit message acknowledges "no Ubicloud Windows runner" — this is correct and acceptable, just noting the billing implication is not documented in the PR description.

Code Quality

Correctness verified: tool-call.ts type fixes are sound — Agent.defaultAgent() returns string, Agent.get(string) returns Info | undefined, optional chaining handles the undefined case
Dockerfile: set -e + quoted $binary + existence check is a strict improvement over the unquoted $(find ...) in the cp argument
fly.toml min_machines_running: 1 is appropriate for interactive sessions that should not cold-start
Smoke test: mktemp -d + trap 'rm -rf' EXIT pattern is correct for parallel-CI-safe temp isolation; cat /tmp/... | grep → grep file useless-use-of-cat cleanup is a bonus improvement
CI runner migration: all blacksmith-* labels correctly mapped to ubicloud-standard-* equivalents per the project's Ubicloud Runners rule
nit: Agent.get() return type is Info | undefined — passing undefined to ToolRegistry.tools() as the agent argument is currently safe but not explicitly guarded

Recommendation

[x] Approve with changes — the P2 nits above are optional hardening; the core fixes are correct and CI is green. Safe to merge.

andreiships-bot · 2026-02-21T18:03:35Z

Claude Review - Out-of-Diff Findings

The following findings are on lines outside the PR diff:

packages/opencode/src/server/routes/tool-call.ts:491
[FINDING-1] nit: P2 | Agent.get() returns Info | undefined (lookup by key in a Record), so agentInfo can be undefined. The optional chain agentInfo?.model correctly handles the model fallback, and ToolRegistry.tools(modelCtx, agentInfo) receives undefined when the agent name is not in the registry. If ToolRegistry.tools has a non-null assertion on its second argument this silently degrades. The PR author acknowledged this is currently safe at runtime, but a defensive check if (!agentInfo) throw new NotFoundError(...) would make the failure explicit.

.github/workflows/build-push.yml:50
[FINDING-2] nit: P2 | When steps.version.outputs.latest_tag is empty (pre-release tag), the tags block becomes a two-line string where the second line is a blank value. Docker Build Push action trims blank tags, so this works, but it's worth noting the implicit dependency on that trimming behavior. An explicit if: condition on the :latest tag push step would be clearer than relying on blank-line filtering in the multi-line string.

.github/workflows/publish.yml:287
[FINDING-3] nit: P2 | Windows target now uses windows-latest (GitHub-hosted) instead of blacksmith-4vcpu-windows-2025. This means Windows CI builds will accrue GitHub Actions minutes. The commit message acknowledges "no Ubicloud Windows runner" — this is correct and acceptable, just noting the billing implication is not documented in the PR description.

andreiships-bot · 2026-02-21T18:58:55Z

Gemini Deep Review

Summary

The PR successfully addresses several P1 review findings across CI workflows, Docker configuration, and the direct tool-call endpoint. Key improvements include the transition to Ubicloud runners, enhanced Docker binary detection, and fixing a type mismatch in the tool execution route.

Findings

[gemini-1] nit: P2 | .github/workflows/build-push.yml:32 | Bash-ism in CI script
The if [[ "$tag" != *-* ]]; then block uses [[ which is a non-POSIX bash extension. While GitHub Actions defaults to bash on Linux, using [ is more portable and consistent with the project's preference for POSIX shell scripts where possible.
Fix: Change [[ to [ and != to != (or use a portable string comparison). Actually, [[ is fine if bash is guaranteed, but [ is safer. Given the context, this is a minor nit.

[gemini-2] issue: P1 | packages/opencode/src/server/routes/tool-call.ts:64 | Potential undefined agentInfo
The code calls Agent.get(agentName) which could return undefined. While the modelCtx handles this via optional chaining (agentInfo?.model ?? ...), agentInfo is passed as the second argument to ToolRegistry.tools(modelCtx, agentInfo).
Verified via diff: const tools = await ToolRegistry.tools(modelCtx, agentInfo)
Fix: Ensure ToolRegistry.tools gracefully handles an undefined agent, or add a check if a valid agent is strictly required for tool resolution.

[gemini-3] nit: P2 | scripts/ci/test-opencode-integration.sh:78 | Fragile grep for SESSION_ID
The extraction of SESSION_ID uses grep -o '"id":"[^"]*"' ... | cut -d'"' -f4. This assumes a specific JSON format and may be fragile if the response contains other "id" fields or different spacing.
Fix: Use jq -r '.id' for robust JSON parsing, consistent with other parts of the codebase.

PR Metadata

Suggested PR Title: fix(opencode): address P1 findings for tool-call, Docker, and CI
Suggested Description Update: Added specific mention that the tool-call route was fixed to correctly resolve agent identity before fetching tools, addressing a type mismatch.

Questions

Does the POST /session/:id/tool/call endpoint in the opencode server have authentication middleware applied? Since it allows direct tool execution (including shell and filesystem access), it must be protected by the OPENCODE_SERVER_PASSWORD as specified in Spec 132.

Recommendation

[ ] Approve | [X] Approve with changes | [ ] Request changes

Cherry-picked from fork PR #4 (5cbe0a4), app code portion only. - Agent.defaultAgent() returns a name string, resolve with Agent.get() - Use agent's configured model instead of hardcoded opencode/default - Fix agent: agent.id (undefined on string) → agent: agentName

github-actions Bot added the needs:issue label Feb 21, 2026

andreiships-bot commented Feb 21, 2026

View reviewed changes

Comment thread .github/workflows/build-push.yml

andreiships-bot merged commit 5cbe0a4 into dev Feb 23, 2026
7 checks passed

andreiships-bot mentioned this pull request Mar 25, 2026

chore: upgrade fork to upstream v1.3.2 #27

Closed

3 tasks

This was referenced Mar 25, 2026

chore: apply remaining CI customizations + fix test API #28

Merged

chore: add repo guards to upstream-only workflows #29

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(131): address P1 review findings (tool-call, Dockerfile, fly.toml, CI)#4

fix(131): address P1 review findings (tool-call, Dockerfile, fly.toml, CI)#4
andreiships-bot merged 2 commits intodevfrom
fix/131-review-findings

andreiships-bot commented Feb 21, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Feb 21, 2026

Uh oh!

andreiships-bot commented Feb 21, 2026

Uh oh!

andreiships-bot left a comment

Uh oh!

Uh oh!

andreiships-bot commented Feb 21, 2026

Uh oh!

andreiships-bot commented Feb 21, 2026

Uh oh!

andreiships-bot commented Feb 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

andreiships-bot commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test coverage

Related

Test plan

Uh oh!

github-actions Bot commented Feb 21, 2026

Uh oh!

andreiships-bot commented Feb 21, 2026

Claude Single-Pass Review

Summary

Findings

Code Quality

Recommendation

Uh oh!

andreiships-bot left a comment

Choose a reason for hiding this comment

Claude Review

Uh oh!

Uh oh!

andreiships-bot commented Feb 21, 2026

Claude Single-Pass Review

Summary

Findings

Code Quality

Recommendation

Uh oh!

andreiships-bot commented Feb 21, 2026

Claude Review - Out-of-Diff Findings

Uh oh!

andreiships-bot commented Feb 21, 2026

Gemini Deep Review

Summary

Findings

PR Metadata

Questions

Recommendation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

andreiships-bot commented Feb 21, 2026 •

edited

Loading