
V.2.7.0#3

Merged
PeterTheSavage merged 33 commits into PeterTheSavage:DEV from openclaw:main
Feb 14, 2026
Conversation

@PeterTheSavage PeterTheSavage commented Feb 14, 2026

Summary by Sourcery

Refine Ollama native API integration and history limits, centralize auth/provider helpers and CLI routing, update hook logging, and adjust workspace/bootstrap behavior while expanding coverage and tests.

New Features:

  • Add a dedicated Ollama native streaming implementation using /api/chat with tool-calling support and abort/maxTokens handling.
  • Introduce a standalone CLI routing module for health/status/sessions/agents/memory commands with lazy imports.
  • Expose a generic provider-auth helper module reused across auth flows and plugin provider choices.
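
The lazy-import routing described above can be sketched roughly as follows. `RouteSpec`, `findRoutedCommand`, and the module path are illustrative stand-ins, not the actual `src/cli/program/routes.ts` exports; the point is that the command module is only imported once its route matches, keeping CLI startup fast:

```typescript
type RouteSpec = {
  path: string[];
  run: (argv: string[]) => Promise<boolean>;
};

const routes: RouteSpec[] = [
  {
    path: ["status"],
    run: async (argv) => {
      // The command module is loaded lazily, only when this route matches.
      const modulePath = "./commands/status"; // hypothetical module path
      const { statusCommand } = await import(modulePath);
      await statusCommand({ flags: argv });
      return true;
    },
  },
];

// Match a parsed argv path against the registered routes.
function findRoutedCommand(path: string[]): RouteSpec | undefined {
  return routes.find(
    (r) => r.path.length === path.length && r.path.every((seg, i) => seg === path[i]),
  );
}
```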

Bug Fixes:

  • Apply provider channel/group historyLimit when compacting embedded-runner history to prevent unbounded channel sessions from overflowing context windows.
  • Stop auto-creating HEARTBEAT.md during workspace bootstrap so heartbeat behavior matches documented semantics.
  • Ensure models:list resolves forward-compat and registry-backed models before falling back to direct resolution, and correctly detects local providers.
  • Use native Ollama api type and normalized base URLs instead of OpenAI-compatible defaults, removing the need for streaming workarounds.
  • Route hook loader diagnostics through the structured logging subsystem instead of console.* calls.

Enhancements:

  • Extend model API/config schemas with an ollama api type and wire resolved model/provider base URLs through Ollama streaming.
  • Refine Telegram media batching timings for more responsive media-group and text-fragment processing in tests.
  • Avoid creating heartbeat path metadata from ensureAgentWorkspace results and tighten bootstrap file initialization.
  • Tighten embedded-runner stream function selection to plug in Ollama-native streaming while preserving stable SDK streamFn for other providers.
  • Share isLocalBaseUrl helper across model commands and registry logic.
  • Adjust browser control server tests and parallel test isolation to cover status/start and tab operations under a single suite.
  • Modernize ModelRegistry mocks in models.list tests to avoid dependency on actual SDK types and methods.
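
As a rough illustration of the base-URL normalization mentioned above (the PR's exact rules may differ), converting an OpenAI-compatible Ollama base URL back to the native root could look like this; the function name is hypothetical:

```typescript
// Strip trailing slashes, then a trailing OpenAI-compat "/v1" segment,
// so the native endpoints (e.g. /api/chat) resolve from the host root.
function normalizeOllamaBaseUrl(baseUrl: string): string {
  return baseUrl.replace(/\/+$/, "").replace(/\/v1$/, "");
}
```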

Documentation:

  • Update Ollama provider docs to describe native /api/chat usage, revised baseUrl/api values, and legacy OpenAI-compatible configuration tradeoffs, including streaming/tool-calling notes.

Tests:

  • Replace npm-pack based plugin archive creation in tests with tar-based packing for portability and determinism.
  • Expand history-limit tests to cover channel/group providers, per-DM overrides, and undefined historyLimit cases.
  • Add coverage for Ollama native streaming, NDJSON parsing, and message/tool-call conversion behavior.
  • Add tests for browser control server status/start, tab listing/opening/focus-conflict handling, and adjust parallel test configuration.
  • Extend hook loader tests to assert graceful handling without depending on console.error.
  • Add workspace bootstrap tests ensuring HEARTBEAT.md is not created automatically and heartbeatPath is omitted.
  • Refine Telegram media-group and text-fragment timing tests to match updated flush delays and lower timeouts.
  • Update model list forward-compat tests to use resolveForwardCompatModel, registry plumbing, and simplified model structures.

steipete and others added 14 commits February 14, 2026 01:07
… compaction bypass (#11356) thanks @shadril238

Verified:
- pnpm build
- pnpm check
- pnpm test (ran; one unrelated existing failure in models forward-compat test)
- pnpm vitest src/agents/pi-embedded-runner.history-limit-from-session-key.test.ts

Co-authored-by: shadril238 <63901551+shadril238@users.noreply.github.com>
Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>
…ing (#11853)

Merged via /review-pr -> /prepare-pr -> /merge-pr.

Prepared head SHA: 0a723f9
Co-authored-by: BrokenFinger98 <115936166+BrokenFinger98@users.noreply.github.com>
Co-authored-by: steipete <58493+steipete@users.noreply.github.com>
Reviewed-by: @steipete
…loader (#11029) thanks @shadril238

Verified:
- pnpm build
- pnpm check
- pnpm test

Co-authored-by: shadril238 <63901551+shadril238@users.noreply.github.com>
Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>
…shadril238

Verified:
- pnpm install --frozen-lockfile
- pnpm build
- pnpm check
- pnpm test

Co-authored-by: shadril238 <63901551+shadril238@users.noreply.github.com>
Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>
Copilot AI review requested due to automatic review settings February 14, 2026 00:24

sourcery-ai Bot commented Feb 14, 2026

Reviewer's Guide

Refactors CLI routing and provider auth helpers into dedicated modules, introduces a native Ollama streaming implementation with config/schema/docs updates, tightens embedded-runner history limits for channel/group sessions, stops auto-creating HEARTBEAT.md on workspace bootstrap, and replaces ad-hoc logging/NDJSON/URL logic with reusable helpers plus expanded tests.

Sequence diagram for Ollama native streaming in embedded runner

sequenceDiagram
  actor User
  participant EmbeddedRunner as EmbeddedRunner_runEmbeddedAttempt
  participant Agent as PiEmbeddedAgent
  participant OllamaStream as createOllamaStreamFn
  participant OllamaAPI as Ollama_api_chat

  User->>EmbeddedRunner: start embedded run with model api=ollama
  EmbeddedRunner->>EmbeddedRunner: resolve model and config
  EmbeddedRunner->>EmbeddedRunner: detect model.api == ollama
  EmbeddedRunner->>OllamaStream: createOllamaStreamFn(ollamaBaseUrl)
  OllamaStream-->>EmbeddedRunner: StreamFn
  EmbeddedRunner->>Agent: set agent.streamFn = StreamFn

  User->>Agent: send messages context
  Agent->>Agent: agent.streamFn(model, context, options)
  Agent->>OllamaAPI: POST /api/chat (stream=true, tools, num_ctx, num_predict)
  OllamaAPI-->>Agent: NDJSON stream chunks
  Agent->>OllamaStream: parseNdjsonStream(reader)
  OllamaStream-->>Agent: OllamaChatResponse chunks
  Agent->>OllamaStream: buildAssistantMessage(finalResponse, modelInfo)
  OllamaStream-->>Agent: AssistantMessage (text, toolCall content)
  Agent-->>User: streamed assistant events

  User->>Agent: cancel request (abort)
  Agent->>OllamaAPI: abort fetch via signal
  OllamaAPI-->>Agent: stream closed
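
The abort path in the diagram above amounts to threading the caller's `AbortSignal` through to `fetch`, so cancelling the run closes the HTTP stream. A minimal sketch, assuming Node 18+ global `fetch` (the function name is illustrative, not the PR's actual export):

```typescript
async function streamChat(url: string, body: unknown, signal?: AbortSignal) {
  const response = await fetch(url, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(body),
    signal, // aborting the controller cancels the request and the stream
  });
  if (!response.ok || !response.body) {
    throw new Error(`Ollama request failed: ${response.status}`);
  }
  // Caller reads NDJSON chunks from this reader until done.
  return response.body.getReader();
}
```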

Sequence diagram for CLI routed commands with new routes module

sequenceDiagram
  actor User
  participant CLI as CliEntryPoint
  participant Router as Routes_findRoutedCommand
  participant Route as RouteSpec_run
  participant Command as CommandModule

  User->>CLI: invoke `openclaw status --json`
  CLI->>CLI: parse argv to path [status]
  CLI->>Router: findRoutedCommand(path)
  Router-->>CLI: RouteSpec for status
  CLI->>CLI: prepareRoutedCommand(argv)
  CLI->>Route: route.run(argv)
  Route->>Command: dynamic import statusCommand
  Command-->>Route: statusCommand
  Route->>Command: statusCommand({ flags }, defaultRuntime)
  Command-->>Route: Promise resolved
  Route-->>CLI: true (handled)
  CLI-->>User: print status output and exit

Class diagram for shared provider auth helper module

classDiagram
  class ProviderAuthHelpers {
    <<module>>
    +resolveProviderMatch(providers, rawProvider) ProviderPlugin
    +pickAuthMethod(provider, rawMethod) ProviderAuthMethod
    +mergeConfigPatch(base, patch) T
    +applyDefaultModel(cfg, model) OpenClawConfig
  }

  class ProviderPlugin {
    +id string
    +aliases string[]
    +auth ProviderAuthMethod[]
  }

  class ProviderAuthMethod {
    +id string
    +label string
  }

  class OpenClawConfig {
    +agents any
    +models any
  }

  class AuthCommands {
    <<module>>
    +resolveRequestedLoginProviderOrThrow(providers, rawProvider) ProviderPlugin
    +loginWithProvider()
  }

  class AuthChoicePluginProvider {
    <<module>>
    +applyAuthChoicePluginProvider(params, options)
  }

  ProviderAuthHelpers ..> ProviderPlugin : uses
  ProviderAuthHelpers ..> ProviderAuthMethod : uses
  ProviderAuthHelpers ..> OpenClawConfig : uses

  AuthCommands ..> ProviderAuthHelpers : uses
  AuthChoicePluginProvider ..> ProviderAuthHelpers : uses
  AuthCommands ..> ProviderPlugin : uses
  AuthCommands ..> ProviderAuthMethod : uses
  AuthChoicePluginProvider ..> ProviderPlugin : uses
  AuthChoicePluginProvider ..> ProviderAuthMethod : uses

File-Level Changes

Change Details Files
Extract CLI route handling into a dedicated routes module and decouple it from command registration.
  • Remove inline RouteSpec and routes from the command registry type and entries.
  • Introduce src/cli/program/routes.ts implementing RouteSpec definitions with dynamic imports for health, status, sessions, agents list, and memory status commands.
  • Update CLI routing to use the new routes module and add unit tests while deleting an obsolete browser server test entry from the parallel test list.
src/cli/program/command-registry.ts
src/cli/route.ts
src/cli/program/routes.ts
src/cli/program/routes.test.ts
scripts/test-parallel.mjs
Deduplicate provider auth helper logic and reuse it across auth flows.
  • Create a shared provider-auth-helpers module that implements provider matching, auth-method selection, deep config patch merging, and default-model application.
  • Replace duplicated implementations in auth login and plugin-provider auth-choice code with imports from the shared helper.
  • Adjust imports accordingly and keep existing behavior intact.
src/commands/provider-auth-helpers.ts
src/commands/models/auth.ts
src/commands/auth-choice.apply.plugin-provider.ts
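
A minimal sketch of the deep config-patch merge described above; the actual `mergeConfigPatch` signature in `src/commands/provider-auth-helpers.ts` may differ:

```typescript
function isPlainObject(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// Recursively merge a patch into a base config: nested plain objects are
// merged key-by-key, while arrays and scalars are replaced wholesale.
function mergeConfigPatch<T extends Record<string, unknown>>(
  base: T,
  patch: Record<string, unknown>,
): T {
  const out: Record<string, unknown> = { ...base };
  for (const [key, value] of Object.entries(patch)) {
    const prev = out[key];
    out[key] =
      isPlainObject(prev) && isPlainObject(value)
        ? mergeConfigPatch(prev, value)
        : value;
  }
  return out as T;
}
```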
Switch Ollama integration to native /api/chat streaming with new model api type and config/schema/docs updates.
  • Add an ollama model API type to config types and zod schema, and export a shared native base URL constant.
  • Implement a custom Ollama streaming adapter that converts SDK messages to Ollama chat messages, parses NDJSON streaming responses, accumulates content/tool_calls, maps options (num_ctx, num_predict), and exposes a StreamFn factory with tests.
  • Wire the embedded runner to use the Ollama stream function when the resolved model api is ollama (including provider/model baseUrl resolution and abort handling), otherwise falling back to the SDK streamSimple.
  • Update implicit provider construction to default Ollama to api: "ollama" and normalize /v1 base URLs, drop the streaming:false workaround from model discovery, and update e2e/unit tests to match new behavior.
  • Refresh Ollama provider documentation to describe the native API, new baseUrl/api settings, and legacy OpenAI-compatible configuration and streaming caveats.
src/agents/ollama-stream.ts
src/agents/ollama-stream.test.ts
src/agents/pi-embedded-runner/run/attempt.ts
src/agents/models-config.providers.ts
src/agents/models-config.providers.ollama.e2e.test.ts
src/config/types.models.ts
src/config/zod-schema.core.ts
docs/providers/ollama.md
CHANGELOG.md
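
The NDJSON accumulation described above can be sketched as follows. The chunk shape follows Ollama's documented `/api/chat` streaming response (`message.content`, `message.tool_calls`, `done`), while the helper name and return shape are illustrative rather than the PR's actual exports:

```typescript
type OllamaChatChunk = {
  message?: { content?: string; tool_calls?: unknown[] };
  done?: boolean;
};

// Accumulate a raw NDJSON payload: one JSON object per line, with text
// content concatenated and tool calls collected in arrival order.
function accumulateNdjson(raw: string): { content: string; toolCalls: unknown[] } {
  let content = "";
  const toolCalls: unknown[] = [];
  for (const line of raw.split("\n")) {
    const trimmed = line.trim();
    if (!trimmed) continue; // skip blank lines between chunks
    const chunk = JSON.parse(trimmed) as OllamaChatChunk;
    if (chunk.message?.content) content += chunk.message.content;
    if (chunk.message?.tool_calls) toolCalls.push(...chunk.message.tool_calls);
  }
  return { content, toolCalls };
}
```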
Tighten embedded-runner history limits and extend them to channel/group sessions while keeping backward compatibility.
  • Generalize getDmHistoryLimitFromSessionKey into getHistoryLimitFromSessionKey to support dm/direct, channel, and group kinds, returning per-DM overrides, dmHistoryLimit, or provider historyLimit depending on session type.
  • Expose the new helper through the public embedded-runner export while aliasing the old name for backward compatibility.
  • Expand and update e2e tests to cover channel/group historyLimit behavior across providers, undefined cases, and backward-compat session keys.
src/agents/pi-embedded-runner/history.ts
src/agents/pi-embedded-runner.ts
src/agents/pi-embedded-runner.get-dm-history-limit-from-session-key.returns-undefined-sessionkey-is-undefined.e2e.test.ts
src/agents/pi-embedded-runner.history-limit-from-session-key.test.ts
CHANGELOG.md
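
The resolution order described above can be sketched like this; the config field names are assumptions inferred from the summary, not the actual types in `src/agents/pi-embedded-runner/history.ts`:

```typescript
type SessionKind = "dm" | "channel" | "group";

type HistoryConfig = {
  dmOverrides?: Record<string, number>; // hypothetical per-DM overrides
  dmHistoryLimit?: number;
  historyLimit?: number; // provider-level limit for channel/group sessions
};

// DM sessions prefer a per-peer override, then the DM-wide limit;
// channel/group sessions fall back to the provider-wide historyLimit.
function getHistoryLimitFromSessionKey(
  kind: SessionKind,
  peerId: string,
  cfg: HistoryConfig,
): number | undefined {
  if (kind === "dm") {
    return cfg.dmOverrides?.[peerId] ?? cfg.dmHistoryLimit;
  }
  return cfg.historyLimit;
}
```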
Adjust agents workspace bootstrap so HEARTBEAT.md is not auto-created and update tests accordingly.
  • Stop treating HEARTBEAT.md as a template-driven bootstrap file in ensureAgentWorkspace, so absence of the file preserves heartbeat behavior described in docs.
  • Update path lists and return shape from ensureAgentWorkspace to exclude heartbeatPath and avoid creating the file even in brand-new workspaces.
  • Add an e2e test verifying that workspace initialization with ensureBootstrapFiles does not create HEARTBEAT.md while still creating agents and bootstrap files.
src/agents/workspace.ts
src/agents/workspace.e2e.test.ts
CHANGELOG.md
Centralize shared utilities and refactor tests/logging for better reuse and determinism.
  • Move isLocalBaseUrl from the models list command into shared model helpers and reuse it in registry code, updating imports and type annotations.
  • Replace console.* logging in hook loader with a subsystem logger, and update tests to assert behavior without spying on console.
  • Simplify ModelRegistry mocking in tests by removing dependence on the real class signatures.
  • Make plugin install tests use a tar-based packToArchive helper instead of spawning npm, and adapt test callers to async.
  • Share image test data between formats and tune Telegram test timing constants to reduce flakiness, plus reduce screenshot test sizes.
  • Remove a redundant verbose-on auto-reply test case and adjust browser server tests to cover status/start and tabs operations in a single suite.
src/commands/models/shared.ts
src/commands/models/list.registry.ts
src/commands/models/list.list-command.ts
src/hooks/loader.ts
src/hooks/loader.test.ts
src/commands/models/list.list-command.forward-compat.test.ts
src/commands/models.list.test.ts
src/plugins/install.e2e.test.ts
src/web/auto-reply.web-auto-reply.compresses-common-formats-jpeg-cap.e2e.test.ts
src/telegram/bot.media.downloads-media-file-path-no-file-download.e2e.test.ts
src/browser/server.skips-default-maxchars-explicitly-set-zero.test.ts
src/auto-reply/reply.raw-body.test.ts
src/browser/screenshot.e2e.test.ts
scripts/test-parallel.mjs
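
A hedged sketch of what a shared `isLocalBaseUrl` helper might check; the actual host list in the PR is not shown here, so the loopback patterns below are assumptions:

```typescript
// Treat loopback hosts (and *.localhost) as local; anything unparseable
// is conservatively treated as non-local.
function isLocalBaseUrl(baseUrl: string): boolean {
  try {
    const { hostname } = new URL(baseUrl);
    return (
      hostname === "localhost" ||
      hostname === "127.0.0.1" ||
      hostname === "[::1]" ||
      hostname.endsWith(".localhost")
    );
  } catch {
    return false;
  }
}
```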

@gemini-code-assist

Summary of Changes

Hello @PeterTheSavage, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed.

This pull request introduces significant improvements to the platform's model integration, particularly for Ollama, by switching to its native API for enhanced streaming and tool-calling capabilities. It also refines agent behavior with more granular control over session history limits and optimizes workspace initialization by preventing unnecessary file creation. Developer experience is boosted through CLI routing refactoring, standardized hook logging, and more robust plugin installation. Additionally, several internal tests were updated and improved for better coverage and efficiency.

Highlights

  • Ollama Native API Integration: Implemented native Ollama API (/api/chat) support, enabling reliable streaming and tool calling without relying on OpenAI-compatible endpoints. This includes updated configuration, documentation, and dedicated streaming logic.
  • Improved History Limit Logic: Enhanced history compaction for agents and Discord channels/groups by applying historyLimit configurations, preventing context window overflows in long-running sessions. The getDmHistoryLimitFromSessionKey function was refactored and renamed to getHistoryLimitFromSessionKey to support broader session types.
  • Workspace Bootstrap Refinement: Stopped the automatic creation of HEARTBEAT.md during workspace initialization, aligning with documentation that allows missing heartbeat files to still trigger the heartbeat process.
  • CLI Command Routing Refactor: Decoupled CLI command routing logic from the main command registry into a dedicated routes.ts module, improving modularity and maintainability.
  • Hook Loader Logging Update: Replaced direct console.* output with a subsystem logger for hook loading errors and warnings, ensuring consistent and structured logging.
  • Antigravity Model Forward Compatibility: Added forward compatibility candidates for Antigravity Opus 4.6 models, allowing them to resolve from existing 4.5 templates.
  • Plugin Installation Reliability: Switched the plugin archive creation mechanism from npm pack to tar in tests, enhancing the reliability and consistency of plugin installation.
  • Image Processing Test Improvements: Refactored browser screenshot and web auto-reply image tests to use sharp for image generation and adjusted dimensions, leading to more efficient and robust testing.




@sourcery-ai sourcery-ai Bot left a comment


Hey - I've found 3 issues

Prompt for AI Agents
Please address the comments from this code review:

## Individual Comments

### Comment 1
<location> `src/agents/ollama-stream.ts:326-329` </location>
<code_context>
+          headers.Authorization = `Bearer ${options.apiKey}`;
+        }
+
+        const response = await fetch(chatUrl, {
+          method: "POST",
+          headers,
+          body: JSON.stringify(body),
+          signal: options?.signal,
+        });
</code_context>

<issue_to_address>
**suggestion (bug_risk):** Only a single `done` event is emitted; there is no token-by-token streaming despite using a streaming HTTP API.

Right now the NDJSON stream is fully buffered and only a final `{ type: "done" }` event is emitted, so callers see this as a non-streaming completion. If consumers (e.g. UIs) rely on incremental `type: "update"` events with partial content, this will break parity with other streaming providers. Either emit intermediate events as chunks arrive (keeping `done` for the final accumulated message), or clearly document/enforce that Ollama integrations only provide final messages so callers don’t expect token-level streaming.

Suggested implementation:

```typescript
      // For Ollama we currently only expose a *final* assistant message.
      // Emit a last `update` event with the accumulated content so that
      // consumers relying on `type: "update"` continue to work, even
      // though Ollama does not provide token-by-token updates here.
      if (finalAssistantMessage && finalAssistantMessage.content?.length) {
        controller.enqueue({
          type: "update",
          id: finalAssistantMessage.id,
          role: finalAssistantMessage.role ?? "assistant",
          delta: {
            content: finalAssistantMessage.content,
          },
          // Mark this as the final update in the stream for parity with
          // other providers that send multiple incremental updates.
          isFinal: true,
        });
      }

      // Finalize the stream.
      controller.enqueue({ type: "done" });
      controller.close();

```

I only see a small portion of the file, so you will need to align the above with your actual implementation:

1. Ensure you have a variable (here referenced as `finalAssistantMessage`) that contains the final, fully-accumulated assistant message just before the stream is closed.  
   - If you currently only have a plain string (e.g. `fullText`), then construct the `update` payload accordingly:
   ```ts
   if (fullText) {
     controller.enqueue({
       type: "update",
       role: "assistant",
       delta: { content: fullText },
       isFinal: true,
     });
   }
   ```
   and drop the `id` field if you don't use it elsewhere.

2. If your event shape for other providers differs (different field names for `delta`, `role`, etc.), match the structure you use in the rest of your streaming layer so callers see a consistent `type: "update"` event.

3. Update any relevant documentation / JSDoc for `ollamaStream` (or the exported function in this file) to explicitly state that:
   - Ollama currently produces a *single* `type: "update"` event that contains the full, final message.
   - There is no token-level / chunk-level streaming, even though the HTTP API is streaming.
   This satisfies the “clearly document/enforce that Ollama integrations only provide final messages” part of your review comment.

4. If you want *true* incremental updates in the future, you can refactor the NDJSON parsing loop so each incoming chunk updates the accumulated content and emits an `update` event before the final one; the `isFinal` flag can then be used to signal the last update instead of emitting only once at the end.
</issue_to_address>

### Comment 2
<location> `src/commands/models/list.list-command.ts:101-115` </location>
<code_context>
         continue;
       }
-      const model = modelByKey.get(entry.key);
+      let model = modelByKey.get(entry.key);
+      if (!model && modelRegistry) {
+        const forwardCompat = resolveForwardCompatModel(
+          entry.ref.provider,
+          entry.ref.model,
+          modelRegistry,
+        );
+        if (forwardCompat) {
+          model = forwardCompat;
+          modelByKey.set(entry.key, forwardCompat);
+        }
+      }
+      if (!model) {
+        model = resolveModel(entry.ref.provider, entry.ref.model, undefined, cfg).model;
+      }
</code_context>

<issue_to_address>
**suggestion (performance):** Consider caching the `resolveModel` fallback into `modelByKey` as well to avoid repeated lookups.

Currently, when `modelByKey` and `resolveForwardCompatModel` both fail, you call `resolveModel(...)` but don’t cache the result. If multiple entries share the same key, this leads to repeated `resolveModel` calls. Mirroring the forward-compat caching here would avoid redundant work and keep behavior consistent:
```ts
if (!model) {
  const resolved = resolveModel(entry.ref.provider, entry.ref.model, undefined, cfg);
  model = resolved.model;
  if (model) {
    modelByKey.set(entry.key, model);
  }
}
```

```suggestion
      let model = modelByKey.get(entry.key);
      if (!model && modelRegistry) {
        const forwardCompat = resolveForwardCompatModel(
          entry.ref.provider,
          entry.ref.model,
          modelRegistry,
        );
        if (forwardCompat) {
          model = forwardCompat;
          modelByKey.set(entry.key, forwardCompat);
        }
      }
      if (!model) {
        const resolved = resolveModel(entry.ref.provider, entry.ref.model, undefined, cfg);
        model = resolved.model;
        if (model) {
          modelByKey.set(entry.key, model);
        }
      }
```
</issue_to_address>

### Comment 3
<location> `src/plugins/install.e2e.test.ts:23-26` </location>
<code_context>
-}
-
-function packToArchive({
+async function packToArchive({
   pkgDir,
   outDir,
</code_context>

<issue_to_address>
**question (bug_risk):** Switching from `npm pack` to a raw tarball changes packaging semantics; verify this still matches `installPluginFromArchive` expectations.

The old `npm pack` call applied npm’s packaging rules (`files` in `package.json`, `.npmignore`/`.gitignore`, pruning dev files). The new approach archives the directory verbatim, which may change which files are included. Please confirm that `installPluginFromArchive` only relies on a root `package.json` in a single top-level folder, or otherwise verify that the new tarball layout and contents still match its expectations.
</issue_to_address>


Comment on lines +326 to +329
const response = await fetch(chatUrl, {
method: "POST",
headers,
body: JSON.stringify(body),
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

suggestion (bug_risk): Only a single done event is emitted; there is no token-by-token streaming despite using a streaming HTTP API.

Right now the NDJSON stream is fully buffered and only a final { type: "done" } event is emitted, so callers see this as a non-streaming completion. If consumers (e.g. UIs) rely on incremental type: "update" events with partial content, this will break parity with other streaming providers. Either emit intermediate events as chunks arrive (keeping done for the final accumulated message), or clearly document/enforce that Ollama integrations only provide final messages so callers don’t expect token-level streaming.

Suggested implementation:

      // For Ollama we currently only expose a *final* assistant message.
      // Emit a last `update` event with the accumulated content so that
      // consumers relying on `type: "update"` continue to work, even
      // though Ollama does not provide token-by-token updates here.
      if (finalAssistantMessage && finalAssistantMessage.content?.length) {
        controller.enqueue({
          type: "update",
          id: finalAssistantMessage.id,
          role: finalAssistantMessage.role ?? "assistant",
          delta: {
            content: finalAssistantMessage.content,
          },
          // Mark this as the final update in the stream for parity with
          // other providers that send multiple incremental updates.
          isFinal: true,
        });
      }

      // Finalize the stream.
      controller.enqueue({ type: "done" });
      controller.close();

I only see a small portion of the file, so you will need to align the above with your actual implementation:

  1. Ensure you have a variable (here referenced as finalAssistantMessage) that contains the final, fully-accumulated assistant message just before the stream is closed.

    • If you currently only have a plain string (e.g. fullText), then construct the update payload accordingly:
    if (fullText) {
      controller.enqueue({
        type: "update",
        role: "assistant",
        delta: { content: fullText },
        isFinal: true,
      });
    }

    and drop the id field if you don't use it elsewhere.

  2. If your event shape for other providers differs (different field names for delta, role, etc.), match the structure you use in the rest of your streaming layer so callers see a consistent type: "update" event.

  3. Update any relevant documentation / JSDoc for ollamaStream (or the exported function in this file) to explicitly state that:

    • Ollama currently produces a single type: "update" event that contains the full, final message.
    • There is no token-level / chunk-level streaming, even though the HTTP API is streaming.
      This satisfies the “clearly document/enforce that Ollama integrations only provide final messages” part of your review comment.
  4. If you want true incremental updates in the future, you can refactor the NDJSON parsing loop so each incoming chunk updates the accumulated content and emits an update event before the final one; the isFinal flag can then be used to signal the last update instead of emitting only once at the end.

Comment on lines +101 to +115
let model = modelByKey.get(entry.key);
if (!model && modelRegistry) {
const forwardCompat = resolveForwardCompatModel(
entry.ref.provider,
entry.ref.model,
modelRegistry,
);
if (forwardCompat) {
model = forwardCompat;
modelByKey.set(entry.key, forwardCompat);
}
}
if (!model) {
model = resolveModel(entry.ref.provider, entry.ref.model, undefined, cfg).model;
}
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

suggestion (performance): Consider caching the resolveModel fallback into modelByKey as well to avoid repeated lookups.

Currently, when modelByKey and resolveForwardCompatModel both fail, you call resolveModel(...) but don’t cache the result. If multiple entries share the same key, this leads to repeated resolveModel calls. Mirroring the forward-compat caching here would avoid redundant work and keep behavior consistent:

```typescript
if (!model) {
  const resolved = resolveModel(entry.ref.provider, entry.ref.model, undefined, cfg);
  model = resolved.model;
  if (model) {
    modelByKey.set(entry.key, model);
  }
}
```
Suggested change

```diff
 let model = modelByKey.get(entry.key);
 if (!model && modelRegistry) {
   const forwardCompat = resolveForwardCompatModel(
     entry.ref.provider,
     entry.ref.model,
     modelRegistry,
   );
   if (forwardCompat) {
     model = forwardCompat;
     modelByKey.set(entry.key, forwardCompat);
   }
 }
-if (!model) {
-  model = resolveModel(entry.ref.provider, entry.ref.model, undefined, cfg).model;
-}
+if (!model) {
+  const resolved = resolveModel(entry.ref.provider, entry.ref.model, undefined, cfg);
+  model = resolved.model;
+  if (model) {
+    modelByKey.set(entry.key, model);
+  }
+}
```

Comment on lines +23 to 26
```typescript
async function packToArchive({
  pkgDir,
  outDir,
  outName,
```

question (bug_risk): Switching from npm pack to a raw tarball changes packaging semantics; verify this still matches installPluginFromArchive expectations.

The old npm pack call applied npm’s packaging rules (files in package.json, .npmignore/.gitignore, pruning dev files). The new approach archives the directory verbatim, which may change which files are included. Please confirm that installPluginFromArchive only relies on a root package.json in a single top-level folder, or otherwise verify that the new tarball layout and contents still match its expectations.
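One way to make the assumption explicit in a test is to check the archive's entry list for the layout `npm pack` produces: a single top-level folder (conventionally `package/`) containing `package.json`. This helper is a hypothetical sketch operating on a list of tar entry paths, not code from the PR:

```typescript
// Returns true when every entry lives under one shared top-level folder
// and that folder contains a root package.json.
function hasSingleRootWithPackageJson(entries: string[]): boolean {
  const roots = new Set(entries.map((e) => e.split("/")[0]).filter(Boolean));
  if (roots.size !== 1) return false;
  const [root] = roots;
  return entries.includes(`${root}/package.json`);
}
```

Running a check like this against the raw tarball would surface layout drift (e.g. dev files that `npm pack` would have pruned, or a missing wrapper folder) before `installPluginFromArchive` sees it.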


Copilot AI left a comment


Pull request overview

This PR is a version release (v2.7.0) containing multiple bug fixes, feature enhancements, and test optimizations across the codebase.

Changes:

  • Added native Ollama API integration with /api/chat endpoint support for reliable streaming and tool calling
  • Implemented channel/group history limit support to prevent context overflow in long-running sessions
  • Fixed workspace bootstrap to stop auto-creating HEARTBEAT.md file as documented
  • Refactored test infrastructure for better performance and reliability
  • Added subsystem logging to hooks loader
  • Consolidated CLI routing code and extracted shared provider auth helpers
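The channel/group history-limit change above can be pictured as a resolver keyed on the session-key prefix. This is an illustrative sketch of the idea; the real `getHistoryLimitFromSessionKey` signature and key format in the PR may differ:

```typescript
type HistoryLimits = { dm?: number; channel?: number };

// Resolve a history limit from a session key such as "dm:123" or
// "channel:456"; group sessions share the channel limit, and dm/direct
// keep their existing behavior for backward compatibility.
function historyLimitForSessionKey(
  key: string | undefined,
  limits: HistoryLimits,
): number | undefined {
  if (!key) return undefined;
  const kind = key.split(":")[0];
  if (kind === "dm" || kind === "direct") return limits.dm;
  if (kind === "channel" || kind === "group") return limits.channel;
  return undefined;
}
```

Returning `undefined` for unknown kinds (and for an undefined key) preserves the previous "no limit" behavior rather than guessing.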

Reviewed changes

Copilot reviewed 47 out of 47 changed files in this pull request and generated no comments.

Summary per file:

| File | Description |
| --- | --- |
| src/agents/ollama-stream.ts | New native Ollama /api/chat streaming implementation with NDJSON parsing and tool calling support |
| src/agents/ollama-stream.test.ts | Comprehensive test coverage for Ollama streaming functions |
| src/agents/pi-embedded-runner/run/attempt.ts | Integrate Ollama native streaming when api type is 'ollama' |
| src/agents/models-config.providers.ts | Switch Ollama provider to use native API, remove streaming:false workaround |
| src/agents/models-config.providers.ollama.e2e.test.ts | Update tests for native Ollama API integration |
| src/agents/pi-embedded-runner/history.ts | Add channel/group historyLimit support, keep backward compat for dm/direct |
| src/agents/pi-embedded-runner.ts | Export getHistoryLimitFromSessionKey, keep deprecated alias |
| src/agents/pi-embedded-runner.history-limit-from-session-key.test.ts | Test coverage for history limit with channel/group sessions |
| src/agents/pi-embedded-runner.get-dm-history-limit-from-session-key.returns-undefined-sessionkey-is-undefined.e2e.test.ts | Extended tests for all session types |
| src/agents/workspace.ts | Remove HEARTBEAT.md auto-creation during bootstrap |
| src/agents/workspace.e2e.test.ts | Verify HEARTBEAT.md is not created |
| src/agents/model-forward-compat.ts | Export forward-compat candidates for reuse |
| src/hooks/loader.ts | Replace console.* with subsystem logger |
| src/hooks/loader.test.ts | Remove console spy tests |
| src/config/zod-schema.core.ts | Add 'ollama' to ModelApiSchema |
| src/config/types.models.ts | Add 'ollama' to ModelApi type |
| src/commands/provider-auth-helpers.ts | Extract shared auth helper functions |
| src/commands/models/auth.ts | Use extracted auth helpers |
| src/commands/auth-choice.apply.plugin-provider.ts | Use extracted auth helpers |
| src/commands/models/shared.ts | Add isLocalBaseUrl utility function |
| src/commands/models/list.registry.ts | Use isLocalBaseUrl from shared, extract forward-compat candidates |
| src/commands/models/list.list-command.ts | Add forward-compat + resolveModel fallback for missing models |
| src/commands/models/list.list-command.forward-compat.test.ts | Update mock for resolveForwardCompatModel |
| src/commands/models.list.test.ts | Simplify MockModelRegistry without SDK type dependencies |
| src/commands/models.list.e2e.test.ts | Deleted (784 lines removed) |
| src/cli/route.ts | Import findRoutedCommand from routes.ts |
| src/cli/program/routes.ts | New file extracting route definitions from command-registry |
| src/cli/program/routes.test.ts | Tests for routes module |
| src/cli/program/command-registry.ts | Remove route definitions and findRoutedCommand |
| src/plugins/install.e2e.test.ts | Replace npm pack with tar library for test reliability |
| src/web/auto-reply.web-auto-reply.compresses-common-formats-jpeg-cap.e2e.test.ts | Share randomBytes buffer across iterations |
| src/telegram/bot.media.downloads-media-file-path-no-file-download.e2e.test.ts | Reduce test timing constants for faster execution |
| src/browser/screenshot.e2e.test.ts | Use sharp.create instead of crypto.randomBytes |
| src/browser/server.skips-default-maxchars-explicitly-set-zero.test.ts | Merge tests from deleted file |
| src/browser/server.serves-status-starts-browser-requested.test.ts | Deleted (merged into skips-default-maxchars file) |
| src/auto-reply/reply.raw-body.test.ts | Remove duplicate test case |
| scripts/test-parallel.mjs | Remove deleted test file from isolation list |
| docs/providers/ollama.md | Update docs for native API, remove streaming workaround section |
| CHANGELOG.md | Document all fixes and changes |


@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request updates the version to v.2.7.0, bringing a range of fixes, refactorings, and documentation updates. It includes a significant refactoring of the Ollama integration to use its native API, introduces lazy loading of CLI commands for improved performance, and offers more granular history limit controls. The codebase also sees improvements through refactored shared helper functions and enhanced logging consistency. However, a critical vulnerability exists in the new hook loading mechanism within src/hooks/loader.ts. This logic allows the execution of arbitrary JavaScript files from the agent's workspace via dynamic import(), which can be exploited through prompt injection to achieve Remote Code Execution (RCE) and sandbox escape. It is strongly recommended to disable loading executable code from the workspace or implement a robust code signing and verification mechanism to address this security risk.
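The mitigation the reviewer recommends (code signing and verification before loading workspace hooks) could be sketched as a content-hash check gating the dynamic `import()`. The pinned-hash mechanism below is hypothetical, not something the PR implements:

```typescript
import { createHash } from "node:crypto";

// Verify a hook file's SHA-256 against a hash pinned in trusted config
// (outside the agent-writable workspace) before it is ever imported.
function isHookTrusted(contents: Buffer, pinnedSha256: string): boolean {
  const actual = createHash("sha256").update(contents).digest("hex");
  return actual === pinnedSha256;
}
```

The key property is that the pinned hash lives somewhere the agent cannot write, so a prompt-injected edit to a hook file changes its digest and the loader refuses to import it.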

```typescript
try {
  yield JSON.parse(trimmed) as OllamaChatResponse;
} catch {
  console.warn("[ollama-stream] Skipping malformed NDJSON line:", trimmed.slice(0, 120));
```

medium

For consistency with other logging changes in this PR (e.g., in src/hooks/loader.ts), please use the subsystem logger here instead of console.warn. This ensures all logs are routed through the standard logging system.

You'll need to add the following at the top of the file:

```typescript
import { createSubsystemLogger } from "../logging/subsystem.js";

const log = createSubsystemLogger("ollama-stream");
```

Suggested change

```diff
-console.warn("[ollama-stream] Skipping malformed NDJSON line:", trimmed.slice(0, 120));
+log.warn(`Skipping malformed NDJSON line: ${trimmed.slice(0, 120)}`);
```

Comment on lines +268 to +271
```typescript
console.warn(
  "[ollama-stream] Skipping malformed trailing data:",
  buffer.trim().slice(0, 120),
);
```

medium

As with the previous comment, please use the subsystem logger (log.warn) for consistency.

```typescript
log.warn(`Skipping malformed trailing data: ${buffer.trim().slice(0, 120)}`);
```

Spacefish and others added 7 commits February 14, 2026 01:36
#15847)

Merged via /review-pr -> /prepare-pr -> /merge-pr.

Prepared head SHA: dc03ce5
Co-authored-by: Spacefish <375633+Spacefish@users.noreply.github.com>
Co-authored-by: steipete <58493+steipete@users.noreply.github.com>
Reviewed-by: @steipete
…arosstale

Verified:
- pnpm build
- pnpm check
- pnpm test

Co-authored-by: arosstale <117890364+arosstale@users.noreply.github.com>
Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>
… thanks @arosstale

Verified:
- pnpm build
- pnpm check
- pnpm test

Co-authored-by: arosstale <117890364+arosstale@users.noreply.github.com>
Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>
… failure state (#15848)

Merged via /review-pr -> /prepare-pr -> /merge-pr.

Prepared head SHA: 188a40e
Co-authored-by: AI-Reviewer-QS <255312808+AI-Reviewer-QS@users.noreply.github.com>
Co-authored-by: steipete <58493+steipete@users.noreply.github.com>
Reviewed-by: @steipete
…11063) thanks @ikari-pl

Verified:
- pnpm build
- pnpm check
- pnpm test (full run; transient lobster timeout rerun passed)

Co-authored-by: ikari-pl <811702+ikari-pl@users.noreply.github.com>
Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>
battman21 and others added 10 commits February 14, 2026 01:51
* fix: enforce Telegram 100-command limit with warning (#5787)

Telegram's setMyCommands API rejects requests with more than 100 commands.
When skills + custom + plugin commands exceed the limit, truncate to 100
and warn the user instead of silently failing on every startup.

* fix: enforce Telegram menu cap + keep hidden commands callable (#15844) (thanks @battman21)

---------

Co-authored-by: Peter Steinberger <steipete@gmail.com>
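The truncate-and-warn behavior from the commit above can be sketched as a small guard in front of the setMyCommands call. The helper name and warn callback are illustrative; only the 100-command limit comes from Telegram's API:

```typescript
// Telegram's setMyCommands rejects payloads with more than 100 commands,
// so cap the list and warn instead of failing on every startup.
const TELEGRAM_COMMAND_LIMIT = 100;

function capCommands<T>(commands: T[], warn: (msg: string) => void): T[] {
  if (commands.length <= TELEGRAM_COMMAND_LIMIT) return commands;
  warn(
    `Telegram allows at most ${TELEGRAM_COMMAND_LIMIT} commands; ` +
      `dropping ${commands.length - TELEGRAM_COMMAND_LIMIT} overflow entries.`,
  );
  return commands.slice(0, TELEGRAM_COMMAND_LIMIT);
}
```

Commands beyond the cap stay registered internally, which matches the follow-up fix keeping hidden commands callable even when they are absent from the menu.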
…isk (#11560)

* fix: preserve ${VAR} env var references when writing config back to disk

Fixes #11466

When config is loaded, ${VAR} references are resolved to their plaintext
values. Previously, writeConfigFile would serialize the resolved values,
silently replacing "${ANTHROPIC_API_KEY}" with "sk-ant-api03-..." in the
config file.

Now writeConfigFile reads the current file pre-substitution, and for each
value that matches what a ${VAR} reference would resolve to, restores the
original reference. Values the caller intentionally changed are kept as-is.

This fixes all 50+ writeConfigFile call sites (doctor, configure wizard,
gateway config.set/apply/patch, plugins, hooks, etc.) without requiring
any caller changes.

New files:
- src/config/env-preserve.ts — restoreEnvVarRefs() utility
- src/config/env-preserve.test.ts — 11 unit tests

* fix: remove global config env snapshot race

* docs(changelog): note config env snapshot race fix

---------

Co-authored-by: Peter Steinberger <steipete@gmail.com>
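The restore step described in the commit above can be illustrated on a flat string map: for each value whose pre-substitution form was a `${VAR}` reference that still resolves to the same value, write the reference back; anything the caller changed is kept verbatim. This is a minimal sketch of the idea, not the actual `restoreEnvVarRefs` API from src/config/env-preserve.ts:

```typescript
// resolved: values about to be written; rawBefore: the file's
// pre-substitution values; env: current environment for resolving refs.
function restoreEnvRefs(
  resolved: Record<string, string>,
  rawBefore: Record<string, string>,
  env: Record<string, string | undefined>,
): Record<string, string> {
  const out: Record<string, string> = {};
  for (const [key, value] of Object.entries(resolved)) {
    const prior = rawBefore[key];
    const match = prior?.match(/^\$\{(\w+)\}$/);
    // Restore the ${VAR} reference only when the caller left the value
    // untouched (i.e. it still equals what the reference resolves to).
    if (match && env[match[1]] === value) {
      out[key] = prior;
    } else {
      out[key] = value;
    }
  }
  return out;
}
```

This is what lets all existing writeConfigFile call sites keep working unchanged: intentional edits survive, while secrets that only exist because of substitution never reach disk in plaintext.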
…12364) thanks @arosstale

Verified:
- pnpm build
- pnpm check
- pnpm test

Co-authored-by: arosstale <117890364+arosstale@users.noreply.github.com>
Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>
#14532) thanks @arosstale

Verified:
- pnpm build
- pnpm check
- pnpm test

Co-authored-by: arosstale <117890364+arosstale@users.noreply.github.com>
Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>
@PeterTheSavage PeterTheSavage merged commit 09388f8 into PeterTheSavage:DEV Feb 14, 2026
1 of 3 checks passed
