
Ollama: cache model show metadata #64753

Merged
ImLukeF merged 4 commits into main from imlukef/local-ollama-show-cache
Apr 11, 2026

Conversation

Contributor

@ImLukeF ImLukeF commented Apr 11, 2026

Summary

  • cache Ollama /api/show enrichment results for discovery/model-picker flows when a model digest or modified timestamp is stable
  • reuse cached context-window and capability metadata across repeated enrich calls without changing model semantics
  • add targeted tests covering cache reuse and cache invalidation on digest changes

Testing

  • pnpm test extensions/ollama/src/provider-models.test.ts extensions/ollama/src/provider-models.ssrf.test.ts

@openclaw-barnacle openclaw-barnacle bot added size: S maintainer Maintainer-authored PR labels Apr 11, 2026
@ImLukeF ImLukeF marked this pull request as ready for review April 11, 2026 10:39
Copilot AI review requested due to automatic review settings April 11, 2026 10:39
Contributor

greptile-apps bot commented Apr 11, 2026

Greptile Summary

This PR adds a module-level Map cache for Ollama /api/show enrichment results, keyed on apiBase|modelName|digest (or modified_at as a fallback), with a 256-entry FIFO eviction cap. Cache hits skip the network call entirely; the cache is cleared between tests via an exported resetOllamaModelShowInfoCacheForTest helper.

  • Failed fetches are permanently cached. queryOllamaModelShowInfo never rejects — it swallows every error and returns {}. The empty-resolved promise is stored in the cache immediately, so if Ollama is temporarily down during the first enrichment call, that {} entry is served from cache on every subsequent discovery pass for the same model digest. Users end up with persistently missing context-window and capability metadata until the digest changes.

Confidence Score: 4/5

Safe to merge after addressing the failed-fetch caching issue; the P1 defect can lead to permanently missing model metadata for a session.

The cache hit/miss logic and eviction are structurally sound, and the tests are well-structured. The one P1 finding — that transient fetch failures are cached as permanent empty results for the lifetime of the process (until the model digest changes) — is a real defect that could silently degrade the model-picker UX.

extensions/ollama/src/provider-models.ts — specifically queryOllamaModelShowInfoCached (lines 145–162) and setOllamaModelShowCacheEntry (lines 78–88).


Reviews (1): Last reviewed commit: "Ollama: cache model show metadata"

Comment on lines +78 to +88
function setOllamaModelShowCacheEntry(key: string, value: Promise<OllamaModelShowInfo>): void {
  if (ollamaModelShowInfoCache.has(key)) {
    ollamaModelShowInfoCache.delete(key);
  } else if (ollamaModelShowInfoCache.size >= MAX_OLLAMA_SHOW_CACHE_ENTRIES) {
    const oldestKey = ollamaModelShowInfoCache.keys().next().value;
    if (typeof oldestKey === "string") {
      ollamaModelShowInfoCache.delete(oldestKey);
    }
  }
  ollamaModelShowInfoCache.set(key, value);
}
Contributor

P2 Unreachable has(key) branch

queryOllamaModelShowInfoCached only calls setOllamaModelShowCacheEntry after confirming the key is absent (if (cached) { return … } covers the present-key path). The ollamaModelShowInfoCache.has(key) branch in this setter can therefore never be true in practice — it is dead code that will silently mislead future readers who assume it handles a "refresh LRU position" case.


Contributor Author

Cleaned up in b2380c0c99 while fixing the empty-result caching path. The setter now only handles bounded insertion/eviction, without the dead has(key) branch.
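The simplified setter described in that reply might look like the following sketch. This is an illustration, not the merged code from b2380c0c99: it assumes the caller only inserts keys it has already confirmed are absent, so the dead has(key) branch can be dropped and the function reduces to bounded FIFO insertion.

```typescript
// Hypothetical simplified setter (sketch, not the PR's actual code).
// Assumes callers never insert a key that is already present.
const MAX_OLLAMA_SHOW_CACHE_ENTRIES = 256;
const ollamaModelShowInfoCache = new Map<string, Promise<unknown>>();

function setOllamaModelShowCacheEntry(key: string, value: Promise<unknown>): void {
  // Bounded insertion: once at capacity, evict the oldest entry.
  // Map preserves insertion order, so keys().next() yields the oldest key.
  if (ollamaModelShowInfoCache.size >= MAX_OLLAMA_SHOW_CACHE_ENTRIES) {
    const oldestKey = ollamaModelShowInfoCache.keys().next().value;
    if (typeof oldestKey === "string") {
      ollamaModelShowInfoCache.delete(oldestKey);
    }
  }
  ollamaModelShowInfoCache.set(key, value);
}
```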

Comment on lines +145 to +162
async function queryOllamaModelShowInfoCached(
  apiBase: string,
  model: Pick<OllamaTagModel, "name" | "digest" | "modified_at">,
): Promise<OllamaModelShowInfo> {
  const cacheKey = buildOllamaModelShowCacheKey(apiBase, model);
  if (!cacheKey) {
    return await queryOllamaModelShowInfo(apiBase, model.name);
  }

  const cached = ollamaModelShowInfoCache.get(cacheKey);
  if (cached) {
    return await cached;
  }

  const pending = queryOllamaModelShowInfo(apiBase, model.name);
  setOllamaModelShowCacheEntry(cacheKey, pending);
  return await pending;
}
Contributor

P1 Failed fetches permanently cached as empty results

queryOllamaModelShowInfo always resolves — it catches every error and non-OK response and returns {}. That settled-to-empty promise is stored in the cache before await pending is called. On any subsequent call for the same name+digest the cached empty promise is served immediately, so a model whose enrichment failed (Ollama timeout, restart, 500) will never be retried until its digest changes. Users end up with permanently missing context-window and capability metadata for that session.

To fix this, remove the cache entry when the resolved value is empty (e.g., in a .then() on pending that calls ollamaModelShowInfoCache.delete(cacheKey) when both contextWindow and capabilities are absent), so the next discovery pass can attempt a fresh fetch.


Contributor Author

Addressed in b2380c0c99. The cached lookup now evicts empty {} results so transient /api/show failures are retried on the next discovery pass instead of sticking for the session.
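A minimal sketch of that evict-on-empty behaviour follows. The helper name queryCached and its fetcher parameter are illustrative, not the PR's actual signature; the point is that an empty result (the sentinel queryOllamaModelShowInfo returns after swallowing an error) is removed from the cache so the next discovery pass retries.

```typescript
// Sketch only: assumes the fetcher resolves to {} on failure, as the
// PR's queryOllamaModelShowInfo does. Field names follow the review.
interface OllamaModelShowInfo {
  contextWindow?: number;
  capabilities?: string[];
}

const cache = new Map<string, Promise<OllamaModelShowInfo>>();

async function queryCached(
  cacheKey: string,
  fetcher: () => Promise<OllamaModelShowInfo>,
): Promise<OllamaModelShowInfo> {
  const cached = cache.get(cacheKey);
  if (cached) return await cached;

  // Store the in-flight promise so concurrent callers share one fetch.
  const pending = fetcher();
  cache.set(cacheKey, pending);
  const result = await pending;
  // A swallowed error resolves to {}: drop that entry so the next
  // discovery pass retries instead of serving the empty result forever.
  if (result.contextWindow === undefined && result.capabilities === undefined) {
    cache.delete(cacheKey);
  }
  return result;
}
```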

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 23c10dfd05


Comment on lines +159 to +160
const pending = queryOllamaModelShowInfo(apiBase, model.name);
setOllamaModelShowCacheEntry(cacheKey, pending);

P2 Do not cache failed /api/show lookups

This stores the in-flight /api/show result in cache before knowing whether it succeeded, so transient failures (timeouts/non-OK responses that currently resolve to {} in queryOllamaModelShowInfo) get memoized under a stable digest/modified timestamp key. In that case, later discovery/model-picker runs for the same model never retry and keep missing contextWindow/capabilities (for example, losing vision) until process restart or digest change. Cache insertion should be conditional on a successful metadata fetch, or failed/empty results should evict the key.


Copilot AI left a comment

Pull request overview

Adds an in-memory cache for Ollama /api/show enrichment so repeated model discovery/model-picker flows can reuse context-window/capability metadata when a model’s digest/modified timestamp is stable.

Changes:

  • Introduce a bounded (256 entry) cache for /api/show results keyed by base URL + model + digest/modified timestamp.
  • Route enrichOllamaModelsWithContext() through the cached lookup path.
  • Add tests asserting cache reuse on stable digest and invalidation when digest changes.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File | Description
extensions/ollama/src/provider-models.ts | Adds /api/show cache + cached query helper and wires it into enrichment.
extensions/ollama/src/provider-models.test.ts | Adds tests covering cache reuse and invalidation; clears cache between tests.

Comment on lines +159 to +161
const pending = queryOllamaModelShowInfo(apiBase, model.name);
setOllamaModelShowCacheEntry(cacheKey, pending);
return await pending;
Copilot AI Apr 11, 2026

The cache key is built from a normalized base URL (via resolveOllamaApiBase), but the actual request is still made with the original apiBase argument. If callers ever pass a base URL with /v1 (or trailing slashes) in one call and a normalized base in another, this can cause cache poisoning (e.g., caching {} from a 404 at /v1/api/show) and then reusing it for the correct base. Normalize apiBase once in this function (or inside queryOllamaModelShowInfo) and use the same normalized value both for the cache key and the fetch URL.

Contributor Author

Addressed in b2380c0c99. queryOllamaModelShowInfo() now normalizes the base URL up front and uses that same normalized value for both the fetch path and the SSRF policy, so the cache key and request target stay aligned.
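The normalization the reply describes could be sketched as below. This is an assumption about what resolveOllamaApiBase-style normalization does (trimming trailing slashes and an OpenAI-compatible /v1 suffix); the actual helper in the PR may behave differently. The point is that the same normalized value must feed both the cache key and the fetch URL.

```typescript
// Hypothetical normalizer (sketch; the PR's resolveOllamaApiBase may differ).
// Strips trailing slashes and a trailing "/v1" so that, e.g.,
// "http://localhost:11434/v1/" and "http://localhost:11434" key identically.
function normalizeOllamaApiBase(apiBase: string): string {
  let base = apiBase.trim().replace(/\/+$/, "");
  if (base.endsWith("/v1")) {
    base = base.slice(0, -"/v1".length);
  }
  return base;
}
```

Using one normalized base for both the key and the request avoids the cache-poisoning case Copilot flagged, where a 404 from /v1/api/show gets cached under the key of the correct base.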

@ImLukeF ImLukeF force-pushed the imlukef/local-ollama-show-cache branch from fe9b15a to de56dfb on April 11, 2026 12:29
@ImLukeF ImLukeF merged commit 3da5822 into main Apr 11, 2026
21 of 27 checks passed
@ImLukeF ImLukeF deleted the imlukef/local-ollama-show-cache branch April 11, 2026 12:30
Contributor Author

ImLukeF commented Apr 11, 2026

Merged via squash.


aisle-research-bot bot commented Apr 11, 2026

🔒 Aisle Security Analysis

We found 2 potential security issue(s) in this PR:

# | Severity | Title
1 | 🟡 Medium | Cache key collision allows /api/show metadata cache poisoning in Ollama provider
2 | 🔵 Low | Unbounded cache key size from remote Ollama model metadata can cause memory pressure (DoS)

1. 🟡 Cache key collision allows /api/show metadata cache poisoning in Ollama provider

Severity: Medium
CWE: CWE-20
Location: extensions/ollama/src/provider-models.ts:67-76

Description

A module-level cache (ollamaModelShowInfoCache) keys entries by string concatenation of untrusted model fields (model.name, model.digest/modified_at) separated with |.

  • model.name, digest, and modified_at originate from the remote Ollama endpoint (/api/tags) and are not validated/escaped before being used in the cache key.
  • Because the key format is a raw concatenated string, an Ollama endpoint that returns values containing the delimiter (|) can deliberately create colliding keys, causing one model’s /api/show metadata (context window / capabilities) to be reused for a different model.
  • Downstream, capabilities influences behavior (e.g., enabling vision/tooling paths), so poisoning this cache can lead to capability spoofing/state confusion.

Vulnerable code:

return `${resolveOllamaApiBase(apiBase)}|${model.name}|${version}`;

Recommendation

Avoid building cache keys via ambiguous string concatenation of untrusted components.

Use a structured key that cannot collide, such as nested Maps or a stable encoding that escapes separators.

Example (nested maps):

const cache = new Map<string, Map<string, Map<string, Promise<OllamaModelShowInfo>>>>();

function getEntry(apiBase: string, modelName: string, version: string) {
  let byBase = cache.get(apiBase);
  if (!byBase) cache.set(apiBase, (byBase = new Map()));
  let byName = byBase.get(modelName);
  if (!byName) byBase.set(modelName, (byName = new Map()));
  return { byName, key: version };
}

Additionally, validate/normalize model.name (and other fields used for keying) to an expected character set (e.g., disallow control characters and |) before using them as cache identifiers.
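A guard along those lines could be as small as the following sketch. The name isSafeKeyComponent and the 256-character cap are illustrative choices, assuming `|` is the key delimiter as in the vulnerable code above.

```typescript
// Illustrative validation sketch: reject key components that contain the
// "|" delimiter or control characters, are empty, or are unreasonably long.
// The 256-char bound is an assumption, not a value from the PR.
function isSafeKeyComponent(value: string): boolean {
  return (
    value.length > 0 &&
    value.length <= 256 &&
    !/[|\u0000-\u001f]/.test(value)
  );
}
```

Callers would skip caching (return an undefined key) whenever any component fails this check, which also addresses the key-size concern in finding 2.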

2. 🔵 Unbounded cache key size from remote Ollama model metadata can cause memory pressure (DoS)
Severity: Low
CWE: CWE-400
Location: extensions/ollama/src/provider-models.ts:67-76

Description

The /api/show metadata cache key is built by concatenating apiBase, model.name, and model.digest/modified_at obtained from /api/tags without any size limits.

  • fetchOllamaModels() populates OllamaTagModel fields directly from the server response (only filtering for truthy name).
  • queryOllamaModelShowInfoCached() uses those fields to construct a cache key string.
  • An attacker controlling the configured Ollama base URL (or a compromised Ollama endpoint) can return extremely large name, digest, or modified_at values, causing large cache key allocations.
  • Even with a 256-entry cap, 256 very large keys (and their associated Promise closures) can lead to significant memory usage and UI/process instability.

Vulnerable code:

const version = model.digest?.trim() || model.modified_at?.trim();
return `${resolveOllamaApiBase(apiBase)}|${model.name}|${version}`;

Recommendation

Mitigate memory pressure by ensuring cache keys are bounded in size and derived from validated inputs.

Options:

  1. Hash the key components instead of concatenating full strings:

     import { createHash } from "node:crypto";

     function buildOllamaModelShowCacheKey(apiBase: string, model: Pick<OllamaTagModel, "name" | "digest" | "modified_at">) {
       const version = model.digest?.trim() || model.modified_at?.trim();
       if (!version) return undefined;

       const raw = `${resolveOllamaApiBase(apiBase)}|${model.name}|${version}`;
       return createHash("sha256").update(raw).digest("hex");
     }

  2. Enforce maximum lengths for model.name and version (e.g., drop caching if too long):

     if (model.name.length > 256 || version.length > 256) return undefined;
Additionally, consider validating/sanitizing fields from /api/tags (e.g., ignore unusually large values) before using them in caching or UI logic.


Analyzed PR: #64753 at commit de56dfb

Last updated on: 2026-04-11T12:37:35Z

amittell pushed a commit to amittell/openclaw that referenced this pull request Apr 11, 2026
leonardsellem pushed a commit to leonardsellem/moltbot that referenced this pull request Apr 11, 2026