feat: refresh Workers AI model handling #11

Merged: darron merged 1 commit into main from ai-model-update on May 8, 2026
Conversation

darron (Owner) commented May 8, 2026

Replace the deprecated Workers AI default model with @cf/zai-org/glm-4.7-flash and update staging to use the same model.

Add JSON mode, disable model thinking, and pass both token limit fields for ingest and location extraction so newer chat models keep returning parseable JSON within the existing budgets. Also accept OpenAI-compatible `choices` responses in AI text extraction for summaries, ingest, and location enrichment.
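As a rough illustration of the request and response handling described above, here is a minimal TypeScript sketch. It is not taken from the PR diff. The helper names `buildJsonChatOptions` and `extractAiText` are hypothetical, and the request field names (`response_format`, `max_tokens`, `max_completion_tokens`, `chat_template_kwargs.enable_thinking`) are assumptions about what "JSON mode", "both token limit fields", and "disable model thinking" map to; the actual names in the repository may differ.

```typescript
// Hypothetical request shape. Every field name here is an assumption,
// not copied from the PR.
interface JsonChatOptions {
  messages: Array<{ role: "system" | "user"; content: string }>;
  response_format: { type: "json_object" };
  max_tokens: number;
  max_completion_tokens: number;
  chat_template_kwargs: { enable_thinking: boolean };
}

// Build options for a JSON-only extraction call with thinking disabled.
// The same budget is sent under both token-limit names so that whichever
// one the model honors still enforces the existing limit.
function buildJsonChatOptions(
  system: string,
  user: string,
  tokenBudget: number,
): JsonChatOptions {
  return {
    messages: [
      { role: "system", content: system },
      { role: "user", content: user },
    ],
    response_format: { type: "json_object" },
    max_tokens: tokenBudget,
    max_completion_tokens: tokenBudget,
    chat_template_kwargs: { enable_thinking: false },
  };
}

// Loose union of the result shapes the extractor tolerates.
type AiTextResult =
  | string
  | {
      response?: unknown;
      choices?: Array<{ message?: { content?: unknown }; text?: unknown }>;
    }
  | null
  | undefined;

// Return the generated text from either a legacy `{ response }` payload
// or an OpenAI-compatible `{ choices: [...] }` payload, or null if
// neither shape is present.
function extractAiText(result: AiTextResult): string | null {
  if (typeof result === "string") return result;
  if (!result) return null;

  const legacy = result.response;
  if (typeof legacy === "string") return legacy;

  const first = result.choices?.[0];
  const content = first?.message?.content;
  if (typeof content === "string") return content;

  const text = first?.text;
  if (typeof text === "string") return text;

  return null;
}
```

A caller would typically pass the extracted string to `JSON.parse` inside a `try`/`catch` and treat a `null` result or a parse failure as a failed extraction rather than an empty one.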

Upgrade Wrangler and the generated lockfile to the current Workers runtime tooling required by the refreshed model/runtime stack.

Behavior change: AI extraction and summary calls now default to GLM 4.7 Flash, so outputs may differ slightly from the previous Llama model. Risk is mainly in JSON extraction quality and token-budget behavior; staging should be smoke tested before production rollout. Follow-up: watch ingest/location proposal quality and add model-specific regression fixtures if output drift appears.

darron self-assigned this May 8, 2026
darron added the enhancement label May 8, 2026
darron merged commit db488b7 into main May 8, 2026
darron deleted the ai-model-update branch May 8, 2026 02:56
