feat: refresh Workers AI model handling by darron · Pull Request #11 · darron/ff-workers

darron · 2026-05-08T02:55:54Z

Replace the deprecated Workers AI default model with @cf/zai-org/glm-4.7-flash and update staging to use the same model.

Add JSON mode, disable model thinking, and pass both token limit fields for ingest and location extraction so newer chat models keep returning parseable JSON within the existing budgets. Also accept OpenAI-compatible choices responses in AI text extraction for summaries, ingest, and location enrichment.

Upgrade Wrangler and the generated lockfile to the current Workers runtime tooling required by the refreshed model/runtime stack.

Behavior change: AI extraction and summary calls now default to GLM 4.7 Flash, so outputs may differ slightly from the previous Llama model. Risk is mainly in JSON extraction quality and token-budget behavior; staging should be smoke tested before production rollout. Follow-up: watch ingest/location proposal quality and add model-specific regression fixtures if output drift appears.

Replace the deprecated Workers AI default model with @cf/zai-org/glm-4.7-flash and update staging to use the same model. Add JSON mode, disable model thinking, and pass both token limit fields for ingest and location extraction so newer chat models keep returning parseable JSON within the existing budgets. Also accept OpenAI-compatible `choices` responses in AI text extraction for summaries, ingest, and location enrichment. Upgrade Wrangler and the generated lockfile to the current Workers runtime tooling required by the refreshed model/runtime stack. Behavior change: AI extraction and summary calls now default to GLM 4.7 Flash, so outputs may differ slightly from the previous Llama model. Risk is mainly in JSON extraction quality and token-budget behavior; staging should be smoke tested before production rollout. Follow-up: watch ingest/location proposal quality and add model-specific regression fixtures if output drift appears.

darron self-assigned this May 8, 2026

darron added the enhancement New feature or request label May 8, 2026

darron merged commit db488b7 into main May 8, 2026

darron deleted the ai-model-update branch May 8, 2026 02:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: refresh Workers AI model handling#11

feat: refresh Workers AI model handling#11
darron merged 1 commit into
mainfrom
ai-model-update

darron commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

darron commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant