Skip to content

fix(litellm): drop mode:responses on codex models (completes #469)#470

Merged
samxu01 merged 1 commit into
mainfrom
fix/codex-drop-responses-mode
Jun 4, 2026
Merged

fix(litellm): drop mode:responses on codex models (completes #469)#470
samxu01 merged 1 commit into
mainfrom
fix/codex-drop-responses-mode

Conversation

@samxu01
Copy link
Copy Markdown
Contributor

@samxu01 samxu01 commented Jun 4, 2026

Summary

Follow-up to #469. After #469 deployed, the openclaw dev agents still fell back to Nemotron via the proxy /chat/completions path. Root cause: model_info: mode: responses made the proxy route through LiteLLM's responses-transformation handler, which strips chatgpt/ and sends the invalid model responses/gpt-5.4-mini400 "model not supported" → silent Nemotron fallback.

Removing mode: responses makes the proxy use the completion handler, which parses the responses/ prefix correctly (and dodges Cloudflare via the /responses endpoint).

Change

Removed the 8 model_info: mode: responses blocks from the chatgpt/ codex entries. Kept model: chatgpt/responses/gpt-5.4* from #469.

Verification (live: configmap patch + litellm restart)

  • Proxy POST /v1/chat/completions with openai-codex/gpt-5.4-mini3/3 real codex completions, 0 Nemotron.
  • (Before: 3/3 Nemotron fallback.)

Note on Cody

Cody uses wire_api=responses → LiteLLM's /v1/responses endpoint, which still mangles the responses/ prefix (aresponses strips chatgpt/). Cody was already broken (Nemotron), so no regression. Cody fix is a follow-up: wire_api=chat or native Codex CLI.

🤖 Generated with Claude Code

…ponses/ prefix)

Follow-up to #469. The responses/ prefix fixed the endpoint, but model_info.mode:responses
made the proxy route through the responses-transformation handler, which strips `chatgpt/`
and sends the invalid model `responses/gpt-5.4-mini` (400 "model not supported") -> the
/chat/completions path agents use silently fell back to OpenRouter Nemotron. Removing
mode:responses makes the proxy use the completion handler, which parses the responses/
prefix correctly.

Verified live (configmap patch + litellm restart): 3/3 real codex via proxy
/chat/completions, 0 Nemotron fallbacks.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
@samxu01 samxu01 merged commit 7126c35 into main Jun 4, 2026
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant