[BUG] Wrong model keeps getting called (gpt-5.5-medium instead of gpt-5.5-xhigh) #2829
-
|
Codex (paid subscription). Things I checked so far:
Would appreciate your help. Thanks! |
Beta Was this translation helpful? Give feedback.
Replies: 17 comments 4 replies
-
|
Hey @KernelBypass! This is the same issue @iHanatos hit in #2762 -- the short answer is that OmniRoute reads 1. Switch your 2. Verify in Why the toml path is flaky: the Codex CLI sends Also double-check: in |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
|
Weird, reloaded windows with this in global toml: and got: |
Beta Was this translation helpful? Give feedback.
-
|
log snippet: |
Beta Was this translation helpful? Give feedback.
-
|
Confirmed @KernelBypass -- this is a real bug, not your config. Your log is the smoking gun: 1. The 2. Bare Workaround that works today -- fully qualify with the provider prefix: model = "codex/gpt-5.5-xhigh"
model_provider = "omniroute"The I've opened #2877 to fix both defects properly (an auto-injected default must never demote a higher explicit effort, and the suffixed codex variants need provider inference). Will ping here when the patch lands. Thanks for the detailed log -- it pinpointed the exact line. |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
|
The bug is even worse. which worked before once.
|
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
|
Even forked chats route to medium even with: So, this ^^ only worked once for a brand new chat only. |
Beta Was this translation helpful? Give feedback.
-
|
Thanks for the extra logs @KernelBypass -- they actually sharpen the diagnosis, and I owe you a straighter answer plus an honest ETA. Where the Then OmniRoute makes it stick. Once a request arrives as The fix (#2877): when a request carries both a suffix effort and a body Mitigation that actually holds today: since the wire model = the CLI picker selection (not config.toml) for any existing/forked session, set Your other questions:
|
Beta Was this translation helpful? Give feedback.
-
|
Thanks for your patience on this one, @KernelBypass -- nothing more you need to do on your end. I'll ping this thread the moment #2877 lands (the max-of-suffix-vs-body-effort fix). Appreciate the thorough logs that pinned it down. |
Beta Was this translation helpful? Give feedback.
-
|
I'm afraid the bug is still present, and there is now an additional regression: it uses the wrong accounts. I found this seemingly identical issue in: decolua/9router#1459 I updated to 3.8.8, but this:
However, when I use the "test combo" button:
...it correctly gets routed to 5.5-xhigh.
This also works for xhigh effort: ...but still mixes up the accounts: Thanks |
Beta Was this translation helpful? Give feedback.
-
|
Updated to 3.8.10 - the issue is still there. |
Beta Was this translation helpful? Give feedback.
-
|
Thanks for staying on this @KernelBypass -- and I owe you a straight correction first. I told you I'd ping when "#2877 (the max-of-suffix-vs-body-effort fix)" landed. #2877 did land (v3.8.8), but what it actually shipped was suffix-wins routing, not the max-of behavior I described -- which is the opposite of what your Codex-CLI case needs. So your effort bug is genuinely still open, and there's now a second, separate one. I've reproduced both against the current code. Splitting them: 1. Effort still demoted to medium -- real, root cause pinnedWhen you send bare The correct fix is a max-of at Workaround that works on your build today -- fully qualify with both prefix and suffix: model = "codex/gpt-5.5-xhigh"
model_provider = "omniroute"You already confirmed this carries 2. Wrong accounts being called -- separate bug, also reproducedThis one is real and independent of effort. I traced it: a combo target pins an account via a single The "Test combo" button works precisely because it takes a different path: it injects Workaround today: pin each account explicitly -- make foo1 and foo2 their own steps with the connection set on each, so the round-robin only cycles those two. If the UI isn't letting you set a per-step connection (so the round-robin can't be scoped to a subset), that's the gap, and I'm tracking a proper per-combo account allowlist in #3266. On the upstream 9router#1459 linkThe effort half rhymes with upstream, but the specific chain here (the I'll ping this thread when each fix lands. Sorry for the earlier mischaracterization of #2877 -- your logs were right the whole way. |
Beta Was this translation helpful? Give feedback.
-
|
Correction, @KernelBypass -- I need to walk back my previous comment. I traced this properly afterward and my effort diagnosis above was wrong. Apologies; here's what's actually true, verified against the current code (release/v3.8.12). Effort: OmniRoute does not rewrite
|
Beta Was this translation helpful? Give feedback.
-
|
Hey @diegosouzapw , thanks for continuing to hunt these pesky bugs, I appreciate your effort!
I responded to you in: ...with screenshots -- it's not honoring my combo accounts even with them pinned. It's not even selecting my only active combo. |
Beta Was this translation helpful? Give feedback.
-
But in my case it does NOT honor that as xhigh. OmniRoute does NOT honor this:
Yes, this works, but it causes this:
I understand what you are saying that it's a Codex client issue. Would it be possible to have some sort of compatibility shim for Codex to display them accurately like https://github.com/Soju06/codex-lb/ does? The incorrect button is jarring and makes user second-guess if the model is actually correct. I'm using codex-lb for the time being, and it honors: ...and displays it correctly in the selector button, and routes to correct accounts, but it doesn't have a per-account HTTP proxy functionality (and also I might want to add other non-Codex providers later, which they don't support). Again, thank you very much for your work! |
Beta Was this translation helpful? Give feedback.








Thanks for your patience on this one, @KernelBypass -- nothing more you need to do on your end. I'll ping this thread the moment #2877 lands (the max-of-suffix-vs-body-effort fix). Appreciate the thorough logs that pinned it down.