When OpenAB starts Gemini CLI without an explicit --model argument, Gemini CLI appears to default to a Pro model in practice.
That leads to noticeably longer thinking time and slower response latency for interactive chat usage.
Suggested change:
args = ["--acp", "--model", "gemini-3.1-flash-lite-preview"]
Using a faster Flash-Lite model by default would likely improve responsiveness a lot for Discord-style interactions, while still keeping ACP integration intact.
At minimum, it would be helpful to document the current default behavior and provide a recommended low-latency Gemini configuration in the README or example config.
When OpenAB starts Gemini CLI without an explicit
--modelargument, Gemini CLI appears to default to a Pro model in practice.That leads to noticeably longer thinking time and slower response latency for interactive chat usage.
Suggested change:
Using a faster Flash-Lite model by default would likely improve responsiveness a lot for Discord-style interactions, while still keeping ACP integration intact.
At minimum, it would be helpful to document the current default behavior and provide a recommended low-latency Gemini configuration in the README or example config.