Gemini CLI Has Been Hitting “Server Limit Exceeded” Frequently Over the Past Month — I’m Considering Switching to Claude Code / Codex #21075
Replies: 2 comments 2 replies
-
just the past two days for me gemini-cli worked again without any problems and no delays until I exhausted my quota for the used models :) |
Beta Was this translation helpful? Give feedback.
-
|
From my point of view, the product risk here is not just the raw limit error but the fact that the failure mode feels unpredictable during normal work. If capacity is tight, the CLI should probably make a clearer distinction between quota exhaustion, temporary server saturation, and retryable overload, then offer an automatic downgrade or queued retry path. That kind of reliability signaling would do a lot to keep people from feeling they have to switch tools entirely. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I used to really like
google-gemini/gemini-cli(gmmini cli), and I recommended it often. The main reason was simple: it runs on Gemini models, and it fit my development workflow really well.However, over roughly the past month, I’ve gradually stopped using Gemini CLI. Model calls have been frequently failing with errors like “server limit exceeded,” “resource exhausted,” or “rate limited.” From what I can tell, this looks more like a server-side stability or quota/rate-limiting issue than something caused by my local setup.
More importantly, it’s not just the Pro model—Flash runs into the same problem quite often. Conversations and tasks get interrupted, retries keep failing, and it’s become hard to reliably get through everyday work.
Given the current reliability and usability issues, I’m planning to stop using Gemini CLI for now and move to alternatives such as Claude Code or Codex. If the service becomes noticeably more stable (at least consistently usable during normal hours), I’d be happy to give it another try and recommend it again.
Beta Was this translation helpful? Give feedback.
All reactions