Has the Plus Quota Really Been Cut in Half? #22126
Replies: 3 comments 1 reply
I'm curious about this too, and I hope you get an answer. Also, fast mode is the default now, so check that you don't have /fast enabled.
Adding to @Danny-BW's point about Beyond that, there's another factor: the newer models (GPT 5.x series) consume significantly more tokens per request than the previous generation, even for identical prompts. So you get fewer interactions per quota unit whether you notice it or not.
If you want to stretch your budget further and you're also using the API directly, one approach that has worked well for us is to route each task to the cheapest model that can handle it. Simple formatting, lint fixes, and boilerplate generation don't need the latest model. Architecture decisions and complex debugging do.
We built InferCut to automate this routing. It's a drop-in proxy that picks the right model tier per task. Free tier available, zero risk: if it can't save cost on a request, it passes the request through unchanged.
But yeah, check
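For anyone who wants to try the "cheapest model that can handle it" idea by hand, here is a minimal sketch of the routing decision. It is not how InferCut works (the comment above does not say), and the tier names `cheap-model` and `strong-model`, the `Task` type, and the task kinds are all hypothetical placeholders.

```python
from dataclasses import dataclass

# Hypothetical tier names: substitute the models your account actually offers.
CHEAP_TIER = "cheap-model"
STRONG_TIER = "strong-model"

# Task kinds named above as not needing the latest model (labels are made up).
SIMPLE_TASKS = {"formatting", "lint_fix", "boilerplate"}


@dataclass(frozen=True)
class Task:
    kind: str    # e.g. "lint_fix", "architecture", "debugging"
    prompt: str  # the text that would be sent to the model


def pick_tier(task: Task) -> str:
    """Return the cheap tier for known-simple kinds, else the strong tier."""
    if task.kind in SIMPLE_TASKS:
        return CHEAP_TIER
    # Anything unrecognised falls through to the stronger model on purpose.
    return STRONG_TIER


if __name__ == "__main__":
    for task in (Task("lint_fix", "fix unused imports"),
                 Task("architecture", "split this service in two")):
        print(task.kind, "->", pick_tier(task))
```

Defaulting unknown kinds to the stronger tier trades some cost for safety; a real router would need a better way to classify tasks than a fixed set of labels.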
There is no public doc I can find that confirms “Plus was cut in half” as an announced change. The public docs describe Plus qualitatively, and Pro only relative to Plus: Pro $100 is 5x Plus usage and Pro $200 is 20x Plus usage. They do not publish a fixed raw token quota for Plus that users can independently compare against. What I would check before assuming the quota itself changed:
Fast mode consumes quota faster. Current docs say Fast mode uses credits at: So if Fast is on, it can feel like the quota was cut heavily even if the nominal limit did not change.
Check the active model, rate-limit window, and remaining usage. If possible, capture
If all of those are the same and
The best actionable next step is to send OpenAI support:
So my short answer is: I would not call it confirmed from the public docs, but if Fast is off and the same model/workload is exhausting a fresh window much earlier, it is worth reporting with
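To make the Fast mode point concrete, here is a rough arithmetic sketch of why a higher credit rate feels like a smaller quota. Everything in it is an assumption for illustration: the function names, the credit numbers, and especially the multiplier of 2.0, which is not the documented Fast mode rate (check the current docs for that).

```python
def requests_per_window(window_credits: float,
                        credits_per_request: float,
                        fast_multiplier: float = 1.0) -> float:
    """Requests that fit in one window at a given credit cost per request."""
    if credits_per_request <= 0 or fast_multiplier <= 0:
        raise ValueError("costs and multipliers must be positive")
    return window_credits / (credits_per_request * fast_multiplier)


def apparent_quota_fraction(fast_multiplier: float) -> float:
    """Fraction of the usual request count you get with the multiplier on."""
    if fast_multiplier <= 0:
        raise ValueError("multiplier must be positive")
    return 1.0 / fast_multiplier


if __name__ == "__main__":
    # Illustrative numbers only: a 100-credit window, 2 credits per request.
    normal = requests_per_window(100, 2)
    fast = requests_per_window(100, 2, fast_multiplier=2.0)
    print(normal, fast, apparent_quota_fraction(2.0))
```

With a multiplier of exactly 2, the nominal limit is unchanged but the number of requests per window halves, which would match the "cut in half" impression without any change to the plan itself.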
I’ve noticed that my Plus quota seems to run out way faster than before, and it honestly feels like it’s been cut in half. I used to use it pretty comfortably, but now I’m hitting the limit much more quickly and the overall experience feels noticeably worse. Is anyone else experiencing the same thing, or is it just me?