Problem Statement
I have two Codex accounts with independent rolling quota windows.
Currently, requests may be distributed across both accounts. This balances usage, but it also drains both quotas at roughly the same rate, so both accounts tend to enter cooldown or become unavailable at around the same time.
For my workflow, a sequential strategy would be more efficient.
Example:
- Start using Account A.
- Continue using Account A until it reaches its quota limit or becomes unavailable.
- Automatically switch to Account B.
- Continue working on Account B while Account A's quota window recovers.
By the time Account B is heavily utilized, Account A may already have recovered enough quota to become available again.
This creates a staggered recovery pattern and can provide longer uninterrupted working sessions compared to consuming quota from both accounts simultaneously.
Proposed Solution
Add an optional scheduling mode such as Sequential / Drain First.
Behavior:
- Select a primary account.
- Route all requests to that account while it remains healthy and available.
- When the account becomes unavailable, delayed, rate limited, or reaches a configurable threshold, automatically switch to the next account.
- Continue through the account pool sequentially.
- Once an earlier account recovers, it becomes eligible again.
This mode should be optional and coexist with the current balancing and affinity strategies.
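The behavior above could be sketched roughly as follows. This is a minimal illustration, not the project's actual scheduler: `Account`, `used_fraction`, `available`, `pick_sequential`, and `switch_threshold` are all hypothetical names, and scanning the pool from the start on every request is just one way to let a recovered account become eligible again.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Account:
    # Hypothetical model of an account; field names are assumptions.
    name: str
    used_fraction: float = 0.0  # share of the current quota window already consumed
    available: bool = True      # False while rate limited, delayed, or unhealthy


def pick_sequential(
    accounts: Sequence[Account], switch_threshold: float = 1.0
) -> Optional[Account]:
    """Return the first account, in pool order, that is still usable.

    An account is skipped when it is unavailable or has reached the
    configurable threshold. Returns None when no account qualifies.
    """
    for account in accounts:
        if account.available and account.used_fraction < switch_threshold:
            return account
    return None


pool = [Account("A"), Account("B")]
first = pick_sequential(pool, switch_threshold=0.9)      # A is healthy, so A is chosen

pool[0].used_fraction = 0.95                             # A crosses the threshold
after_drain = pick_sequential(pool, switch_threshold=0.9)

pool[0].used_fraction = 0.10                             # A's window has recovered
after_recovery = pick_sequential(pool, switch_threshold=0.9)
```

Because the scan always starts at the first account, Account A is preferred again as soon as it recovers. A variant that stays on Account B until B is drained would need to remember the last selected account instead.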
Alternatives Considered
Current workaround:
- Manually pin Account A.
- Use it until quota is exhausted.
- Manually switch to Account B.
- Repeat throughout the day.
This works well but requires manual intervention.
Scope
In Scope
- Sequential account usage.
- Automatic failover between accounts.
- Optional configurable switch thresholds.
- Compatibility with existing forecasting and health checks.
Validation Plan
Using two accounts:
- Enable Sequential / Drain First mode.
- Generate continuous workload.
- Verify that all requests initially use Account A.
- Verify that Account B remains mostly untouched.
- Verify automatic failover to Account B when Account A becomes unavailable.
- Verify that Account A can become available again after its quota window recovers.
Expected result:
Instead of both accounts reaching cooldown at roughly the same time, quota recovery becomes staggered, allowing for longer uninterrupted usage.
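The checks in this plan could be rehearsed against a toy model before running them on real accounts. The sketch below is only an assumption-laden simulation: `FakeAccount`, `run_workload`, the quota of 3 requests, and the 4-tick recovery delay are all made up for illustration, and the recovery model (an account becomes usable a fixed number of ticks after it is exhausted) is a simplification of a real rolling quota window.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class FakeAccount:
    # Hypothetical stand-in for a real account; all values are illustrative.
    name: str
    quota: int                          # requests allowed before exhaustion
    recovery: int                       # ticks until an exhausted account recovers
    used: int = 0
    exhausted_at: Optional[int] = None

    def is_available(self, now: int) -> bool:
        # Lazily reset the account once its recovery delay has elapsed.
        if self.exhausted_at is not None and now - self.exhausted_at >= self.recovery:
            self.used, self.exhausted_at = 0, None
        return self.exhausted_at is None

    def serve(self, now: int) -> None:
        self.used += 1
        if self.used >= self.quota:
            self.exhausted_at = now


def run_workload(pool: List[FakeAccount], ticks: int) -> List[Optional[str]]:
    """Send one request per tick to the first available account in pool order."""
    served: List[Optional[str]] = []
    for now in range(ticks):
        chosen = next((a for a in pool if a.is_available(now)), None)
        if chosen is not None:
            chosen.serve(now)
        served.append(chosen.name if chosen else None)
    return served


log = run_workload(
    [FakeAccount("A", quota=3, recovery=4), FakeAccount("B", quota=3, recovery=4)],
    ticks=10,
)

assert log[:3] == ["A", "A", "A"]       # all requests initially use Account A
assert "B" not in log[:3]               # Account B stays untouched meanwhile
assert log[3:6] == ["B", "B", "B"]      # failover once Account A is exhausted
assert log[6] == "A"                    # Account A is usable again after recovery
assert None not in log                  # no tick was left unserved in this toy run
```

With these made-up numbers the request log alternates in blocks of three between A and B, which is the staggered pattern described above. Real quota windows will behave differently, so the assertions only show the shape of the checks, not expected production numbers.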
Compliance Confirmation