ai-limits

A CLI and TypeScript SDK to check usage limits and quotas across multiple AI coding assistants from one place.

It reuses the credentials that the official tools already store on your machine, so for most providers there is nothing to configure. Run one command and see how much of your plan is left and when it resets.

Provider: CLAUDE
Overall Usage: ████████░░ 78%
Next Reset:    in 2h 14m

┌──────────────────────────────┬────────────────────┬────────────────────┐
│ Model/Bucket                 │ Usage              │ Reset Time         │
├──────────────────────────────┼────────────────────┼────────────────────┤
│ 5-hour window                │ ████████░░ 78%     │ in 2h 14m          │
│ 7-day window                 │ ███░░░░░░░ 31%      │ in 5d 3h           │
└──────────────────────────────┴────────────────────┴────────────────────┘

Supported providers

Provider	Source	Where credentials come from
Claude	Anthropic / Claude Code CLI	macOS Keychain or `~/.claude/.credentials.json`
ChatGPT / Codex	ChatGPT backend API	`~/.codex/auth.json`
Gemini	Google Cloud Code Assist	`~/.gemini/oauth_creds.json`
Antigravity	Google Cloud Code Assist	OAuth login built into this CLI
MiniMax	MiniMax OpenPlatform API	`MINIMAX_API_KEY` environment variable
OpenRouter	OpenRouter API key endpoint	`OPENROUTER_API_KEY` environment variable

For Claude, ChatGPT and Gemini the credentials are created by the providers' own CLIs and IDE plugins. If those tools already work on your machine, this one works too. Antigravity is the only provider that needs an explicit login through this CLI.

OpenRouter has no plan-based quota. Instead it reports the per-key spend limit (e.g. $3 / month): if the key has a limit set, you get an overall usage bar and reset time; if the key is unlimited, the spend is shown for information only.

Installation

Install globally to use the CLI anywhere:

npm install -g @lenadweb/ai-limits

Or add it to a project to use the SDK:

npm install @lenadweb/ai-limits

Requires Node.js 18 or newer.

CLI usage

Show usage

# All providers at once
ai-limits show

# A single provider
ai-limits show claude
ai-limits show chatgpt
ai-limits show gemini
ai-limits show minimax
ai-limits show openrouter
ai-limits show antigravity

Each provider is printed with an overall usage bar, the next reset time, and a per-model or per-window breakdown when the provider exposes one.

Antigravity login

Antigravity uses Google OAuth. Authenticate once and the tokens are cached locally:

# Open the browser and complete the OAuth flow
ai-limits login antigravity

# Remove the cached tokens
ai-limits logout antigravity

SDK usage

import { LimitsClient } from "@lenadweb/ai-limits";

const client = new LimitsClient();

// Usage for every provider, keyed by provider name
const all = await client.fetchAllUsage();
console.log(all.claude.overallUsagePercent);

// Usage for a single provider
const claude = await client.fetchUsage("claude");
console.log(claude.overallUsagePercent, claude.overallResetTime);

Available methods

Method	Returns	Description
`fetchUsage(provider)`	`StandardUsageResult`	Normalized usage for one provider.
`fetchAllUsage()`	`Record<Provider, StandardUsageResult>`	Normalized usage for every provider in parallel.
`fetchSummary(provider)`	`UsageSummary`	Compact status with flags and a ready to print line.
`fetchAllSummaries()`	`Record<Provider, UsageSummary>`	Summaries for every provider.
`fetchRawUsage(provider)`	`any`	The provider's raw API response, unmodified.
`fetchAllRawUsage()`	`Record<Provider, any>`	Raw responses for every provider, errors captured per provider.
`getProvider(name)`	`BaseProvider`	The underlying provider instance, for example to call `login()` on Antigravity.

Response shapes

fetchUsage returns a normalized result that is the same for every provider:

interface StandardUsageResult {
  provider: string;
  overallUsagePercent: number | null;
  overallResetTime: string | null; // ISO timestamp
  perModel?: Record<string, {
    usagePercent: number | null; // null for informational rows (e.g. OpenRouter spend)
    remainingAmount?: number;
    limitAmount?: number;
    resetTime?: string | null;
    displayName?: string;
  }>;
  error?: { code: "AUTH" | "API" | "CONN" | number; message: string };
}

fetchSummary returns a smaller object that is handy for status bars and alerts:

interface UsageSummary {
  provider: string;
  overallUsagePercent: number | null;
  overallResetTime: string | null;
  isExhausted: boolean;
  isRateLimited: boolean;
  needsAuthentication: boolean;
  formattedText: string;
}

When a provider fails, fetchUsage resolves with the error field set instead of throwing, so a single broken provider never breaks the whole batch.

Typed accessors per provider

Instead of iterating perModel with string keys, each provider exposes named, typed methods for its specific windows. Get the typed instance with getProvider<T>(name). All accessors share the same cached fetch, so calling several in a row makes a single request.

import { LimitsClient, ProviderName, ClaudeProvider, OpenRouterProvider } from "@lenadweb/ai-limits";

const client = new LimitsClient();

const claude = client.getProvider<ClaudeProvider>(ProviderName.Claude);
await claude.getFiveHourUsage();   // ModelUsage | null
await claude.getSevenDayUsage();   // ModelUsage | null

const or = client.getProvider<OpenRouterProvider>(ProviderName.OpenRouter);
await or.getLimit();         // OpenRouterLimit | null — { amount, interval, used, remaining, usagePercent, resetTime }
await or.getMonthlySpend();  // number | null
await or.fetchDetails();     // OpenRouterUsage — structured limit + spend

Provider	Methods
Claude	`getFiveHourUsage()`, `getSevenDayUsage()`, `getSonnetWeeklyUsage()`
ChatGPT	`getPrimaryWindow()`, `getSecondaryWindow()`
MiniMax	`getDailyUsage()`, `getWeeklyUsage()`
Gemini	`getModelUsage(modelId)`, `getModels()`
Antigravity	`getModelUsage(modelId)`, `getModels()`
OpenRouter	`getLimit()`, `getTotalSpend()`, `getDailySpend()`, `getWeeklySpend()`, `getMonthlySpend()`, `fetchDetails()`

Window accessors return ModelUsage | null (null when that window is absent). Every provider also inherits listBuckets() to discover the raw bucket keys.

Caching

Each provider caches its normalized usage internally, so several accessor calls in a row (for example getFiveHourUsage() then getSevenDayUsage()) make a single network request. The default TTL is 30 seconds.

Control it through config. Set cacheTtlMs globally or per provider; 0 disables caching entirely. A per-provider value overrides the global one.

// Global default for every provider
const client = new LimitsClient({ cacheTtlMs: 10000 });

// Disable globally, but keep a 60s cache for OpenRouter only
const client2 = new LimitsClient({
  cacheTtlMs: 0,
  openrouter: { apiKey: process.env.OPENROUTER_API_KEY, cacheTtlMs: 60000 },
});

// Force a fresh fetch on the next call
client.getProvider(ProviderName.Claude).clearCache();

Custom configuration

Every provider accepts overrides, which is useful for non standard credential locations, custom OAuth clients, or passing a key directly:

import { LimitsClient } from "@lenadweb/ai-limits";

const client = new LimitsClient({
  antigravity: {
    tokenPath: "/custom/path/antigravity_oauth.json",
    clientId: process.env.ANTIGRAVITY_CLIENT_ID,
    clientSecret: process.env.ANTIGRAVITY_CLIENT_SECRET,
  },
  claude: {
    credentialsPath: "/custom/path/.credentials.json",
    useKeychain: false,
  },
  chatgpt: {
    authPath: "/custom/path/auth.json",
  },
  gemini: {
    credentialsPath: "/custom/path/oauth_creds.json",
    projectId: "your-gcp-project",
  },
  minimax: {
    apiKey: process.env.MINIMAX_API_KEY,
  },
  openrouter: {
    apiKey: process.env.OPENROUTER_API_KEY,
  },
});

How it reads credentials

This tool never asks for your passwords and never sends your tokens anywhere except to the matching provider's official API.

Claude: reads the token from the macOS Keychain entry Claude Code-credentials, or from ~/.claude/.credentials.json. Set useKeychain: false to force the file.
ChatGPT / Codex: reads the access token and account id from ~/.codex/auth.json.
Gemini: reads Google OAuth credentials from ~/.gemini/oauth_creds.json.
Antigravity: runs a local OAuth flow and caches tokens in ~/.limits-streamdeck/antigravity_oauth.json. Tokens are refreshed automatically.
MiniMax: uses the MINIMAX_API_KEY environment variable, or the apiKey option.
OpenRouter: uses the OPENROUTER_API_KEY environment variable, or the apiKey option. Calls GET /api/v1/key to read the key's spend limit and usage.

For Gemini and Antigravity the package uses the same public OAuth client identifiers that the official Google CLIs ship with. These are public desktop clients protected by PKCE, not private secrets. You can swap in your own client through the configuration options above.

Development

npm install
npm run build      # bundle with tsup
npm run dev        # rebuild on change

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ai-limits

Supported providers

Installation

CLI usage

Show usage

Antigravity login

SDK usage

Available methods

Response shapes

Typed accessors per provider

Caching

Custom configuration

How it reads credentials

Development

License

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ai-limits

Supported providers

Installation

CLI usage

Show usage

Antigravity login

SDK usage

Available methods

Response shapes

Typed accessors per provider

Caching

Custom configuration

How it reads credentials

Development

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages