Tokenr Python SDK

Automatic LLM cost tracking in one line of code.

Track costs from OpenAI, Anthropic, and other LLM providers with zero code changes. Get real-time visibility into spending by agent, feature, team, or any dimension you need.

Features

One-line setup — Add tokenr.init() and you're done
Zero code changes — Use OpenAI/Anthropic SDK as normal
Automatic tracking — Token counts and costs tracked automatically
Async by default — Never slows down your app
Multi-provider — OpenAI, Anthropic (more coming)
Rich attribution — Track by agent, feature, team, or custom tags
Production-ready — Handles errors gracefully, never crashes your app

Installation

pip install tokenr

Quickstart

OpenAI

import tokenr
import openai

# One line to enable tracking
tokenr.init(token="your-tokenr-token")

# Use OpenAI exactly as you normally would
response = openai.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)

# That's it — costs are automatically tracked to Tokenr

Anthropic

import tokenr
from anthropic import Anthropic

tokenr.init(token="your-tokenr-token")

client = Anthropic(api_key="your-anthropic-key")
response = client.messages.create(
    model="claude-opus-4-5",
    messages=[{"role": "user", "content": "Hello!"}]
)

# Automatically tracked!

Configuration

Environment Variables

export TOKENR_TOKEN="your-token"

import tokenr
tokenr.init()  # Reads from TOKENR_TOKEN

All Options

tokenr.init(
    token="your-token",           # API token (or TOKENR_TOKEN env var)
    url="https://tokenr.co/...",  # Override API URL (optional)
    agent_id="my-app",            # Default agent ID for all requests
    tags={"environment": "prod"}, # Default tags applied to every request
    enabled=True,                 # Set False to disable tracking entirely
    debug=False,                  # Print tracking info to console
)

Disable in Development

import os
tokenr.init(
    token=os.getenv("TOKENR_TOKEN"),
    enabled=os.getenv("ENV") == "production"
)

Advanced Usage

Track by Agent

# Option 1: Set a default agent ID at init
tokenr.init(token="...", agent_id="customer-support-bot")

# Option 2: Override per request
response = openai.chat.completions.create(
    model="gpt-4",
    messages=[...],
    tokenr_agent_id="sales-assistant"
)

Track by Feature

response = openai.chat.completions.create(
    model="gpt-4",
    messages=[...],
    tokenr_feature="chat",
    tokenr_agent_id="support-bot"
)

Multi-Tenant Tracking

response = openai.chat.completions.create(
    model="gpt-4",
    messages=[...],
    tokenr_team_id="team-123",
    tokenr_agent_id="shared-bot"
)

Custom Tags

response = openai.chat.completions.create(
    model="gpt-4",
    messages=[...],
    tokenr_tags={
        "customer_id": "cust_123",
        "conversation_id": "conv_456",
        "language": "en",
        "priority": "high"
    }
)

Manual Tracking

For providers not yet auto-patched, or when you need full control:

import tokenr

tokenr.track(
    provider="custom-provider",
    model="custom-model-v1",
    input_tokens=1000,
    output_tokens=500,
    agent_id="my-agent",
    feature_name="translation",
    tags={"language": "es"}
)

Per-Request Parameters

Add any of these to OpenAI or Anthropic calls:

Parameter	Type	Description
`tokenr_agent_id`	`str`	Agent identifier
`tokenr_feature`	`str`	Feature name
`tokenr_team_id`	`str`	Team or customer ID
`tokenr_tags`	`dict`	Arbitrary metadata

`tokenr.track()` Reference

tokenr.track(
    provider="openai",        # required
    model="gpt-4",            # required
    input_tokens=100,         # required
    output_tokens=50,         # required
    cache_read_tokens=0,      # tokens served from prompt cache
    cache_write_tokens=0,     # tokens written to prompt cache
    agent_id=None,
    feature_name=None,
    team_id=None,
    status="success",         # "success" or "error"
    latency_ms=None,
    tags=None,
    requested_at=None,        # ISO 8601 timestamp; defaults to now
)

Supported Providers

Provider	Auto-Tracking	Manual Tracking
OpenAI	Yes	Yes
Anthropic	Yes	Yes
Google AI	Coming soon	Yes
Cohere	Coming soon	Yes
Custom	—	Yes

Prompt Caching

Both OpenAI and Anthropic support prompt caching, and the SDK handles it automatically. You don't need to do anything special.

OpenAI includes cached tokens inside prompt_tokens. The SDK reads prompt_tokens_details.cached_tokens and separates them so Tokenr can price each category at the correct rate.

Anthropic reports cache tokens as separate fields (cache_creation_input_tokens and cache_read_input_tokens). The SDK passes these through directly.

In both cases, the Tokenr server applies the right per-token rate for input, output, cache reads, and cache writes.

For manual tracking, you can pass cache tokens explicitly:

tokenr.track(
    provider="anthropic",
    model="claude-sonnet-4-20250514",
    input_tokens=500,
    output_tokens=200,
    cache_read_tokens=8000,
    cache_write_tokens=2000,
)

How It Works

tokenr.init() wraps the create method on OpenAI and Anthropic clients
After each call, token counts are read from the response usage field
A background daemon thread sends the data to Tokenr asynchronously
If tracking fails for any reason, your app is completely unaffected

Examples

See the examples/ directory:

basic_openai.py — Simple OpenAI setup
multi_agent.py — Multiple agents, multiple providers
saas_multitenant.py — Per-team cost attribution

Getting Your API Token

Sign up at tokenr.co
Go to API Tokens and create a token
Copy it — it's shown only once

export TOKENR_TOKEN="your-token-here"

Troubleshooting

No data showing up?

tokenr.init(token="your-token", debug=True)
# Prints: [Tokenr] Tracked: gpt-4 - $0.0012

Verify echo $TOKENR_TOKEN returns your token
Confirm you're making actual LLM API calls
Check network access to tokenr.co

Errors? The SDK never raises — it always fails silently. Use debug=True to see what's happening.

Running Tests

# Unit and mock tests (no API keys needed)
python -m unittest discover tests -v

# Live integration tests (makes real API calls, costs fractions of a cent)
OPENAI_API_KEY=sk-... ANTHROPIC_API_KEY=sk-ant-... python -m unittest tests.test_live_integration -v

The live tests make a real call to each provider, then verify that the token counts in the Tokenr payload match what the provider actually returned. This includes a test that triggers Anthropic prompt caching and confirms cache tokens are extracted correctly.

Security

This SDK is open source so you can audit exactly what data is sent and when. The short version:

Only token counts, model names, and your attribution metadata are transmitted
No prompt content or response content ever leaves your application
All requests use HTTPS
The tracker runs in a daemon thread and cannot block your process

License

MIT — see LICENSE

Support

Issues: github.com/tokenr-co/tokenr-python/issues
Email: support@tokenr.co

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
examples		examples
tests		tests
tokenr		tokenr
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
QUICKSTART.md		QUICKSTART.md
README.md		README.md
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tokenr Python SDK

Features

Installation

Quickstart

OpenAI

Anthropic

Configuration

Environment Variables

All Options

Disable in Development

Advanced Usage

Track by Agent

Track by Feature

Multi-Tenant Tracking

Custom Tags

Manual Tracking

Per-Request Parameters

`tokenr.track()` Reference

Supported Providers

Prompt Caching

How It Works

Examples

Getting Your API Token

Troubleshooting

Running Tests

Security

License

Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Tokenr Python SDK

Features

Installation

Quickstart

OpenAI

Anthropic

Configuration

Environment Variables

All Options

Disable in Development

Advanced Usage

Track by Agent

Track by Feature

Multi-Tenant Tracking

Custom Tags

Manual Tracking

Per-Request Parameters

tokenr.track() Reference

Supported Providers

Prompt Caching

How It Works

Examples

Getting Your API Token

Troubleshooting

Running Tests

Security

License

Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`tokenr.track()` Reference

Packages