Tokenly

Personal AI token usage tracking across all your platforms

Overview

Tokenly is a distributed system for collecting and analyzing AI token usage across your entire infrastructure. Whether you're running OpenClaw agents, using Azure OpenAI, calling Anthropic APIs, or relying on any other LLM service, Tokenly gives you unified visibility into your AI spending.

The Problem

AI costs can spiral quickly when you have:

Multiple AI agents running 24/7
Different applications calling various LLM APIs
Services spread across different platforms and providers
No central view of token consumption and costs

Without tracking, you discover expensive usage patterns too late.

The Solution

Tokenly automatically discovers and collects token usage data from JSONL log files across your systems, giving you:

Unified Dashboard - All AI spending in one place
Cost Trending - Daily, weekly, monthly burn rates
Service Breakdown - Which models and providers cost the most
Platform Analysis - Which applications are driving costs
Budget Monitoring - Alerts when spending exceeds thresholds (planned for Phase 3)

Architecture

Distributed Collection

┌───────────────┐   ┌───────────────┐   ┌───────────────┐   ┌───────────────┐   ┌───────────────┐
│    Laptop     │   │   Office PC   │   │   Cloud VM    │   │      VPS      │   │   Raspberry   │
│   (MacBook)   │   │   (Windows)   │   │    (Azure)    │   │    (Linux)    │   │      Pi       │
│ ┌─launcher──┐ │   │ ┌─launcher──┐ │   │ ┌─launcher──┐ │   │ ┌─launcher──┐ │   │ ┌─launcher──┐ │
│ │  ┌worker┐ │ │   │ │  ┌worker┐ │ │   │ │  ┌worker┐ │ │   │ │  ┌worker┐ │ │   │ │  ┌worker┐ │ │
│ │  └──────┘ │ │   │ │  └──────┘ │ │   │ │  └──────┘ │ │   │ │  └──────┘ │ │   │ │  └──────┘ │ │
│ └───────────┘ │   │ └───────────┘ │   │ └───────────┘ │   │ └───────────┘ │   │ └───────────┘ │
└───────────────┘   └───────────────┘   └───────────────┘   └───────────────┘   └───────────────┘
        │                   │                   │                   │                   │
        └───────────────────┴───────────────────┼───────────────────┴───────────────────┘
                                                │
                                       ┌─────────────────┐
                                       │     Tokenly     │
                                       │     Server      │
                                       └─────────────────┘

Technology Stack

Layer	Technology
Backend	Azure Functions v4, TypeScript, Node.js 22
Frontend	React 19, Vite 7, Tailwind CSS 4, Chart.js
Client	Go 1.24+, cross-platform binaries
Hosting	Azure Static Web Apps (CDN + managed API)
Storage	In-memory (dev) / Azure Table Storage (prod)
Auth	JWT with httpOnly refresh cookies, bcrypt

Client Components

Launcher (Small, stable service)

Manages worker lifecycle via state-file IPC
Handles heartbeats and server communication
Exponential backoff on connection failures
Cross-platform: Windows, macOS, Linux (x64 + arm64)
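
The exponential backoff mentioned above can be sketched as follows. This is a minimal illustration in TypeScript (the launcher itself is written in Go); the option names, the concrete delays, and the optional jitter are assumptions made for this example, not values taken from the Tokenly client.

```typescript
// Hypothetical parameters: the real launcher's values are not documented here.
interface BackoffOptions {
  baseMs: number; // delay before the first retry
  factor: number; // multiplier applied after each failure
  maxMs: number;  // upper bound on any single delay
}

// Delay before retry number `attempt` (0-based), capped at maxMs.
function backoffDelay(attempt: number, opts: BackoffOptions): number {
  const raw = opts.baseMs * Math.pow(opts.factor, attempt);
  return Math.min(raw, opts.maxMs);
}

// Optional "full jitter" (an addition for this sketch): pick a random delay
// in [0, backoffDelay) so many clients do not retry in lockstep.
function jitteredDelay(
  attempt: number,
  opts: BackoffOptions,
  random: () => number = Math.random,
): number {
  return Math.floor(random() * backoffDelay(attempt, opts));
}

const heartbeatBackoff: BackoffOptions = { baseMs: 1000, factor: 2, maxMs: 60000 };
for (let attempt = 0; attempt < 8; attempt++) {
  console.log(`attempt ${attempt}: wait up to ${backoffDelay(attempt, heartbeatBackoff)} ms`);
}
```

With these example settings the delay doubles from 1 second and stops growing once it reaches the 60 second cap.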

Worker (The detective engine)

Smart discovery of JSONL files using glob patterns
Validates files (a file is accepted only if at least 50% of its records are valid)
Concurrent uploads with configurable parallelism
Learning engine tracks directory success rates
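
The validation threshold could look roughly like the sketch below. It is written in TypeScript for illustration only (the worker is a Go program), and which fields make a record "valid" is an assumption here: the sketch requires `timestamp`, `service`, and `model` to be strings, following the sample records in the Token Usage Format section.

```typescript
// Assumption: a record is valid when it is a JSON object with string
// `timestamp`, `service`, and `model` fields.
function isValidRecord(line: string): boolean {
  try {
    const rec: unknown = JSON.parse(line);
    if (typeof rec !== "object" || rec === null) return false;
    const r = rec as { [key: string]: unknown };
    return (
      typeof r["timestamp"] === "string" &&
      typeof r["service"] === "string" &&
      typeof r["model"] === "string"
    );
  } catch {
    return false; // the line is not JSON at all
  }
}

// True when at least `threshold` of the non-empty lines are valid records.
function passesValidation(content: string, threshold: number = 0.5): boolean {
  const lines = content.split(/\r?\n/).filter((l) => l.trim() !== "");
  if (lines.length === 0) return false; // an empty file has nothing to upload
  const valid = lines.filter(isValidRecord).length;
  return valid / lines.length >= threshold;
}
```

A ratio check like this lets the worker skip log files that merely happen to contain a few JSON lines, while tolerating some corrupt lines in genuine usage logs.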

Server Components

Core API - 12 Azure Functions endpoints

/api/heartbeat - Client registration and approval
/api/ingest - Token usage data collection (JSON + multipart)
/api/auth/* - Authentication (login, refresh, logout)
/api/manage/* - Client management, users, config, audit, analytics

Storage Plugins (dependency-injected, swappable)

Admin Storage - Client registry, users, config, audit logs
- In-memory backend (development)
- Azure Table Storage backend (production)
Token Storage - Usage records, analytics, time-series
- In-memory backend with deduplication and trend analysis
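
To illustrate what a swappable storage plugin with deduplication might look like, here is a minimal TypeScript sketch. The interface name `TokenStore`, its methods, and the deduplication key are all hypothetical; the project's actual contracts live in `api/src/interfaces/` and may differ.

```typescript
// Hypothetical record shape, based on the sample JSONL records.
interface UsageEntry {
  timestamp: string;
  service: string;
  model: string;
  input_tokens: number;
  output_tokens: number;
  cost_usd: number;
}

// Hypothetical plugin contract: callers depend on this, not on a backend.
interface TokenStore {
  save(records: UsageEntry[]): number; // returns how many records were new
  totalCostUsd(): number;
}

class InMemoryTokenStore implements TokenStore {
  private records: UsageEntry[] = [];
  private seen: { [key: string]: true } = {};

  // Assumption: records with identical fields describe the same event.
  private static keyOf(r: UsageEntry): string {
    return [r.timestamp, r.service, r.model, r.input_tokens, r.output_tokens, r.cost_usd].join("|");
  }

  save(records: UsageEntry[]): number {
    let added = 0;
    for (const r of records) {
      const key = InMemoryTokenStore.keyOf(r);
      if (this.seen[key] === true) continue; // duplicate upload, skip it
      this.seen[key] = true;
      this.records.push(r);
      added++;
    }
    return added;
  }

  totalCostUsd(): number {
    return this.records.reduce((sum, r) => sum + r.cost_usd, 0);
  }
}

// Works with any backend that satisfies the interface.
function ingest(store: TokenStore, batch: UsageEntry[]): string {
  return `stored ${store.save(batch)} of ${batch.length} records`;
}
```

Because `ingest` only sees the interface, a persistent backend could replace the in-memory one without touching the calling code.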

Admin Interface - React SPA with 7 pages

Dashboard with metric cards and usage trend charts
Client management with approve/reject workflow
Analytics with cost trends, top services/models, breakdowns
Configuration, user management, and audit trail

Client Workflow

Registration - Client sends heartbeat, waits for server approval
Discovery - Smart scanning of common log locations:
- Linux: /var/log/*, /opt/*/logs/, /home/*/logs/
- Windows: %APPDATA%/logs/, %PROGRAMDATA%/logs/
Collection - Upload JSONL files older than 24 hours
Cleanup - Delete uploaded files, remove empty directories
Learning - Remember successful locations for efficient future scans
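
The collection step's age filter can be sketched as a pure function. This TypeScript example is illustrative only: measuring age by last-modified time and matching on the `.jsonl` extension are assumptions, and `LogFileInfo` is a hypothetical type.

```typescript
// Hypothetical shape: the real worker is written in Go and may track other metadata.
interface LogFileInfo {
  path: string;
  modifiedAtMs: number; // last-modified time in epoch milliseconds
}

const MIN_AGE_MS = 24 * 60 * 60 * 1000; // 24 hours

// Keeps only .jsonl files last modified more than minAgeMs before nowMs.
function selectForUpload(
  files: LogFileInfo[],
  nowMs: number,
  minAgeMs: number = MIN_AGE_MS,
): LogFileInfo[] {
  return files.filter(
    (f) => /\.jsonl$/i.test(f.path) && nowMs - f.modifiedAtMs > minAgeMs,
  );
}

const scanTimeMs = Date.parse("2026-02-10T12:00:00Z");
const candidates: LogFileInfo[] = [
  { path: "/var/log/app1/usage.jsonl", modifiedAtMs: Date.parse("2026-02-09T09:39:00Z") },
  { path: "/var/log/app1/today.jsonl", modifiedAtMs: Date.parse("2026-02-10T11:00:00Z") },
];
console.log(selectForUpload(candidates, scanTimeMs).map((f) => f.path));
```

Waiting a day before uploading avoids collecting, and then deleting, a file that an application is still appending to. In the example only the first file is old enough to be selected.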

Token Usage Format

Expected JSONL format for token usage logs:

{"timestamp": "2026-02-09T09:38:00Z", "service": "anthropic", "model": "claude-sonnet-4", "input_tokens": 1205, "output_tokens": 847, "cost_usd": 0.0234, "session_id": "abc123"}
{"timestamp": "2026-02-09T09:39:00Z", "service": "azure-openai", "model": "gpt-4", "input_tokens": 892, "output_tokens": 445, "cost_usd": 0.0156, "application": "chatbot"}

Key Features

Zero Configuration

Clients require no manual setup
All configuration delivered via server heartbeat responses
Automatic discovery of log locations
Server-side approval workflow

Secure & Controlled

JWT authentication with httpOnly refresh cookies
bcrypt password hashing, permission-based access control
Clients must be approved before collecting data
Full audit trail of all admin actions

Smart Discovery

Platform-aware log location scanning
Learning algorithm remembers successful paths with weighted scoring
Heuristic expansion (if logs are found in /var/log/app1/, sibling directories such as /var/log/app2/ are checked too)
Efficient scanning with adaptive intervals
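
One plausible way to implement weighted scoring of scan locations is an exponential moving average over scan outcomes. The sketch below is an assumption about how such a learning engine could work, not a description of Tokenly's actual algorithm; `DirStats`, `updateScore`, and the smoothing weight `alpha` are hypothetical.

```typescript
interface DirStats {
  path: string;
  score: number; // 0..1, higher means recent scans here found usable files
}

// Blends the previous score with the latest outcome.
// A larger alpha makes the score react faster to recent scans.
function updateScore(prev: number, foundFiles: boolean, alpha: number = 0.3): number {
  const outcome = foundFiles ? 1 : 0;
  return (1 - alpha) * prev + alpha * outcome;
}

// Returns a new array with the most promising directories first.
function rankDirectories(dirs: DirStats[]): DirStats[] {
  return dirs.slice().sort((a, b) => b.score - a.score);
}

const knownDirs: DirStats[] = [
  { path: "/var/log/app1/", score: updateScore(0.5, true) },
  { path: "/opt/tool/logs/", score: updateScore(0.5, false) },
];
console.log(rankDirectories(knownDirs).map((d) => d.path));
```

Scanning directories in ranked order lets a client spend most of its time where logs have actually been found, while a score that decays after empty scans keeps stale locations from dominating.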

Reliable Architecture

Launcher/worker two-process model prevents update failures
Branded types prevent ID mix-ups at compile time
Discriminated union Result types for error handling
Plugin architecture enables swappable storage backends
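
For readers unfamiliar with branded types and discriminated union results, the following TypeScript sketch shows the general technique. The type and function names (`Brand`, `ClientId`, `UserId`, `parseClientId`, `approveClient`) are illustrative and not taken from the Tokenly codebase.

```typescript
// A brand is a phantom property that exists only in the type system.
type Brand<T, B extends string> = T & { readonly __brand: B };

type ClientId = Brand<string, "ClientId">;
type UserId = Brand<string, "UserId">;

// The `kind` field is the discriminant: checking it narrows the union.
type Result<T, E> =
  | { kind: "ok"; value: T }
  | { kind: "err"; error: E };

// The only place where a plain string becomes a ClientId.
function parseClientId(raw: string): Result<ClientId, string> {
  if (raw.trim() === "") {
    return { kind: "err", error: "client id must not be empty" };
  }
  return { kind: "ok", value: raw as ClientId };
}

function approveClient(id: ClientId): string {
  return `approved ${id}`;
}

const parsedId = parseClientId("client-42");
if (parsedId.kind === "ok") {
  console.log(approveClient(parsedId.value)); // value is only accessible here
} else {
  console.log(`rejected: ${parsedId.error}`);
}

// const someUser = "user-7" as UserId;
// approveClient(someUser); // compile-time error: UserId is not assignable to ClientId
```

Both types are plain strings at runtime, so the brand adds no overhead; the compiler simply refuses to accept one kind of ID where another is expected, and forces callers to handle the error case before touching the value.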

Project Status

Phase 1: Server + Admin Interface - COMPLETE

Server Core API (12 endpoints, TypeScript/Azure Functions)
Admin Storage Plugin (in-memory + Azure Table Storage)
Token Storage Plugin (in-memory with analytics)
Admin Interface (React SPA, 7 pages, dark theme)
JWT authentication with refresh tokens
Audit logging and permission system

Phase 2: Client + Production Storage - IN PROGRESS

Go client MVP (launcher + worker, cross-platform)
JSONL discovery, validation, and upload pipeline
Learning engine for optimized scanning
Persistent token storage (PostgreSQL / InfluxDB)
System service installation (systemd / Windows Service)
Client auto-update mechanism

Phase 3: Advanced Features - PLANNED

Budget alerts and threshold notifications
Advanced analytics and cost projections
Multi-tenant support

Project Structure

Tokenly/
├── api/                    → Azure Functions backend (TypeScript)
│   └── src/
│       ├── functions/      → 12 HTTP trigger handlers
│       ├── services/       → Business logic (admin, client, JWT)
│       ├── interfaces/     → Plugin contracts
│       ├── plugins/        → InMemory + AzureTable storage
│       └── models/         → Domain models, DTOs, branded types
├── web/admin/              → React SPA frontend
│   └── src/
│       ├── pages/          → 7 pages (dashboard, clients, analytics, ...)
│       ├── components/     → Reusable UI (Button, Card, Modal, ...)
│       ├── contexts/       → Auth state management
│       └── services/       → API client with auto-refresh
├── client/go/              → Go client (launcher + worker)
│   ├── cmd/                → Entry points (launcher, worker)
│   └── internal/           → Config, platform, scanner, uploader, learner
├── specs/                  → 8 component specifications (language-agnostic)
└── swa-cli.config.json     → Local development config

Getting Started

Local Development

# Install dependencies (run from the repository root)
npm --prefix api install
npm --prefix web/admin install

# Start the full stack (SWA CLI)
npx swa start    # Frontend on :4280, API on :7071

# Build the Go client
cd client/go && make build

Deploy a Client

./tokenly-launcher --server https://your-server.com

The launcher registers with the server, waits for admin approval, then automatically starts the worker to discover and upload token usage data.

Approve Clients

Use the admin interface to review and approve pending clients. All client configuration is managed server-side and delivered via heartbeat responses.

Specifications

Spec	Component
`01-client-launcher-spec.md`	Client Launcher - system service managing worker lifecycle
`02-client-worker-spec.md`	Client Worker - JSONL discovery, upload, and learning engine
`03-server-core-spec.md`	Server Core - HTTP API, auth, client management, ingestion
`04-admin-storage-plugin-spec.md`	Admin Storage Plugin - client registry, config, users, audit
`05-token-storage-plugin-spec.md`	Token Storage Plugin - usage data and analytics
`06-admin-interface-spec.md`	Admin Interface - React SPA for management and analytics
`07-client-protocol-spec.md`	Client Protocol - language-agnostic interoperability contracts
`08-ingestion-post-processor-spec.md`	Ingestion Post-Processor - async JSONL parsing and validation

All specifications are implementation-agnostic, using JSON data models, operation tables, and behavioral descriptions.

Tokenly: Because knowing where your tokens go is the first step to AI cost management.
