-
Notifications
You must be signed in to change notification settings - Fork 1
Features
Derived from the API Mappings Wiki (450 documented endpoints, 2026-03-04)
Version: 2.0.4 | Last Updated: 2026-03-04
- Admin
- Analytics
- Authentication
- Chat & Messaging
- Circuit Breakers
- Code Router
- Coupons
- Credits
- Diagnostics
- Error Monitoring
- General Router
- Health & Monitoring
- Metrics & Observability
- Models & Catalog
- Other
- Status
- Users
Full administrative control over users, credits, caches, monitoring, model sync, rate limits, roles, trials, downtime tracking, coupons, and notifications.
| Endpoint | Feature |
|---|---|
GET /admin/users |
List all users with search, filtering, and pagination |
GET /admin/users/{user_id} |
Get detailed user profile by ID |
GET /admin/users/by-api-key |
Look up user by their API key |
GET /admin/users/count |
Total registered user count |
GET /admin/users/growth |
User growth metrics over time |
GET /admin/users/stats |
Aggregated user statistics (credits, activity, tiers) |
DELETE /admin/users/by-domain/{domain} |
Bulk delete users by email domain |
GET /admin/api-keys/{api_key_id} |
Inspect a specific API key's metadata and permissions |
GET /admin/balance |
Credit balances for all users (financial snapshot) |
| Endpoint | Feature |
|---|---|
POST /admin/add_credits |
Add credits to a user account (per-transaction cap, 24h rolling daily limit) |
GET /admin/credit-transactions |
Query credit transaction history with filtering |
POST /admin/limit |
Set per-user daily credit spending limits |
POST /admin/assign-plan |
Assign a subscription plan to a user |
| Endpoint | Feature |
|---|---|
GET /admin/monitor |
Comprehensive system monitoring: user counts, credit totals, API usage (today + 30 days), per-user breakdowns |
GET /admin/monitoring/chat-requests |
All chat completion requests with filtering |
GET /admin/monitoring/chat-requests/summary |
Aggregated chat request summary stats |
GET /admin/monitoring/chat-requests/plot-data |
Time-series plot data for chat request dashboards |
GET /admin/monitoring/chat-requests/by-api-key |
Chat request breakdown by API key |
GET /admin/monitoring/chat-requests/providers |
Chat requests grouped by provider |
GET /admin/monitoring/chat-requests/counts |
Request count aggregations |
GET /admin/monitoring/chat-requests/models |
Chat requests grouped by model |
GET /admin/model-usage-analytics |
Model usage analytics: top models, costs, trends |
GET /admin/monitoring/api-key-tracking-quality |
API key tracking data quality metrics |
GET /admin/monitoring/api-key-tracking-trend |
API key tracking trend over time |
| Endpoint | Feature |
|---|---|
GET /admin/cache-status |
In-process provider catalog cache metadata (age, TTL, validity) |
GET /admin/huggingface-cache-status |
HuggingFace model cache state with all cached model IDs |
GET /admin/debug-models |
Diagnostic: sample models, providers, slug matching tests, cache metadata |
POST /admin/refresh-providers |
Force refresh provider cache from upstream |
POST /admin/refresh-huggingface-cache |
Force refresh HuggingFace model cache |
POST /admin/clear-rate-limit-cache |
Clear all rate limit counters from Redis |
| Endpoint | Feature |
|---|---|
GET /admin/model-sync/status |
Current model sync job status |
GET /admin/model-sync/health |
Model sync service health |
GET /admin/model-sync/providers |
List of providers available for sync |
POST /admin/model-sync/trigger |
Trigger incremental model sync |
POST /admin/model-sync/all |
Sync all models from all providers |
POST /admin/model-sync/full |
Full catalog resync (delete + reimport) |
POST /admin/model-sync/incremental |
Incremental delta sync |
POST /admin/model-sync/providers-only |
Sync provider metadata only |
POST /admin/model-sync/provider/{provider_slug} |
Sync a single provider |
POST /admin/model-sync/reset-and-resync |
Flush and fully resync the catalog |
DELETE /admin/model-sync/flush-models |
Flush all models from catalog DB |
DELETE /admin/model-sync/flush-providers |
Flush all providers from catalog DB |
| Endpoint | Feature |
|---|---|
GET /admin/rate-limits/config |
Current rate limit configuration |
GET /admin/rate-limits/system |
System-wide rate limit stats |
GET /admin/rate-limits/users |
Per-user rate limit stats |
PUT /admin/rate-limits/update |
Update rate limit rules |
POST /admin/rate-limits/config/reset |
Reset rate limit config to defaults |
DELETE /admin/rate-limits/delete |
Delete rate limit entries for an API key |
| Endpoint | Feature |
|---|---|
GET /admin/roles/{user_id} |
Get user's role info |
POST /admin/roles/update |
Update a user's role with reason logging |
GET /admin/roles/list/{role} |
List all users with a specific role |
GET /admin/roles/permissions/{role} |
Get permissions for a role |
GET /admin/roles/audit/log |
Role change audit log |
| Endpoint | Feature |
|---|---|
GET /admin/trial/analytics |
Trial usage analytics |
GET /admin/trial/users |
List all trial users with status |
GET /admin/trial/domain-analysis |
Trial sign-up analysis by email domain (abuse detection) |
GET /admin/trial/conversion-funnel |
Trial-to-paid conversion funnel |
GET /admin/trial/ip-analysis |
Trial sign-ups by IP (fraud detection) |
GET /admin/trial/cohort-analysis |
Trial user cohort retention analysis |
POST /admin/trial/save-conversion-metrics |
Save conversion metrics snapshot |
| Endpoint | Feature |
|---|---|
GET /admin/downtime/incidents |
List all downtime incidents |
GET /admin/downtime/incidents/ongoing |
Currently active incidents |
GET /admin/downtime/statistics |
Downtime statistics (MTTR, frequency, affected providers) |
GET /admin/downtime/incidents/{incident_id} |
Incident details |
GET /admin/downtime/incidents/{incident_id}/logs |
Logs captured during incident |
GET /admin/downtime/incidents/{incident_id}/analysis |
AI-generated incident analysis |
POST /admin/downtime/incidents/{incident_id}/capture-logs |
Trigger log capture for an incident |
POST /admin/downtime/incidents/{incident_id}/resolve |
Resolve an incident |
| Endpoint | Feature |
|---|---|
GET /admin/coupons |
List all coupons |
GET /admin/coupons/{coupon_id} |
Coupon details |
GET /admin/coupons/{coupon_id}/analytics |
Coupon usage analytics |
GET /admin/coupons/stats/overview |
Coupon system overview stats |
| Endpoint | Feature |
|---|---|
POST /admin/notifications/process |
Process pending notification queue |
GET /admin/notifications/stats |
Notification delivery statistics |
| Endpoint | Feature |
|---|---|
GET /admin/test-huggingface/{hugging_face_id} |
Test HuggingFace model data retrieval |
GET /admin/health/optimizations/cache |
Health optimization cache status |
POST /admin/health/optimizations/cache/clear |
Clear health optimization cache |
Server-side analytics event forwarding to Statsig and PostHog, bypassing client-side ad-blockers.
| Endpoint | Feature |
|---|---|
POST /v1/analytics/events |
Send a single analytics event (routes to Statsig + PostHog) |
POST /v1/analytics/batch |
Batch send multiple analytics events |
POST /v1/analytics/session/start |
Start an analytics session |
GET /v1/analytics/cache |
Analytics cache status and data |
GET /v1/analytics/cache/summary |
Analytics cache summary |
User registration, login, and password management with Privy, Google, GitHub, email, and wallet auth methods.
| Endpoint | Feature |
|---|---|
POST /auth |
Authenticate user (Privy token validation) - returns API key, credits, subscription status |
POST /auth/register |
Register a new user account |
POST /auth/password-reset |
Request a password reset email |
POST /auth/reset-password |
Reset password with token |
GET /auth/health |
Auth service health check |
OpenAI-compatible and Anthropic-compatible inference, chat sessions, message history, feedback, search, sharing, metrics, and Vercel AI SDK compatibility.
| Endpoint | Feature |
|---|---|
POST /v1/chat/completions |
Primary inference endpoint - Full OpenAI Chat Completions API compatibility (streaming SSE, tool/function calling, JSON mode, logprobs, all standard parameters). Routes to 30+ providers with automatic failover. |
POST /v1/messages |
Anthropic Messages API - Drop-in Claude compatibility |
POST /v1/responses |
OpenAI v1/responses - Unified response API |
POST /api/chat/ai-sdk |
Vercel AI SDK compatible chat endpoint |
POST /api/chat/ai-sdk-completions |
Vercel AI SDK completions endpoint |
| Endpoint | Feature |
|---|---|
GET /v1/chat/sessions |
List all chat sessions for the user |
GET /v1/chat/sessions/{session_id} |
Get session with full message history |
POST /v1/chat/sessions/{session_id}/messages |
Save a message to a session |
POST /v1/chat/sessions/{session_id}/messages/batch |
Batch save messages |
POST /v1/chat/search |
Search across chat sessions |
PUT /v1/chat/sessions/{session_id} |
Update session metadata |
DELETE /v1/chat/sessions/{session_id} |
Delete a chat session |
GET /v1/chat/stats |
Chat usage statistics |
| Endpoint | Feature |
|---|---|
GET /v1/chat/feedback |
Get user's feedback entries |
GET /v1/chat/feedback/stats |
Feedback statistics |
GET /v1/chat/sessions/{session_id}/feedback |
Feedback for a specific session |
POST /v1/chat/feedback |
Submit feedback on a response |
PUT /v1/chat/feedback/{feedback_id} |
Update feedback |
DELETE /v1/chat/feedback/{feedback_id} |
Delete feedback |
| Endpoint | Feature |
|---|---|
GET /v1/chat/completions/metrics/tokens-per-second |
Token throughput per model (Prometheus format) |
GET /v1/chat/completions/metrics/tokens-per-second/all |
Aggregate tokens/second across all models |
| Endpoint | Feature |
|---|---|
GET /v1/chat/share |
List user's share links |
GET /v1/chat/share/{token} |
Access a shared chat |
POST /v1/chat/share |
Create a shareable link for a session |
DELETE /v1/chat/share/{token} |
Delete a share link |
Monitor and manage provider circuit breaker states (CLOSED/OPEN/HALF_OPEN) for the failover system.
| Endpoint | Feature |
|---|---|
GET /circuit-breakers |
All circuit breaker states with failure/success counts |
GET /circuit-breakers/{provider} |
Circuit breaker state for a specific provider |
POST /circuit-breakers/{provider}/reset |
Reset a provider's circuit breaker to CLOSED |
POST /circuit-breakers/reset-all |
Reset all circuit breakers |
Benchmark-driven code model selection. Classifies task complexity and matches to tiered models scored by SWE-bench. Modes: auto, price, quality, agentic.
| Endpoint | Feature |
|---|---|
GET /code-router/settings/options |
Available code router configuration options |
GET /code-router/tiers |
Code model tiers with benchmark scores and pricing |
GET /code-router/stats |
Code router usage statistics |
POST /code-router/test |
Test code routing with a sample prompt |
POST /code-router/settings/validate |
Validate code router settings |
Coupon redemption for users (admin coupon management is under Admin).
| Endpoint | Feature |
|---|---|
GET /coupons/available |
List available coupons for the user |
GET /coupons/history |
User's coupon redemption history |
POST /coupons/redeem |
Redeem a coupon code (adds credits to balance) |
Credit balance management, transactions, add/adjust/refund operations.
| Endpoint | Feature |
|---|---|
GET /credits/balance |
Current credit balance breakdown |
GET /credits/summary |
Credit summary with usage breakdown |
GET /credits/transactions |
Credit transaction history |
POST /credits/add |
Add credits to account |
POST /credits/adjust |
Adjust credits (positive or negative) |
POST /credits/bulk-add |
Bulk add credits to multiple users |
POST /credits/refund |
Process a credit refund |
Real-time system diagnostics for concurrency and provider performance.
| Endpoint | Feature |
|---|---|
GET /api/diagnostics/concurrency |
Active concurrency stats (in-flight requests, queue depth, shed count) |
GET /api/diagnostics/provider-timing |
Provider response timing summary (avg, p50, p95, p99 by provider) |
Autonomous error detection, pattern recognition, AI-generated fix suggestions, and continuous monitoring.
| Endpoint | Feature |
|---|---|
GET /error-monitor/autonomous/status |
Autonomous error monitor status |
GET /error-monitor/health |
Error monitor health check |
GET /error-monitor/dashboard |
Error monitoring dashboard data (charts, stats, trends) |
GET /error-monitor/errors/recent |
Recent errors with severity and trace info |
GET /error-monitor/errors/critical |
Critical errors only |
GET /error-monitor/errors/fixable |
Errors with auto-fix potential |
GET /error-monitor/errors/patterns |
Detected error patterns (recurring issues) |
GET /error-monitor/fixes/generated |
All AI-generated fix suggestions |
GET /error-monitor/fixes/{fix_id} |
Details of a specific fix suggestion |
POST /error-monitor/fixes/generate-for-error |
Generate an AI fix suggestion for an error |
POST /error-monitor/fixes/generate-batch |
Batch generate fix suggestions |
POST /error-monitor/monitor/start |
Start continuous error monitoring |
POST /error-monitor/monitor/scan |
Trigger a one-time error scan |
ML-powered model selection via NotDiamond. Analyzes prompt content and picks the best model optimized for quality, cost, latency, or balanced.
| Endpoint | Feature |
|---|---|
GET /general-router/settings/options |
Available routing strategies and model pools |
GET /general-router/models |
Models available for general routing |
GET /general-router/fallback-models |
Fallback model chain for the general router |
GET /general-router/stats |
General router usage statistics |
POST /general-router/test |
Test general routing with a sample prompt |
Tiered health monitoring across all providers, models, and gateways with dashboards, uptime tracking, and optimization insights.
| Endpoint | Feature |
|---|---|
GET /health |
Primary system health check (version, status, timestamp) |
GET /health/quick |
Lightweight health check (minimal overhead) |
GET /health/railway |
Railway deployment health check (comprehensive: DB, Redis, providers) |
GET /health/system |
Detailed system health (memory, CPU, connections) |
GET /health/database |
Database connectivity and performance health |
GET /health/all |
All health checks combined |
GET /health/status |
Current system status |
GET /health/summary |
Health summary with scores |
GET /health/uptime |
System uptime metrics |
GET /health/insights |
AI-generated health insights and recommendations |
GET /health/dashboard |
Health dashboard data |
GET /health/optimizations |
Current optimization status |
GET /health/optimizations/connection-pools |
Connection pool health |
GET /health/optimizations/prioritization |
Request prioritization stats |
| Endpoint | Feature |
|---|---|
GET /health/providers |
All provider health scores and statuses |
GET /health/provider/{provider} |
Single provider health details |
GET /health/providers/stats |
Provider health statistics |
GET /health/providers/uptime |
Provider uptime history |
GET /health/providers/import-status |
Provider data import status |
GET /health/google-vertex |
Google Vertex AI specific health check |
| Endpoint | Feature |
|---|---|
GET /health/models |
All model health scores |
GET /health/model/{model_id} |
Single model health details |
GET /health/models/stats |
Model health statistics |
GET /health/models/uptime |
Model uptime history |
| Endpoint | Feature |
|---|---|
GET /health/gateways |
All gateway health checks |
GET /health/gateways/dashboard |
Gateway health dashboard (HTML) |
GET /health/gateways/dashboard/data |
Gateway dashboard data (JSON) |
GET /health/{gateway} |
Single gateway health check |
POST /health/gateways/{gateway}/fix |
Trigger auto-fix for an unhealthy gateway |
| Endpoint | Feature |
|---|---|
GET /health/catalog/models |
Catalog model data health |
GET /health/catalog/providers |
Catalog provider data health |
| Endpoint | Feature |
|---|---|
GET /health/monitoring/status |
Active monitoring status |
POST /health/monitoring/start |
Start active health monitoring |
POST /health/monitoring/stop |
Stop active health monitoring |
POST /health/check |
Trigger a health check |
POST /health/check/now |
Trigger an immediate health check |
Prometheus metrics, Grafana integration, OpenTelemetry tracing, Loki logging.
| Endpoint | Feature |
|---|---|
GET /metrics |
Raw Prometheus metrics (supports OpenMetrics format for exemplars with trace ID links) |
GET /api/metrics/parsed |
Structured JSON: latency (p50/p95/p99/avg), request counts, error counts by endpoint |
GET /api/metrics/status |
Metrics collection status |
GET /api/metrics/summary |
Metrics summary |
GET /api/metrics/health |
Metrics system health |
GET /api/metrics/grafana-queries |
Grafana-compatible query results |
POST /api/metrics/test |
Test metrics collection |
| Endpoint | Feature |
|---|---|
GET /api/monitoring/health |
Provider health scores (0-100) with status per provider |
GET /api/monitoring/health/{provider} |
Single provider health |
GET /api/monitoring/stats/realtime |
Real-time stats: requests, cost, health, error rates, latency with hourly breakdown |
GET /api/monitoring/stats/hourly/{provider} |
Hourly stats for a specific provider |
GET /api/monitoring/error-rates |
Error rates by provider and model with trend detection |
GET /api/monitoring/errors/{provider} |
Recent error logs per provider |
GET /api/monitoring/cost-analysis |
Cost breakdown by provider with cost-per-request |
GET /api/monitoring/latency-trends/{provider} |
Latency percentiles (p50/p95/p99) over time |
GET /api/monitoring/latency/{provider}/{model} |
Latency stats for a specific model |
GET /api/monitoring/anomalies |
Anomaly detection: cost spikes, latency spikes, high error rates |
GET /api/monitoring/circuit-breakers |
Circuit breaker states per provider |
GET /api/monitoring/circuit-breakers/{provider} |
Circuit breaker for a specific provider |
GET /api/monitoring/providers/comparison |
Multi-provider comparison matrix |
GET /api/monitoring/token-efficiency/{provider}/{model} |
Token efficiency analysis |
GET /api/monitoring/trial-analytics |
Trial system analytics |
GET /api/monitoring/chat-requests |
Chat request monitoring |
GET /api/monitoring/chat-requests/counts |
Chat request counts |
GET /api/monitoring/chat-requests/models |
Chat requests by model |
GET /api/monitoring/chat-requests/providers |
Chat requests by provider |
GET /api/monitoring/chat-requests/plot-data |
Chat request time-series data |
POST /monitoring |
Sentry tunnel (proxies Sentry events from frontend) |
| Endpoint | Feature |
|---|---|
GET /api/instrumentation/health |
Instrumentation health |
GET /api/instrumentation/config |
Current instrumentation configuration |
GET /api/instrumentation/environment-variables |
Instrumentation env vars |
GET /api/instrumentation/loki/status |
Loki log aggregation status |
GET /api/instrumentation/tempo/status |
Tempo distributed tracing status |
GET /api/instrumentation/otel/status |
OpenTelemetry status |
GET /api/instrumentation/trace-context |
Current trace context |
POST /api/instrumentation/test-log |
Send test log to Loki |
POST /api/instrumentation/test-trace |
Send test trace to Tempo |
| Endpoint | Feature |
|---|---|
GET /prometheus/datasource |
Grafana SimpleJSON datasource test |
POST /prometheus/datasource/search |
Metric name search |
POST /prometheus/datasource/query |
Metric query |
POST /prometheus/datasource/annotations |
Annotation query |
POST /prometheus/datasource/tag-keys |
Tag key query |
POST /prometheus/datasource/tag-values |
Tag value query |
| Endpoint | Feature |
|---|---|
GET /prometheus/data/metrics |
Prometheus telemetry data |
GET /prometheus/data/admin/cache/status |
Cache status via Prometheus |
DELETE /prometheus/data/admin/cache/invalidate |
Invalidate cache via Prometheus API |
GET /prometheus/data/instrumentation/health |
Instrumentation health via Prometheus |
GET /prometheus/data/instrumentation/loki/status |
Loki status via Prometheus |
GET /prometheus/data/instrumentation/tempo/status |
Tempo status via Prometheus |
POST /prometheus/data/instrumentation/test-log |
Test log via Prometheus |
POST /prometheus/data/instrumentation/test-trace |
Test trace via Prometheus |
Model discovery, search, comparison, trending models, provider info, HuggingFace integration, and gateway management.
| Endpoint | Feature |
|---|---|
GET /v1/models |
List all models (filter by provider, gateway, private, HuggingFace; pagination) |
GET /v1/models/unique |
Deduplicated model list (one entry per model across providers) |
GET /v1/models/search |
Full-text model search |
GET /v1/models/trending |
Top models ranked by requests, tokens, users, cost, speed |
GET /v1/models/low-latency |
Low-latency optimized models |
GET /v1/models/{provider}/{model} |
Specific model details |
GET /v1/models/{provider}/{model}/compare |
Compare a model across providers |
GET /v1/models/{developer} |
Models by developer/organization |
POST /v1/models/batch-compare |
Batch compare multiple models |
GET /api/models/detail |
Detailed model info for frontend rendering |
| Endpoint | Feature |
|---|---|
GET /v1/modelz/models |
Full canonical model registry |
GET /v1/modelz/ids |
All model IDs in the registry |
GET /v1/modelz/check/{model_id} |
Check if a model exists and get its registry data |
| Endpoint | Feature |
|---|---|
GET /v1/provider |
List all providers with stats |
GET /v1/provider/{provider_name}/stats |
Provider statistics (models, requests, cost) |
GET /v1/provider/{provider_name}/top-models |
Top models for a provider |
GET /v1/routers |
Available intelligent routing options |
| Endpoint | Feature |
|---|---|
GET /v1/gateways |
List all registered gateways (frontend auto-discovers from this) |
GET /v1/gateways/status |
Gateway statuses |
GET /v1/gateways/summary |
Aggregated gateway statistics |
GET /v1/gateway/{gateway}/stats |
Stats for a specific gateway |
| Endpoint | Feature |
|---|---|
GET /v1/huggingface/discovery |
Discover HuggingFace models |
GET /v1/huggingface/search |
Search HuggingFace models |
GET /v1/huggingface/author/{author}/models |
Models by a HuggingFace author |
GET /v1/huggingface/models/{model_id}/details |
HuggingFace model details (downloads, likes, parameters) |
GET /v1/huggingface/models/{model_id}/card |
Model card (README) |
GET /v1/huggingface/models/{model_id}/files |
Model file listing |
| Endpoint | Feature |
|---|---|
GET /v1/model-health |
All model health data |
GET /v1/model-health/stats |
Model health statistics |
GET /v1/model-health/providers |
Provider-level health summary |
GET /v1/model-health/unhealthy |
Currently unhealthy models |
GET /v1/model-health/{provider}/{model} |
Health for a specific model |
GET /v1/model-health/provider/{provider}/summary |
Provider health summary |
| Endpoint | Feature |
|---|---|
GET /ranking/models |
Model leaderboard with trend data (direction, percentage, logos) |
| Endpoint | Feature |
|---|---|
GET /availability/models |
All model availability statuses |
GET /availability/model/{model_id} |
Availability for a specific model |
GET /availability/summary |
Availability summary |
GET /availability/status |
Availability system status |
GET /availability/check/{model_id} |
Check model availability with provider details |
GET /availability/fallback/{model_id} |
Fallback providers for a model |
GET /availability/best/{model_id} |
Best available provider for a model |
POST /availability/maintenance/{model_id} |
Put a model in maintenance mode |
DELETE /availability/maintenance/{model_id} |
Remove maintenance mode |
POST /availability/monitoring/start |
Start availability monitoring |
POST /availability/monitoring/stop |
Stop availability monitoring |
| Endpoint | Feature |
|---|---|
GET /catalog/models-db/ |
List all models in the database catalog |
GET /catalog/models-db/{model_id} |
Get a model from the DB |
GET /catalog/models-db/search |
Search the model catalog DB |
GET /catalog/models-db/stats |
Catalog statistics |
GET /catalog/models-db/provider/{provider_slug} |
Models by provider in DB |
GET /catalog/models-db/health/{health_status} |
Models by health status in DB |
GET /catalog/models-db/{model_id}/health/history |
Model health history |
POST /catalog/models-db/bulk |
Bulk create models |
POST /catalog/models-db/bulk-upsert |
Bulk upsert models |
POST /catalog/models-db/upsert |
Upsert a single model |
POST /catalog/models-db/{model_id}/activate |
Activate a model |
POST /catalog/models-db/{model_id}/deactivate |
Deactivate a model |
PATCH /catalog/models-db/{model_id}/health |
Update model health status |
| Endpoint | Feature |
|---|---|
GET /providers/ |
List all providers in DB |
PATCH /providers/{provider_id} |
Update provider metadata |
GET /providers/{provider_id}/models/stats |
Provider model statistics |
Registration, images, audio, tools, payments, IP whitelists, Nosana GPU, partner trials, notifications, and system utilities.
| Endpoint | Feature |
|---|---|
POST /create |
Create new user account ($5 initial credits, 3-day trial, welcome email) |
| Endpoint | Feature |
|---|---|
POST /v1/images/generations |
Generate images using AI models (Stability AI, DALL-E, etc.) with credit billing |
| Endpoint | Feature |
|---|---|
POST /v1/audio/transcriptions |
Transcribe audio files via Whisper (supports all major formats, credit billing per minute) |
POST /v1/audio/transcriptions/base64 |
Transcribe base64-encoded audio |
| Endpoint | Feature |
|---|---|
GET /v1/tools |
List available server-side tools |
GET /v1/tools/definitions |
Tool definitions (for function calling) |
GET /v1/tools/{tool_name} |
Get a specific tool's details |
POST /v1/tools/execute |
Execute a server-side tool (TTS, calculator, code executor) |
POST /v1/tools/search/augment |
Search augmentation tool |
| Endpoint | Feature |
|---|---|
POST /api/stripe/checkout-session |
Create Stripe checkout session |
GET /api/stripe/checkout-session/{session_id} |
Get checkout session status |
GET /api/stripe/credit-packages |
Available credit packages and pricing |
POST /api/stripe/payment-intent |
Create payment intent |
GET /api/stripe/payment-intent/{payment_intent_id} |
Get payment intent status |
GET /api/stripe/payments |
List user's payment history |
GET /api/stripe/payments/{payment_id} |
Get payment details |
POST /api/stripe/refund |
Process a refund |
GET /api/stripe/subscription |
Get current subscription |
POST /api/stripe/subscription-checkout |
Create subscription checkout |
POST /api/stripe/subscription/upgrade |
Upgrade subscription plan |
POST /api/stripe/subscription/downgrade |
Downgrade subscription plan |
POST /api/stripe/subscription/cancel |
Cancel subscription |
POST /api/stripe/webhook |
Stripe webhook handler (checkout.session.completed, payment_intent.succeeded/failed, charge.refunded) |
| Endpoint | Feature |
|---|---|
POST /api/admin/ip-whitelist |
Create IP allowlist entry |
GET /api/admin/ip-whitelist/{entry_id} |
Get allowlist entry |
PUT /api/admin/ip-whitelist/{entry_id} |
Update allowlist entry |
DELETE /api/admin/ip-whitelist/{entry_id} |
Delete allowlist entry |
GET /api/admin/ip-whitelist |
List all allowlist entries |
POST /api/admin/ip-whitelist/check |
Check if an IP is allowlisted |
| Endpoint | Feature |
|---|---|
GET /nosana/config |
Nosana platform configuration |
GET /nosana/credits/balance |
Nosana credit balance |
GET /nosana/deployments |
List all Nosana deployments |
GET /nosana/deployments/{deployment_id} |
Deployment details |
POST /nosana/deployments/llm |
Deploy LLM inference on GPU |
POST /nosana/deployments/image-generation |
Deploy image generation on GPU |
POST /nosana/deployments/whisper |
Deploy Whisper transcription on GPU |
POST /nosana/deployments/{deployment_id}/start |
Start a deployment |
POST /nosana/deployments/{deployment_id}/stop |
Stop a deployment |
POST /nosana/deployments/{deployment_id}/archive |
Archive a deployment |
POST /nosana/deployments/{deployment_id}/revisions |
Create a deployment revision |
PATCH /nosana/deployments/{deployment_id}/replicas |
Update replica count |
GET /nosana/markets |
List GPU markets |
GET /nosana/markets/{market_id} |
Market details |
GET /nosana/markets/{market_id}/resources |
Market resource requirements |
POST /nosana/jobs |
Create a new GPU job |
GET /nosana/jobs/{job_address} |
Job details |
POST /nosana/jobs/{job_address}/extend |
Extend job duration |
POST /nosana/jobs/{job_address}/stop |
Stop a job |
| Endpoint | Feature |
|---|---|
GET /partner-trials/config/{partner_code} |
Partner trial configuration |
GET /partner-trials/check/{code} |
Check if partner code is valid |
GET /partner-trials/status |
Current user's partner trial status |
GET /partner-trials/daily-limit |
Partner trial daily limit info |
GET /partner-trials/analytics/{partner_code} |
Partner trial analytics |
POST /partner-trials/start |
Start a partner trial (e.g., Redbeard 14-day Pro) |
POST /partner-trials/expire/{target_user_id} |
Force-expire a partner trial |
| Endpoint | Feature |
|---|---|
GET /api/provider-credits/balance |
All upstream provider credit balances |
GET /api/provider-credits/balance/{provider} |
Specific provider credit balance |
| Endpoint | Feature |
|---|---|
GET /user/notifications/preferences |
Get notification preferences |
POST /user/notifications/send-usage-report |
Send usage report email |
POST /user/notifications/test |
Send test notification |
| Endpoint | Feature |
|---|---|
GET /ping |
System ping (pong response with uptime) |
GET /ping/stats |
Ping statistics |
GET /sentry-debug |
Test Sentry error tracking integration |
GET /velocity-mode-status |
Security velocity mode status |
GET / |
Root endpoint (API info) |
| Endpoint | Feature |
|---|---|
GET /health/gateways/optimized |
Optimized gateway health |
GET /health/models/optimized |
Optimized model health |
GET /health/providers/optimized |
Optimized provider health |
GET /health/dashboard/optimized |
Optimized dashboard data |
POST /admin/cache/refresh/{gateway} |
Refresh cache for a specific gateway |
POST /admin/cache/clear |
Clear all caches |
GET /admin/cache/status |
Cache status |
GET /admin/cache/debouncer/stats |
Cache debouncer stats |
GET /admin/cache/warmer/stats |
Cache warmer stats |
GET /admin/cache/modelz/status |
Modelz cache status |
POST /admin/cache/modelz/refresh |
Refresh modelz cache |
DELETE /admin/cache/modelz/clear |
Clear modelz cache |
POST /admin/cache/pricing/refresh |
Refresh pricing cache |
POST /admin/api/cache/invalidate |
Invalidate specific cache entry |
Public-facing status page for model and provider availability (no authentication required).
| Endpoint | Feature |
|---|---|
GET /v1/status/ |
Overall system status |
GET /v1/status/detailed |
Detailed status with provider breakdown |
GET /v1/status/providers |
Provider availability statuses |
GET /v1/status/models |
Model availability statuses |
GET /v1/status/models/{provider}/{model_id} |
Specific model status |
GET /v1/status/incidents |
Recent incidents |
GET /v1/status/uptime/{provider}/{model_id} |
Model uptime history |
GET /v1/status/search |
Search models on status page |
GET /v1/status/stats |
Status page statistics |
User profile, balance, plan, rate limits, activity, and account management.
| Endpoint | Feature |
|---|---|
GET /user/balance |
Current credit balance and status |
GET /user/monitor |
User's own usage monitoring data |
GET /user/rate-limits |
User's rate limit configuration and current usage |
GET /user/profile |
User profile data |
PUT /user/profile |
Update user profile |
GET /user/plan |
Current subscription plan |
GET /user/plan/entitlements |
Plan entitlements (what the plan includes) |
GET /user/plan/usage |
Plan usage vs limits |
GET /user/limit |
Daily spending limit |
GET /user/credit-transactions |
Credit transaction history |
GET /user/environment-usage |
Usage by environment (live/test/staging/dev) |
GET /user/cache-settings |
User's cache settings |
GET /user/activity/stats |
Activity statistics |
GET /user/activity/log |
Activity log |
DELETE /user/account |
Delete user account |
| Endpoint | Feature |
|---|---|
POST /user/api-keys |
Create a new API key (name, environment tag, scope permissions) |
GET /user/api-keys |
List all API keys |
GET /user/api-keys/usage |
API key usage statistics |
GET /user/api-keys/audit-logs |
API key audit logs |
PUT /user/api-keys/{key_id} |
Update API key (name, active status) |
DELETE /user/api-keys/{key_id} |
Delete an API key |
| Endpoint | Feature |
|---|---|
GET /user/rate-limits/usage/{key_id} |
Rate limit usage for a specific key |
PUT /user/rate-limits/{key_id} |
Update rate limit for a key |
POST /user/rate-limits/bulk-update |
Bulk update rate limits |
| Endpoint | Feature |
|---|---|
GET /plans |
List all available subscription plans |
GET /plans/{plan_id} |
Plan details |
GET /subscription/plans |
Subscription plans (alternate path) |
GET /trial/status |
Current trial status |
| Endpoint | Feature |
|---|---|
GET /referral/code |
Get user's referral code |
POST /referral/generate |
Generate a new referral code |
GET /referral/stats |
Referral statistics (total, conversion rate, rewards) |
POST /referral/validate |
Validate and apply a referral code |
| Endpoint | Feature |
|---|---|
GET /analytics/transactions |
Transaction analytics data |
GET /analytics/transactions/summary |
Transaction summary |
| Category | Endpoints | Key Capabilities |
|---|---|---|
| Admin | 46 | User management, credits, caches, model sync, rate limits, RBAC, trials, downtime, coupons |
| Analytics | 5 | Server-side event forwarding (Statsig + PostHog) |
| Authentication | 5 | Multi-method auth (Privy, Google, GitHub, email, wallet) |
| Chat & Messaging | 20 | OpenAI/Anthropic inference, sessions, history, feedback, sharing, AI SDK |
| Circuit Breakers | 4 | Provider circuit breaker monitoring and reset |
| Code Router | 5 | Benchmark-driven code model selection (SWE-bench) |
| Coupons | 3 | User coupon redemption |
| Credits | 6 | Balance, transactions, add/adjust/refund |
| Diagnostics | 2 | Concurrency and provider timing diagnostics |
| Error Monitoring | 12 | Autonomous error detection, AI fix generation |
| General Router | 4 | ML-powered model selection (NotDiamond) |
| Health & Monitoring | 30 | Multi-tier health: system, providers, models, gateways |
| Metrics & Observability | 6 | Prometheus, Grafana, OpenTelemetry, Loki, Tempo |
| Models & Catalog | 23 | Discovery, search, compare, trending, HuggingFace, availability |
| Other | 19 | Images, audio, tools, Stripe, IP allowlists, Nosana GPU, partner trials |
| Status | 2 | Public status page |
| Users | 8 | Profile, balance, plan, rate limits, API keys, referrals |
| Total | 450 |
Source: API Mappings Wiki | Conceptual Model | Developer Wiki
Reading Path (start here, in order)
- Conceptual Model
- Stability Definition
- Conceptual Model Features
- Features
- Delta Report
- Features-Acceptance-Criteria
Testing
Security & Access
Billing
Monitoring
Features
Providers
Operations
Data References