Skip to content
arminrad edited this page Mar 5, 2026 · 4 revisions

Gatewayz Backend - Complete Feature Reference

Derived from the API Mappings Wiki (450 documented endpoints, 2026-03-04)

Version: 2.0.4 | Last Updated: 2026-03-04


Table of Contents

  1. Admin
  2. Analytics
  3. Authentication
  4. Chat & Messaging
  5. Circuit Breakers
  6. Code Router
  7. Coupons
  8. Credits
  9. Diagnostics
  10. Error Monitoring
  11. General Router
  12. Health & Monitoring
  13. Metrics & Observability
  14. Models & Catalog
  15. Other
  16. Status
  17. Users

1. Admin (46 endpoints)

Full administrative control over users, credits, caches, monitoring, model sync, rate limits, roles, trials, downtime tracking, coupons, and notifications.

User Management

Endpoint Feature
GET /admin/users List all users with search, filtering, and pagination
GET /admin/users/{user_id} Get detailed user profile by ID
GET /admin/users/by-api-key Look up user by their API key
GET /admin/users/count Total registered user count
GET /admin/users/growth User growth metrics over time
GET /admin/users/stats Aggregated user statistics (credits, activity, tiers)
DELETE /admin/users/by-domain/{domain} Bulk delete users by email domain
GET /admin/api-keys/{api_key_id} Inspect a specific API key's metadata and permissions
GET /admin/balance Credit balances for all users (financial snapshot)

Credit Operations

Endpoint Feature
POST /admin/add_credits Add credits to a user account (per-transaction cap, 24h rolling daily limit)
GET /admin/credit-transactions Query credit transaction history with filtering
POST /admin/limit Set per-user daily credit spending limits
POST /admin/assign-plan Assign a subscription plan to a user

System Monitoring

Endpoint Feature
GET /admin/monitor Comprehensive system monitoring: user counts, credit totals, API usage (today + 30 days), per-user breakdowns
GET /admin/monitoring/chat-requests All chat completion requests with filtering
GET /admin/monitoring/chat-requests/summary Aggregated chat request summary stats
GET /admin/monitoring/chat-requests/plot-data Time-series plot data for chat request dashboards
GET /admin/monitoring/chat-requests/by-api-key Chat request breakdown by API key
GET /admin/monitoring/chat-requests/providers Chat requests grouped by provider
GET /admin/monitoring/chat-requests/counts Request count aggregations
GET /admin/monitoring/chat-requests/models Chat requests grouped by model
GET /admin/model-usage-analytics Model usage analytics: top models, costs, trends
GET /admin/monitoring/api-key-tracking-quality API key tracking data quality metrics
GET /admin/monitoring/api-key-tracking-trend API key tracking trend over time

Cache Management

Endpoint Feature
GET /admin/cache-status In-process provider catalog cache metadata (age, TTL, validity)
GET /admin/huggingface-cache-status HuggingFace model cache state with all cached model IDs
GET /admin/debug-models Diagnostic: sample models, providers, slug matching tests, cache metadata
POST /admin/refresh-providers Force refresh provider cache from upstream
POST /admin/refresh-huggingface-cache Force refresh HuggingFace model cache
POST /admin/clear-rate-limit-cache Clear all rate limit counters from Redis

Model Sync

Endpoint Feature
GET /admin/model-sync/status Current model sync job status
GET /admin/model-sync/health Model sync service health
GET /admin/model-sync/providers List of providers available for sync
POST /admin/model-sync/trigger Trigger incremental model sync
POST /admin/model-sync/all Sync all models from all providers
POST /admin/model-sync/full Full catalog resync (delete + reimport)
POST /admin/model-sync/incremental Incremental delta sync
POST /admin/model-sync/providers-only Sync provider metadata only
POST /admin/model-sync/provider/{provider_slug} Sync a single provider
POST /admin/model-sync/reset-and-resync Flush and fully resync the catalog
DELETE /admin/model-sync/flush-models Flush all models from catalog DB
DELETE /admin/model-sync/flush-providers Flush all providers from catalog DB

Rate Limits

Endpoint Feature
GET /admin/rate-limits/config Current rate limit configuration
GET /admin/rate-limits/system System-wide rate limit stats
GET /admin/rate-limits/users Per-user rate limit stats
PUT /admin/rate-limits/update Update rate limit rules
POST /admin/rate-limits/config/reset Reset rate limit config to defaults
DELETE /admin/rate-limits/delete Delete rate limit entries for an API key

Roles (RBAC)

Endpoint Feature
GET /admin/roles/{user_id} Get user's role info
POST /admin/roles/update Update a user's role with reason logging
GET /admin/roles/list/{role} List all users with a specific role
GET /admin/roles/permissions/{role} Get permissions for a role
GET /admin/roles/audit/log Role change audit log

Trial Analytics

Endpoint Feature
GET /admin/trial/analytics Trial usage analytics
GET /admin/trial/users List all trial users with status
GET /admin/trial/domain-analysis Trial sign-up analysis by email domain (abuse detection)
GET /admin/trial/conversion-funnel Trial-to-paid conversion funnel
GET /admin/trial/ip-analysis Trial sign-ups by IP (fraud detection)
GET /admin/trial/cohort-analysis Trial user cohort retention analysis
POST /admin/trial/save-conversion-metrics Save conversion metrics snapshot

Downtime Tracking

Endpoint Feature
GET /admin/downtime/incidents List all downtime incidents
GET /admin/downtime/incidents/ongoing Currently active incidents
GET /admin/downtime/statistics Downtime statistics (MTTR, frequency, affected providers)
GET /admin/downtime/incidents/{incident_id} Incident details
GET /admin/downtime/incidents/{incident_id}/logs Logs captured during incident
GET /admin/downtime/incidents/{incident_id}/analysis AI-generated incident analysis
POST /admin/downtime/incidents/{incident_id}/capture-logs Trigger log capture for an incident
POST /admin/downtime/incidents/{incident_id}/resolve Resolve an incident

Coupons (Admin)

Endpoint Feature
GET /admin/coupons List all coupons
GET /admin/coupons/{coupon_id} Coupon details
GET /admin/coupons/{coupon_id}/analytics Coupon usage analytics
GET /admin/coupons/stats/overview Coupon system overview stats

Notifications (Admin)

Endpoint Feature
POST /admin/notifications/process Process pending notification queue
GET /admin/notifications/stats Notification delivery statistics

Other Admin

Endpoint Feature
GET /admin/test-huggingface/{hugging_face_id} Test HuggingFace model data retrieval
GET /admin/health/optimizations/cache Health optimization cache status
POST /admin/health/optimizations/cache/clear Clear health optimization cache

2. Analytics (5 endpoints)

Server-side analytics event forwarding to Statsig and PostHog, bypassing client-side ad-blockers.

Endpoint Feature
POST /v1/analytics/events Send a single analytics event (routes to Statsig + PostHog)
POST /v1/analytics/batch Batch send multiple analytics events
POST /v1/analytics/session/start Start an analytics session
GET /v1/analytics/cache Analytics cache status and data
GET /v1/analytics/cache/summary Analytics cache summary

3. Authentication (5 endpoints)

User registration, login, and password management with Privy, Google, GitHub, email, and wallet auth methods.

Endpoint Feature
POST /auth Authenticate user (Privy token validation) - returns API key, credits, subscription status
POST /auth/register Register a new user account
POST /auth/password-reset Request a password reset email
POST /auth/reset-password Reset password with token
GET /auth/health Auth service health check

4. Chat & Messaging (20 endpoints)

OpenAI-compatible and Anthropic-compatible inference, chat sessions, message history, feedback, search, sharing, metrics, and Vercel AI SDK compatibility.

Inference

Endpoint Feature
POST /v1/chat/completions Primary inference endpoint - Full OpenAI Chat Completions API compatibility (streaming SSE, tool/function calling, JSON mode, logprobs, all standard parameters). Routes to 30+ providers with automatic failover.
POST /v1/messages Anthropic Messages API - Drop-in Claude compatibility
POST /v1/responses OpenAI v1/responses - Unified response API
POST /api/chat/ai-sdk Vercel AI SDK compatible chat endpoint
POST /api/chat/ai-sdk-completions Vercel AI SDK completions endpoint

Chat Sessions & History

Endpoint Feature
GET /v1/chat/sessions List all chat sessions for the user
GET /v1/chat/sessions/{session_id} Get session with full message history
POST /v1/chat/sessions/{session_id}/messages Save a message to a session
POST /v1/chat/sessions/{session_id}/messages/batch Batch save messages
POST /v1/chat/search Search across chat sessions
PUT /v1/chat/sessions/{session_id} Update session metadata
DELETE /v1/chat/sessions/{session_id} Delete a chat session
GET /v1/chat/stats Chat usage statistics

Feedback

Endpoint Feature
GET /v1/chat/feedback Get user's feedback entries
GET /v1/chat/feedback/stats Feedback statistics
GET /v1/chat/sessions/{session_id}/feedback Feedback for a specific session
POST /v1/chat/feedback Submit feedback on a response
PUT /v1/chat/feedback/{feedback_id} Update feedback
DELETE /v1/chat/feedback/{feedback_id} Delete feedback

Chat Metrics

Endpoint Feature
GET /v1/chat/completions/metrics/tokens-per-second Token throughput per model (Prometheus format)
GET /v1/chat/completions/metrics/tokens-per-second/all Aggregate tokens/second across all models

Sharing

Endpoint Feature
GET /v1/chat/share List user's share links
GET /v1/chat/share/{token} Access a shared chat
POST /v1/chat/share Create a shareable link for a session
DELETE /v1/chat/share/{token} Delete a share link

5. Circuit Breakers (4 endpoints)

Monitor and manage provider circuit breaker states (CLOSED/OPEN/HALF_OPEN) for the failover system.

Endpoint Feature
GET /circuit-breakers All circuit breaker states with failure/success counts
GET /circuit-breakers/{provider} Circuit breaker state for a specific provider
POST /circuit-breakers/{provider}/reset Reset a provider's circuit breaker to CLOSED
POST /circuit-breakers/reset-all Reset all circuit breakers

6. Code Router (5 endpoints)

Benchmark-driven code model selection. Classifies task complexity and matches to tiered models scored by SWE-bench. Modes: auto, price, quality, agentic.

Endpoint Feature
GET /code-router/settings/options Available code router configuration options
GET /code-router/tiers Code model tiers with benchmark scores and pricing
GET /code-router/stats Code router usage statistics
POST /code-router/test Test code routing with a sample prompt
POST /code-router/settings/validate Validate code router settings

7. Coupons (3 endpoints)

Coupon redemption for users (admin coupon management is under Admin).

Endpoint Feature
GET /coupons/available List available coupons for the user
GET /coupons/history User's coupon redemption history
POST /coupons/redeem Redeem a coupon code (adds credits to balance)

8. Credits (6 endpoints)

Credit balance management, transactions, add/adjust/refund operations.

Endpoint Feature
GET /credits/balance Current credit balance breakdown
GET /credits/summary Credit summary with usage breakdown
GET /credits/transactions Credit transaction history
POST /credits/add Add credits to account
POST /credits/adjust Adjust credits (positive or negative)
POST /credits/bulk-add Bulk add credits to multiple users
POST /credits/refund Process a credit refund

9. Diagnostics (2 endpoints)

Real-time system diagnostics for concurrency and provider performance.

Endpoint Feature
GET /api/diagnostics/concurrency Active concurrency stats (in-flight requests, queue depth, shed count)
GET /api/diagnostics/provider-timing Provider response timing summary (avg, p50, p95, p99 by provider)

10. Error Monitoring (12 endpoints)

Autonomous error detection, pattern recognition, AI-generated fix suggestions, and continuous monitoring.

Endpoint Feature
GET /error-monitor/autonomous/status Autonomous error monitor status
GET /error-monitor/health Error monitor health check
GET /error-monitor/dashboard Error monitoring dashboard data (charts, stats, trends)
GET /error-monitor/errors/recent Recent errors with severity and trace info
GET /error-monitor/errors/critical Critical errors only
GET /error-monitor/errors/fixable Errors with auto-fix potential
GET /error-monitor/errors/patterns Detected error patterns (recurring issues)
GET /error-monitor/fixes/generated All AI-generated fix suggestions
GET /error-monitor/fixes/{fix_id} Details of a specific fix suggestion
POST /error-monitor/fixes/generate-for-error Generate an AI fix suggestion for an error
POST /error-monitor/fixes/generate-batch Batch generate fix suggestions
POST /error-monitor/monitor/start Start continuous error monitoring
POST /error-monitor/monitor/scan Trigger a one-time error scan

11. General Router (4 endpoints)

ML-powered model selection via NotDiamond. Analyzes prompt content and picks the best model optimized for quality, cost, latency, or balanced.

Endpoint Feature
GET /general-router/settings/options Available routing strategies and model pools
GET /general-router/models Models available for general routing
GET /general-router/fallback-models Fallback model chain for the general router
GET /general-router/stats General router usage statistics
POST /general-router/test Test general routing with a sample prompt

12. Health & Monitoring (30 endpoints)

Tiered health monitoring across all providers, models, and gateways with dashboards, uptime tracking, and optimization insights.

System Health

Endpoint Feature
GET /health Primary system health check (version, status, timestamp)
GET /health/quick Lightweight health check (minimal overhead)
GET /health/railway Railway deployment health check (comprehensive: DB, Redis, providers)
GET /health/system Detailed system health (memory, CPU, connections)
GET /health/database Database connectivity and performance health
GET /health/all All health checks combined
GET /health/status Current system status
GET /health/summary Health summary with scores
GET /health/uptime System uptime metrics
GET /health/insights AI-generated health insights and recommendations
GET /health/dashboard Health dashboard data
GET /health/optimizations Current optimization status
GET /health/optimizations/connection-pools Connection pool health
GET /health/optimizations/prioritization Request prioritization stats

Provider Health

Endpoint Feature
GET /health/providers All provider health scores and statuses
GET /health/provider/{provider} Single provider health details
GET /health/providers/stats Provider health statistics
GET /health/providers/uptime Provider uptime history
GET /health/providers/import-status Provider data import status
GET /health/google-vertex Google Vertex AI specific health check

Model Health

Endpoint Feature
GET /health/models All model health scores
GET /health/model/{model_id} Single model health details
GET /health/models/stats Model health statistics
GET /health/models/uptime Model uptime history

Gateway Health

Endpoint Feature
GET /health/gateways All gateway health checks
GET /health/gateways/dashboard Gateway health dashboard (HTML)
GET /health/gateways/dashboard/data Gateway dashboard data (JSON)
GET /health/{gateway} Single gateway health check
POST /health/gateways/{gateway}/fix Trigger auto-fix for an unhealthy gateway

Catalog Health

Endpoint Feature
GET /health/catalog/models Catalog model data health
GET /health/catalog/providers Catalog provider data health

Health Monitoring Control

Endpoint Feature
GET /health/monitoring/status Active monitoring status
POST /health/monitoring/start Start active health monitoring
POST /health/monitoring/stop Stop active health monitoring
POST /health/check Trigger a health check
POST /health/check/now Trigger an immediate health check

13. Metrics & Observability (6 endpoints)

Prometheus metrics, Grafana integration, OpenTelemetry tracing, Loki logging.

Endpoint Feature
GET /metrics Raw Prometheus metrics (supports OpenMetrics format for exemplars with trace ID links)
GET /api/metrics/parsed Structured JSON: latency (p50/p95/p99/avg), request counts, error counts by endpoint
GET /api/metrics/status Metrics collection status
GET /api/metrics/summary Metrics summary
GET /api/metrics/health Metrics system health
GET /api/metrics/grafana-queries Grafana-compatible query results
POST /api/metrics/test Test metrics collection

Monitoring API

Endpoint Feature
GET /api/monitoring/health Provider health scores (0-100) with status per provider
GET /api/monitoring/health/{provider} Single provider health
GET /api/monitoring/stats/realtime Real-time stats: requests, cost, health, error rates, latency with hourly breakdown
GET /api/monitoring/stats/hourly/{provider} Hourly stats for a specific provider
GET /api/monitoring/error-rates Error rates by provider and model with trend detection
GET /api/monitoring/errors/{provider} Recent error logs per provider
GET /api/monitoring/cost-analysis Cost breakdown by provider with cost-per-request
GET /api/monitoring/latency-trends/{provider} Latency percentiles (p50/p95/p99) over time
GET /api/monitoring/latency/{provider}/{model} Latency stats for a specific model
GET /api/monitoring/anomalies Anomaly detection: cost spikes, latency spikes, high error rates
GET /api/monitoring/circuit-breakers Circuit breaker states per provider
GET /api/monitoring/circuit-breakers/{provider} Circuit breaker for a specific provider
GET /api/monitoring/providers/comparison Multi-provider comparison matrix
GET /api/monitoring/token-efficiency/{provider}/{model} Token efficiency analysis
GET /api/monitoring/trial-analytics Trial system analytics
GET /api/monitoring/chat-requests Chat request monitoring
GET /api/monitoring/chat-requests/counts Chat request counts
GET /api/monitoring/chat-requests/models Chat requests by model
GET /api/monitoring/chat-requests/providers Chat requests by provider
GET /api/monitoring/chat-requests/plot-data Chat request time-series data
POST /monitoring Sentry tunnel (proxies Sentry events from frontend)

Instrumentation

Endpoint Feature
GET /api/instrumentation/health Instrumentation health
GET /api/instrumentation/config Current instrumentation configuration
GET /api/instrumentation/environment-variables Instrumentation env vars
GET /api/instrumentation/loki/status Loki log aggregation status
GET /api/instrumentation/tempo/status Tempo distributed tracing status
GET /api/instrumentation/otel/status OpenTelemetry status
GET /api/instrumentation/trace-context Current trace context
POST /api/instrumentation/test-log Send test log to Loki
POST /api/instrumentation/test-trace Send test trace to Tempo

Prometheus/Grafana Datasource

Endpoint Feature
GET /prometheus/datasource Grafana SimpleJSON datasource test
POST /prometheus/datasource/search Metric name search
POST /prometheus/datasource/query Metric query
POST /prometheus/datasource/annotations Annotation query
POST /prometheus/datasource/tag-keys Tag key query
POST /prometheus/datasource/tag-values Tag value query

Prometheus Data API

Endpoint Feature
GET /prometheus/data/metrics Prometheus telemetry data
GET /prometheus/data/admin/cache/status Cache status via Prometheus
DELETE /prometheus/data/admin/cache/invalidate Invalidate cache via Prometheus API
GET /prometheus/data/instrumentation/health Instrumentation health via Prometheus
GET /prometheus/data/instrumentation/loki/status Loki status via Prometheus
GET /prometheus/data/instrumentation/tempo/status Tempo status via Prometheus
POST /prometheus/data/instrumentation/test-log Test log via Prometheus
POST /prometheus/data/instrumentation/test-trace Test trace via Prometheus

14. Models & Catalog (23 endpoints)

Model discovery, search, comparison, trending models, provider info, HuggingFace integration, and gateway management.

Model Discovery

Endpoint Feature
GET /v1/models List all models (filter by provider, gateway, private, HuggingFace; pagination)
GET /v1/models/unique Deduplicated model list (one entry per model across providers)
GET /v1/models/search Full-text model search
GET /v1/models/trending Top models ranked by requests, tokens, users, cost, speed
GET /v1/models/low-latency Low-latency optimized models
GET /v1/models/{provider}/{model} Specific model details
GET /v1/models/{provider}/{model}/compare Compare a model across providers
GET /v1/models/{developer} Models by developer/organization
POST /v1/models/batch-compare Batch compare multiple models
GET /api/models/detail Detailed model info for frontend rendering

Modelz (Canonical Registry)

Endpoint Feature
GET /v1/modelz/models Full canonical model registry
GET /v1/modelz/ids All model IDs in the registry
GET /v1/modelz/check/{model_id} Check if a model exists and get its registry data

Providers

Endpoint Feature
GET /v1/provider List all providers with stats
GET /v1/provider/{provider_name}/stats Provider statistics (models, requests, cost)
GET /v1/provider/{provider_name}/top-models Top models for a provider
GET /v1/routers Available intelligent routing options

Gateways

Endpoint Feature
GET /v1/gateways List all registered gateways (frontend auto-discovers from this)
GET /v1/gateways/status Gateway statuses
GET /v1/gateways/summary Aggregated gateway statistics
GET /v1/gateway/{gateway}/stats Stats for a specific gateway

HuggingFace Integration

Endpoint Feature
GET /v1/huggingface/discovery Discover HuggingFace models
GET /v1/huggingface/search Search HuggingFace models
GET /v1/huggingface/author/{author}/models Models by a HuggingFace author
GET /v1/huggingface/models/{model_id}/details HuggingFace model details (downloads, likes, parameters)
GET /v1/huggingface/models/{model_id}/card Model card (README)
GET /v1/huggingface/models/{model_id}/files Model file listing

Model Health

Endpoint Feature
GET /v1/model-health All model health data
GET /v1/model-health/stats Model health statistics
GET /v1/model-health/providers Provider-level health summary
GET /v1/model-health/unhealthy Currently unhealthy models
GET /v1/model-health/{provider}/{model} Health for a specific model
GET /v1/model-health/provider/{provider}/summary Provider health summary

Rankings

Endpoint Feature
GET /ranking/models Model leaderboard with trend data (direction, percentage, logos)

Model Availability

Endpoint Feature
GET /availability/models All model availability statuses
GET /availability/model/{model_id} Availability for a specific model
GET /availability/summary Availability summary
GET /availability/status Availability system status
GET /availability/check/{model_id} Check model availability with provider details
GET /availability/fallback/{model_id} Fallback providers for a model
GET /availability/best/{model_id} Best available provider for a model
POST /availability/maintenance/{model_id} Put a model in maintenance mode
DELETE /availability/maintenance/{model_id} Remove maintenance mode
POST /availability/monitoring/start Start availability monitoring
POST /availability/monitoring/stop Stop availability monitoring

Models Catalog Management (CRUD)

Endpoint Feature
GET /catalog/models-db/ List all models in the database catalog
GET /catalog/models-db/{model_id} Get a model from the DB
GET /catalog/models-db/search Search the model catalog DB
GET /catalog/models-db/stats Catalog statistics
GET /catalog/models-db/provider/{provider_slug} Models by provider in DB
GET /catalog/models-db/health/{health_status} Models by health status in DB
GET /catalog/models-db/{model_id}/health/history Model health history
POST /catalog/models-db/bulk Bulk create models
POST /catalog/models-db/bulk-upsert Bulk upsert models
POST /catalog/models-db/upsert Upsert a single model
POST /catalog/models-db/{model_id}/activate Activate a model
POST /catalog/models-db/{model_id}/deactivate Deactivate a model
PATCH /catalog/models-db/{model_id}/health Update model health status

Providers Management (CRUD)

Endpoint Feature
GET /providers/ List all providers in DB
PATCH /providers/{provider_id} Update provider metadata
GET /providers/{provider_id}/models/stats Provider model statistics

15. Other (19 endpoints)

Registration, images, audio, tools, payments, IP whitelists, Nosana GPU, partner trials, notifications, and system utilities.

User Registration

Endpoint Feature
POST /create Create new user account ($5 initial credits, 3-day trial, welcome email)

Image Generation

Endpoint Feature
POST /v1/images/generations Generate images using AI models (Stability AI, DALL-E, etc.) with credit billing

Audio Transcription

Endpoint Feature
POST /v1/audio/transcriptions Transcribe audio files via Whisper (supports all major formats, credit billing per minute)
POST /v1/audio/transcriptions/base64 Transcribe base64-encoded audio

Server-Side Tools

Endpoint Feature
GET /v1/tools List available server-side tools
GET /v1/tools/definitions Tool definitions (for function calling)
GET /v1/tools/{tool_name} Get a specific tool's details
POST /v1/tools/execute Execute a server-side tool (TTS, calculator, code executor)
POST /v1/tools/search/augment Search augmentation tool

Payments (Stripe)

Endpoint Feature
POST /api/stripe/checkout-session Create Stripe checkout session
GET /api/stripe/checkout-session/{session_id} Get checkout session status
GET /api/stripe/credit-packages Available credit packages and pricing
POST /api/stripe/payment-intent Create payment intent
GET /api/stripe/payment-intent/{payment_intent_id} Get payment intent status
GET /api/stripe/payments List user's payment history
GET /api/stripe/payments/{payment_id} Get payment details
POST /api/stripe/refund Process a refund
GET /api/stripe/subscription Get current subscription
POST /api/stripe/subscription-checkout Create subscription checkout
POST /api/stripe/subscription/upgrade Upgrade subscription plan
POST /api/stripe/subscription/downgrade Downgrade subscription plan
POST /api/stripe/subscription/cancel Cancel subscription
POST /api/stripe/webhook Stripe webhook handler (checkout.session.completed, payment_intent.succeeded/failed, charge.refunded)

IP Allowlist Management

Endpoint Feature
POST /api/admin/ip-whitelist Create IP allowlist entry
GET /api/admin/ip-whitelist/{entry_id} Get allowlist entry
PUT /api/admin/ip-whitelist/{entry_id} Update allowlist entry
DELETE /api/admin/ip-whitelist/{entry_id} Delete allowlist entry
GET /api/admin/ip-whitelist List all allowlist entries
POST /api/admin/ip-whitelist/check Check if an IP is allowlisted

Nosana GPU Computing

Endpoint Feature
GET /nosana/config Nosana platform configuration
GET /nosana/credits/balance Nosana credit balance
GET /nosana/deployments List all Nosana deployments
GET /nosana/deployments/{deployment_id} Deployment details
POST /nosana/deployments/llm Deploy LLM inference on GPU
POST /nosana/deployments/image-generation Deploy image generation on GPU
POST /nosana/deployments/whisper Deploy Whisper transcription on GPU
POST /nosana/deployments/{deployment_id}/start Start a deployment
POST /nosana/deployments/{deployment_id}/stop Stop a deployment
POST /nosana/deployments/{deployment_id}/archive Archive a deployment
POST /nosana/deployments/{deployment_id}/revisions Create a deployment revision
PATCH /nosana/deployments/{deployment_id}/replicas Update replica count
GET /nosana/markets List GPU markets
GET /nosana/markets/{market_id} Market details
GET /nosana/markets/{market_id}/resources Market resource requirements
POST /nosana/jobs Create a new GPU job
GET /nosana/jobs/{job_address} Job details
POST /nosana/jobs/{job_address}/extend Extend job duration
POST /nosana/jobs/{job_address}/stop Stop a job

Partner Trials

Endpoint Feature
GET /partner-trials/config/{partner_code} Partner trial configuration
GET /partner-trials/check/{code} Check if partner code is valid
GET /partner-trials/status Current user's partner trial status
GET /partner-trials/daily-limit Partner trial daily limit info
GET /partner-trials/analytics/{partner_code} Partner trial analytics
POST /partner-trials/start Start a partner trial (e.g., Redbeard 14-day Pro)
POST /partner-trials/expire/{target_user_id} Force-expire a partner trial

Provider Credit Monitoring

Endpoint Feature
GET /api/provider-credits/balance All upstream provider credit balances
GET /api/provider-credits/balance/{provider} Specific provider credit balance

Notifications (User)

Endpoint Feature
GET /user/notifications/preferences Get notification preferences
POST /user/notifications/send-usage-report Send usage report email
POST /user/notifications/test Send test notification

System Utilities

Endpoint Feature
GET /ping System ping (pong response with uptime)
GET /ping/stats Ping statistics
GET /sentry-debug Test Sentry error tracking integration
GET /velocity-mode-status Security velocity mode status
GET / Root endpoint (API info)

System & Cache Management

Endpoint Feature
GET /health/gateways/optimized Optimized gateway health
GET /health/models/optimized Optimized model health
GET /health/providers/optimized Optimized provider health
GET /health/dashboard/optimized Optimized dashboard data
POST /admin/cache/refresh/{gateway} Refresh cache for a specific gateway
POST /admin/cache/clear Clear all caches
GET /admin/cache/status Cache status
GET /admin/cache/debouncer/stats Cache debouncer stats
GET /admin/cache/warmer/stats Cache warmer stats
GET /admin/cache/modelz/status Modelz cache status
POST /admin/cache/modelz/refresh Refresh modelz cache
DELETE /admin/cache/modelz/clear Clear modelz cache
POST /admin/cache/pricing/refresh Refresh pricing cache
POST /admin/api/cache/invalidate Invalidate specific cache entry

16. Status (2 endpoints)

Public-facing status page for model and provider availability (no authentication required).

Endpoint Feature
GET /v1/status/ Overall system status
GET /v1/status/detailed Detailed status with provider breakdown
GET /v1/status/providers Provider availability statuses
GET /v1/status/models Model availability statuses
GET /v1/status/models/{provider}/{model_id} Specific model status
GET /v1/status/incidents Recent incidents
GET /v1/status/uptime/{provider}/{model_id} Model uptime history
GET /v1/status/search Search models on status page
GET /v1/status/stats Status page statistics

17. Users (8 endpoints)

User profile, balance, plan, rate limits, activity, and account management.

Endpoint Feature
GET /user/balance Current credit balance and status
GET /user/monitor User's own usage monitoring data
GET /user/rate-limits User's rate limit configuration and current usage
GET /user/profile User profile data
PUT /user/profile Update user profile
GET /user/plan Current subscription plan
GET /user/plan/entitlements Plan entitlements (what the plan includes)
GET /user/plan/usage Plan usage vs limits
GET /user/limit Daily spending limit
GET /user/credit-transactions Credit transaction history
GET /user/environment-usage Usage by environment (live/test/staging/dev)
GET /user/cache-settings User's cache settings
GET /user/activity/stats Activity statistics
GET /user/activity/log Activity log
DELETE /user/account Delete user account

API Key Management

Endpoint Feature
POST /user/api-keys Create a new API key (name, environment tag, scope permissions)
GET /user/api-keys List all API keys
GET /user/api-keys/usage API key usage statistics
GET /user/api-keys/audit-logs API key audit logs
PUT /user/api-keys/{key_id} Update API key (name, active status)
DELETE /user/api-keys/{key_id} Delete an API key

Rate Limits (User)

Endpoint Feature
GET /user/rate-limits/usage/{key_id} Rate limit usage for a specific key
PUT /user/rate-limits/{key_id} Update rate limit for a key
POST /user/rate-limits/bulk-update Bulk update rate limits

Plans & Trials

Endpoint Feature
GET /plans List all available subscription plans
GET /plans/{plan_id} Plan details
GET /subscription/plans Subscription plans (alternate path)
GET /trial/status Current trial status

Referrals

Endpoint Feature
GET /referral/code Get user's referral code
POST /referral/generate Generate a new referral code
GET /referral/stats Referral statistics (total, conversion rate, rewards)
POST /referral/validate Validate and apply a referral code

Transaction Analytics

Endpoint Feature
GET /analytics/transactions Transaction analytics data
GET /analytics/transactions/summary Transaction summary

Feature Summary

Category Endpoints Key Capabilities
Admin 46 User management, credits, caches, model sync, rate limits, RBAC, trials, downtime, coupons
Analytics 5 Server-side event forwarding (Statsig + PostHog)
Authentication 5 Multi-method auth (Privy, Google, GitHub, email, wallet)
Chat & Messaging 20 OpenAI/Anthropic inference, sessions, history, feedback, sharing, AI SDK
Circuit Breakers 4 Provider circuit breaker monitoring and reset
Code Router 5 Benchmark-driven code model selection (SWE-bench)
Coupons 3 User coupon redemption
Credits 6 Balance, transactions, add/adjust/refund
Diagnostics 2 Concurrency and provider timing diagnostics
Error Monitoring 12 Autonomous error detection, AI fix generation
General Router 4 ML-powered model selection (NotDiamond)
Health & Monitoring 30 Multi-tier health: system, providers, models, gateways
Metrics & Observability 6 Prometheus, Grafana, OpenTelemetry, Loki, Tempo
Models & Catalog 23 Discovery, search, compare, trending, HuggingFace, availability
Other 19 Images, audio, tools, Stripe, IP allowlists, Nosana GPU, partner trials
Status 2 Public status page
Users 8 Profile, balance, plan, rate limits, API keys, referrals
Total 450

Source: API Mappings Wiki | Conceptual Model | Developer Wiki

Clone this wiki locally