An intelligent LLM inference gateway that dynamically routes user queries to optimal model tiers (Llama-3.1 8B/70B) based on real-time complexity, reasoning depth, and ambiguity analysis.
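A gateway like this could score each incoming query and pick a model tier before forwarding it. The sketch below is a hypothetical illustration only: the scoring signals, thresholds, and model identifiers are assumptions, not the repository's actual implementation.

```python
import re

# Hypothetical model tiers; names are illustrative assumptions.
TIERS = {"small": "llama-3.1-8b", "large": "llama-3.1-70b"}

# Crude cues that a query demands deeper reasoning (assumed heuristic).
REASONING_CUES = re.compile(
    r"\b(why|prove|derive|compare|trade-?offs?|step[- ]by[- ]step|explain)\b", re.I
)

def complexity_score(query: str) -> float:
    """Estimate query complexity from length and reasoning cues (0.0-1.0)."""
    words = len(query.split())
    score = min(words / 100.0, 1.0)  # longer queries tend to be harder
    if REASONING_CUES.search(query):
        score += 0.4  # explicit reasoning demands push toward the big model
    return min(score, 1.0)

def route(query: str, threshold: float = 0.5) -> str:
    """Send high-complexity queries to the 70B tier, the rest to 8B."""
    return TIERS["large"] if complexity_score(query) >= threshold else TIERS["small"]
```

In practice a production router would likely replace these keyword heuristics with a learned classifier or an embedding-based scorer, but the tiering decision has the same shape: score, compare to a threshold, dispatch.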
Real-time API latency monitor for LLM providers - track OpenAI, Anthropic, Google, Azure response times
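The core of such a monitor is timing repeated requests against each provider endpoint and summarizing the samples. A minimal sketch, assuming the caller supplies a probe callable (e.g. a minimal chat-completion request); the providers and endpoints themselves are outside this sketch.

```python
import time
import statistics
from typing import Callable

def measure_latency(probe: Callable[[], None], samples: int = 5) -> dict:
    """Time repeated calls to a provider probe and summarize in milliseconds.

    `probe` is any zero-argument callable that issues one request to a
    provider endpoint; which provider it hits is an assumption left to
    the caller.
    """
    timings = []
    for _ in range(samples):
        start = time.perf_counter()
        probe()
        timings.append((time.perf_counter() - start) * 1000.0)  # ms
    return {
        "min_ms": min(timings),
        "p50_ms": statistics.median(timings),
        "max_ms": max(timings),
    }
```

A real tracker would run these probes on a schedule per provider (OpenAI, Anthropic, Google, Azure) and record the summaries to a time-series store for dashboarding.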