You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
Explore and expose additional Prometheus metrics from the Semantic Router to describe workload characteristics (prompt/response sizes, category distribution, cache hit ratio) and backend load (endpoint utilization, token throughput, TTFT, TPOT). These metrics enable the control plane to implement algorithms that adapt router configs dynamically for latency, accuracy, and cost objectives.