Skip to content

feat: add Prometheus metrics and Grafana dashboard for observability#7

Merged
alez007 merged 2 commits intomainfrom
feat/prometheus-metrics
Apr 7, 2026
Merged

feat: add Prometheus metrics and Grafana dashboard for observability#7
alez007 merged 2 commits intomainfrom
feat/prometheus-metrics

Conversation

@alez007
Copy link
Copy Markdown
Owner

@alez007 alez007 commented Apr 6, 2026

What

Add yasha:* custom metrics (request latency, errors, model load time, per-usecase timing, client disconnects, cleanup errors) via ray.serve.metrics, gated behind YASHA_METRICS env var with zero-overhead no-op stubs when disabled. Include pre-built Grafana dashboard, /health endpoint, and documentation.

Why

Support for prometheus metrics

How to Test

Run tests

Checklist

  • ruff check . passes
  • ruff format --check . passes
  • pyright passes

Alex M added 2 commits April 6, 2026 19:17
Add yasha:* custom metrics (request latency, errors, model load time,
per-usecase timing, client disconnects, cleanup errors) via ray.serve.metrics,
gated behind YASHA_METRICS env var with zero-overhead no-op stubs when disabled.
Include pre-built Grafana dashboard, /health endpoint, and documentation.
… Ray

All metrics exported via Ray's metrics agent are prefixed with ray_,
but the Grafana dashboard and docs referenced unprefixed names. This
updates all queries to use the actual exported names (ray_yasha_*,
ray_vllm_*, ray_serve_*).

- Route vLLM native metrics through Ray via RayPrometheusStatLogger
- Fix Ray Serve Internals panels to use metrics that exist in Ray 2.54
- Fix model load time panel to work for one-shot events (avg not rate)
- Enable YASHA_METRICS=true by default in Dockerfiles and metrics.py
- Expose port 8079 in devcontainer config
- Update monitoring.md to reflect all metric name prefixes
@alez007 alez007 merged commit 135076f into main Apr 7, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant