Skip to content

Aitra Meter v0.2.1

Choose a tag to compare

@stevenphtan stevenphtan released this 15 Jun 02:53
· 43 commits to main since this release
88fcb54

generic-prometheus inference provider — compatible with TGI, SGLang, Ollama, and Triton. Metric names and model label are configurable via environment variables. Implements the full InferenceMetricsProvider interface (ADR 0004).

Added

  • internal/provider/inference/genericprometheus/ — full implementation with configurable metric names
  • Table-driven tests covering metric scraping and model label resolution