Replies: 1 comment
-
|
Update: RFC-0001 v0.4 is now live 🎉 Just shipped v0.4 with all upstream feedback incorporated:
Full spec: https://github.com/sauravGit/open-llm-observability/blob/main/RFC.md Would love continued feedback, especially on the streaming metrics and cost estimation approach! |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi everyone,
I'm proposing a vendor-neutral, OpenTelemetry-compatible semantic convention for LLM observability — and I'd love early feedback from this community.
The problem
Every LLM platform and observability tool today uses different field names, KPI definitions, and export formats. Developers who want end-to-end visibility across multiple providers or backends have to re-instrument every time.
What I'm proposing
A canonical schema — built on top of OpenTelemetry semantic conventions — that standardizes:
gen_ai.latency,gen_ai.usage.cost,gen_ai.time_to_first_token)The mandatory core covers: latency, tokens, cost, errors, retries, rate limits, and trace coverage. An optional domain extension layer covers quality signals like groundedness, relevance, and safety flags.
Why OpenTelemetry
OTEL is already the standard for distributed tracing and metrics. Defining LLM semantic conventions on top of it means any OTEL-compatible backend works without additional integration: Prometheus, Grafana, Datadog, GCP, Honeycomb, etc.
Full RFC
See RFC.md in this repo for the full v0.1 spec.
What I'm asking for
gen_ai.*) clear and unambiguous?Thanks — looking forward to the feedback.
Beta Was this translation helpful? Give feedback.
All reactions