Skip to content

Latest commit

 

History

History
295 lines (206 loc) · 23.8 KB

distributed-tracing-conventions.md

File metadata and controls

295 lines (206 loc) · 23.8 KB
title permalink keywords folder sidebar
Distributed Tracing Conventions
distributed_tracing_conventions.html
opentelemetry conventions
general
general_sidebar

Conventions are the contract between Azure SDK and tracing providers such as Azure Monitor, Jaeger, and others. They describe and standardize attributes, events and relationships for common span types: HTTP, DB, messaging and others. Observability vendors use conventions to build visualizations and may be very sensitive to them. Custom Azure SDK conventions are described in Azure SDK semantic conventions section below.

When writing instrumentation in Azure SDK or Core:

{% include requirement/MUST id="general-tracing-convention-use-otel" %} use existing OpenTelemetry or Azure SDK semantic conventions whenever possible.

{% include requirement/MUST id="general-tracing-convention-describe-attributes" %} update Azure SDK semantic conventions when adding new Azure-specific attributes.

{% include requirement/MUST id="general-tracing-convention-attribute-naming" %} follow OpenTelemetry attribute naming conventions and use the az.{service-family}. prefix when adding new Azure-specific attributes.

{% include requirement/MUST id="general-tracing-convention-set-schema-version" %} set OpenTelemetry Schema URL when creating OpenTelemetry tracer.

{% include requirement/SHOULD id="general-tracing-convention-set-library" %} set Azure Client Library name and version Schema URL when creating OpenTelemetry tracer.

{% include requirement/SHOULD id="general-tracing-convention-new-otel" %} contribute new conventions (or patch existing ones) to OpenTelemetry when there is no suitable one or some scenarios are missing.

Azure SDK semantic conventions

Azure SDK support distributed tracing with OpenTelemetry. Azure client libraries do not report metrics at the moment.

Semantic conventions for Azure SDK spans

Status: Mixed

This document describes tracing semantic conventions adopted by Azure SDK. Azure client libraries are instrumented natively, so the instrumentation code is a part of each library (but depending on the languages, you may need to install a plugin to enable collection with OpenTelemetry). Check out tracing documentation for your language to get more details.

The Azure SDK produces spans for public API calls and nested HTTP client spans. AMQP transport-level calls are not traced.

Versioning

Azure client libraries follow OpenTelemetry semantic conventions when applicable, but adopt new versions of conventions at their own pace. Telemetry consumers MAY use SchemaUrl to detect which version of semantic conventions are emitted by Azure client libraries.

Public API calls

Status: Stable

Azure SDK SHOULD create a span for each call to public APIs that involve communication with Azure services.

  • Spans representing public API calls SHOULD have names following the client.method pattern and are language-specific. In cases where OpenTelemetry defines a semantic convention for a specific span name (for example, in messaging or database conventions), the standard OpenTelemetry name SHOULD be used instead.
  • For HTTP-based client libraries, public API spans SHOULD have kind of INTERNAL.

See Messaging section below and CosmosDB conventions for non-HTTP semantics.

API-level spans produced by Azure SDK have the following attributes:

Attribute Type Description Examples Requirement Level
az.namespace string Namespace of Azure service request is made against. [1] Microsoft.Storage; Microsoft.KeyVault; Microsoft.ServiceBus Required
az.schema_url string OpenTelemetry Schema URL including schema version. Only 1.23.0 is supported. https://opentelemetry.io/schemas/1.23.0 Conditionally Required: [2]
error.type string Describes a class of error the operation ended with. [3] java.net.UnknownHostException; System.Threading.Tasks.OperationCanceledException; azure.core.exceptions.ServiceRequestError Recommended

[1]: This SHOULD be set as an instrumentation scope attribute when creating a Tracer as long as OpenTelemetry in a given language allows to do so.

[2]: if and only if OpenTelemetry in a given language doesn't provide a standard way to set schema_url (.NET)

[3]: The error.type SHOULD be predictable and SHOULD have low cardinality. Instrumentations SHOULD document the list of errors they report.

The cardinality of error.type within one instrumentation library SHOULD be low. Telemetry consumers that aggregate data from multiple instrumentation libraries and applications should be prepared for error.type to have high cardinality at query time when no additional filters are applied.

If the operation has completed successfully, instrumentations SHOULD NOT set error.type.

If a specific domain defines its own set of error identifiers (such as HTTP or gRPC status codes), it's RECOMMENDED to:

  • Use a domain-specific attribute
  • Set error.type to capture all errors, regardless of whether they are defined within the domain-specific set or not.

error.type has the following list of well-known values. If one of them applies, then the respective value MUST be used, otherwise a custom value MAY be used.

Value Description
_OTHER A fallback error value to be used when the instrumentation doesn't define a custom value.

HTTP Client spans

Status: Stable

Azure SDK implements a valid subset of stable part of OpenTelemetry HTTP spans conventions and create a span per HTTP call (attempt).

Attribute Type Description Examples Requirement Level
az.client_request_id string Value of the [x-ms-client-request-id] header (or other request-id header, depending on the service) sent by the client. eb178587-c05a-418c-a695-ae9466c5303c Conditionally Required: only if present.
az.namespace string Namespace of Azure service request is made against. [1] Microsoft.Storage; Microsoft.KeyVault; Microsoft.ServiceBus Required
az.schema_url string OpenTelemetry Schema URL including schema version. Only 1.23.0 is supported. https://opentelemetry.io/schemas/1.23.0 Conditionally Required: [2]
az.service_request_id string Value of the [x-ms-request-id] header (or other request-id header, depending on the service) sent by the server in response. 3f828ae5-ecb9-40ab-88d9-db0420af30c6 Conditionally Required: if and only if one was received
error.type string Describes a class of error the operation ended with. [3] timeout; java.net.UnknownHostException; server_certificate_invalid; 500 Conditionally Required: If request has ended with an error.
http.request.method string HTTP request method. [4] GET; POST; HEAD Required
http.request.resend_count int The ordinal number of request resending attempt (for any reason, including redirects). [5] 3 Recommended
http.response.status_code int HTTP response status code. 200 Conditionally Required: If and only if one was received/sent.
server.address string Host identifier of the "URI origin" HTTP request is sent to. [6] example.com; 10.1.2.80; /tmp/my.sock Required
server.port int Port identifier of the "URI origin" HTTP request is sent to. [7] 80; 8080; 443 Required
url.full string Absolute URL describing a network resource according to RFC3986 [8] https://www.foo.bar/search?q=OpenTelemetry#SemConv; //localhost Recommended

[1]: This SHOULD be set as an instrumentation scope attribute when creating a Tracer as long as OpenTelemetry in a given language allows to do so.

[2]: if and only if OpenTelemetry in a given language doesn't provide a standard way to set schema_url (.NET)

[3]: If the request fails with an error before response status code was sent or received, error.type SHOULD be set to a component-specific low cardinality error identifier or the fully-qualified type of exception.

If response status code was sent or received and status indicates an error according to HTTP span status definition, error.type SHOULD be set to a component-specific low cardinality error identifier or the fully-qualified type of exception.

The error.type value SHOULD be predictable and SHOULD have low cardinality. Instrumentations SHOULD document the list of errors they report.

The cardinality of error.type within one instrumentation library SHOULD be low, but telemetry consumers that aggregate data from multiple instrumentation libraries and applications should be prepared for error.type to have high cardinality at query time, when no additional filters are applied.

If the request has completed successfully, instrumentations SHOULD NOT set error.type.

[4]: Azure client libraries support HTTP methods listed in RFC9110 and the `PATCH`` method defined in RFC5789

[5]: The resend count SHOULD be updated each time an HTTP request gets resent by the client, regardless of what was the cause of the resending (e.g. redirection, authorization failure, 503 Server Unavailable, network issues, or any other).

[6]: Describes host component of Azure service endpoint.

[7]: Describes port component of Azure service endpoint.

[8]: For network calls, URL usually has scheme://host[:port][path][?query][#fragment] format, where the fragment is not transmitted over HTTP, but if it is known, it SHOULD be included nevertheless. url.full MUST NOT contain credentials passed via URL in form of https://username:password@www.example.com/. In such case username and password SHOULD be redacted and attribute's value SHOULD be https://REDACTED:REDACTED@www.example.com/. url.full SHOULD capture the absolute URL when it is available (or can be reconstructed) and SHOULD NOT be validated or modified except for sanitizing purposes.

The following attributes can be important for making sampling decisions and SHOULD be provided at span creation time (if provided at all):

Instrumentation supports W3C Trace context propagation and Azure legacy correlation protocols. Propagator configuration is not supported.

Messaging libraries

Status: Experimental

Messaging span semantics apply to Azure Event Hubs and Service Bus and follow OpenTelemetry Messaging spans conventions v1.23.0.

Azure SDK will update messaging semantic conventions as messaging specification evolves.

Messaging libraries produce three kinds of spans:

  • PRODUCER - describes message creation and associates unique context with the message to trace them when they are sent in batches.
  • CLIENT - describes message (or batch) publishing.
    • It has links pointing to each message being sent.
  • CONSUMER - describes message (or batch) processing.
    • It is created when user leverages handler APIs that wrap message or batch processing.
    • Processing span has links to each message being processed (when context is present).

Messaging attributes

Attribute Type Description Examples Requirement Level
az.namespace string Namespace of Azure service request is made against. [1] Microsoft.Storage; Microsoft.KeyVault; Microsoft.ServiceBus Required
az.schema_url string OpenTelemetry Schema URL including schema version. Only 1.23.0 is supported. https://opentelemetry.io/schemas/1.23.0 Conditionally Required: [2]
messaging.batch.message_count int The number of messages sent, received, or processed in the scope of the batching operation. [3] 0; 1; 2 Recommended
messaging.destination.name string The message destination name [4] MyQueue; MyTopic Recommended
messaging.message.id string A value used by the messaging system as an identifier for the message, represented as a string. 452a7c7c7c7048c2f887f61572b18fc2 Recommended
messaging.operation string A string identifying the kind of messaging operation. [5] publish Recommended
messaging.system string A string identifying the messaging system. eventhubs; servicebus Recommended
server.address string Server domain name if available without reverse DNS lookup; otherwise, IP address or Unix domain socket name. [6] example.com; 10.1.2.80; /tmp/my.sock Recommended
server.port int Server port number. [7] 80; 8080; 443 Recommended

[1]: This SHOULD be set as an instrumentation scope attribute when creating a Tracer as long as OpenTelemetry in a given language allows to do so.

[2]: if and only if OpenTelemetry in a given language doesn't provide a standard way to set schema_url (.NET)

[3]: Instrumentations SHOULD NOT set messaging.batch.message_count on spans that operate with a single message. When a messaging client library supports both batch and single-message API for the same operation, instrumentations SHOULD use messaging.batch.message_count for batching APIs and SHOULD NOT use it for single-message APIs.

[4]: Destination name SHOULD uniquely identify a specific queue, topic or other entity within the broker. If the broker doesn't have such notion, the destination name SHOULD uniquely identify the broker.

[5]: If a custom value is used, it MUST be of low cardinality.

[6]: When observed from the client side, and when communicating through an intermediary, server.address SHOULD represent the server address behind any intermediaries, for example proxies, if it's available.

[7]: When observed from the client side, and when communicating through an intermediary, server.port SHOULD represent the server port behind any intermediaries, for example proxies, if it's available.

The following attributes can be important for making sampling decisions and SHOULD be provided at span creation time (if provided at all):

Library-specific attributes

In addition to common attributes listed in the Public API calls section, certain Azure client libraries in .NET emit other library-specific attributes.

Azure Application Configuration attributes

Attribute Type Description Examples Requirement Level
az.appconfiguration.key string Value of the Azure Application Configuration property key. AppName:Service1:ApiEndpoint Recommended

Azure Cognitive Language Question Answering SDK attributes

Attribute Type Description Examples Requirement Level
az.cognitivelanguage.deployment.name string Name of the Azure Questions Answering deployment. production Recommended
az.cognitivelanguage.project.name string Name of the Azure Questions Answering project. production Recommended

Azure Digital Twins attributes

Attribute Type Description Examples Requirement Level
az.digitaltwins.component.name string The name of the digital twin component. thermostat Recommended
az.digitaltwins.event_route.id string The event route identifier used by the digital twin. 6f8741b1 Recommended
az.digitaltwins.job.id string Digital twin job id. test-job Recommended
az.digitaltwins.message.id string A unique message identifier (in the scope of the digital twin id) used to de-duplicate telemetry messages. a40896c5ab954ab1 Recommended
az.digitaltwins.model.id string The digital twin model id. dtmi:example:Room23;1 Recommended
az.digitaltwins.query string Digital twin graph query. SELECT * FROM DIGITALTWINS WHERE Name = "DSouza" Recommended
az.digitaltwins.relationship.name string The name of the relationship between twins. contains Recommended
az.digitaltwins.twin.id string The unique identifier of the digital twin. edf41622 Recommended

Azure KeyVault attributes

Azure KeyVault Certificates attributes

Attribute Type Description Examples Requirement Level
az.keyvault.certificate.issuer.name string Azure KeyVault certificate version. issuer01 Recommended
az.keyvault.certificate.name string Azure KeyVault certificate name. selfSignedCert01 Recommended
az.keyvault.certificate.version string Azure KeyVault certificate version. c3d31d7b36c942ad83ef36fc0785a4fc Recommended

Azure KeyVault Keys attributes

Attribute Type Description Examples Requirement Level
az.keyvault.key.id string Azure KeyVault key id (full URL). "https://myvault.vault.azure.net/keys/CreateSoftKeyTest/78deebed173b48e48f55abf87ed4cf71 Recommended
az.keyvault.key.name string Azure KeyVault key name. test-key Recommended
az.keyvault.key.version string Azure KeyVault key version. 3d31e6e5c4c14eaf9be8d42c00225088 Recommended

Azure KeyVault Secrets attributes

Attribute Type Description Examples Requirement Level
az.keyvault.secret.name string Azure KeyVault secret name. test-secret Recommended
az.keyvault.secret.version string Azure KeyVault secret version. 4387e9f3d6e14c459867679a90fd0f79 Recommended

Azure Mixed Reality Remote Rendering attributes

Attribute Type Description Examples Requirement Level
az.remoterendering.conversion.id string A conversion id uniquely identifying the conversion for the given Azure Remote Rendering account. contoso-conversion-6fae2bfb754e Recommended
az.remoterendering.session.id string A session id uniquely identifying the conversion for the given Azure Remote Rendering account. contoso-session-8c28813adc28 Recommended