Add MetricProducer to allow sdk.MeterProviders to incorporate metric data from third-party sources #2722

dashpole · 2022-08-10T21:23:49Z

Part of #1175

Related to #2730

Required for #2732

Changes

Add a MetricProducer interface to the metrics SDK. It is an optional argument when creating a MetricReader meant to support non-SDK sources of metric data.

Metric data from a MetricProducer is not subject to the MeterProviders or MetricReaders configuration for instruments. This means Views on MeterProviders, and default Aggregations and Aggregation Temporalities configured on MetricReaders are not applied to data from MetricProducers.

This is needed to support an OpenCensus metric bridge, which is proposed separately. It could also be used to support other bridges as well, such as a Prometheus bridge.

Discussion from the spec sig on 8/30:

@dashpole presented high level design options:

Bridge incorporated into MeterProvider
Bridge separate from MeterProvider, and works with a MetricReader
Bridge separate from MeterProvider and MetricReader, works with MetricExporter

3 was eliminated because it won't work easily with pull-based exporters. The overall sentiment was to stick with option 1, because then resource and exporter configuration for the MeterProvider would be shared. See #2722 (comment) for more details.

specification/metrics/sdk.md

pirgeo

It seems to me that OpenCensus is currently the only use case (but please correct me if I am wrong). I am not familiar with OpenCensus, but I know from Micrometer that there are definitely Metrics APIs that allow plugging a different SDK. I wonder if it is possible to wrap the OpenTelemetry instruments in the OpenCensus instruments so that their usage doesn't change, but they would use the OTel SDK nonetheless.

specification/metrics/sdk.md

dashpole · 2022-08-17T21:01:44Z

I wonder if it is possible to wrap the OpenTelemetry instruments in the OpenCensus instruments so that their usage doesn't change, but they would use the OTel SDK nonetheless.

Unfortunately, I don't think that is feasible for OpenCensus. The instrumentation API for OpenCensus is quite different from OTel's. You just call Record(number), and then decide what instrument to use in the OpenCensus View. The other challenge is that you can't substitute the SDK in OpenCensus with a different implementation (e.g. OTel).

Although I haven't prototyped it out, I believe this model (bridge implements MetricProducer) would also work well for a prometheus bridge. In Go, I can Gather metrics from a prometheus registry (~SDK), and could convert to an OpenTelemetry batch of metrics relatively easily.

jmacd · 2022-08-17T23:02:30Z

As I read this PR, a MetricProducer could be bound to just one MetricExporter. Does the MetricProducer user lose access to multi-exporter support?

Thinking about how I'd like to see a bridge to OpenCensus (or Prometheus), I'd like to see a single SDK (with multiple Meters) co-exist with MetricProducers so that a single SDK configuration could include bridged-metrics AND there would be just one export call across the whole SDK+bridges, one gRPC connection, etc.

To me, this suggests integrating a different sort of Producer than the one specified here, which essentially looks like a replacement of the SDK. Instead, can we replace a Meter with a producer for a single Scope? In addition to bridging metrics from OpenCensus and Prometheus, this sort of back-door allows users to work around all the current limitations in OpenTelemetry API. For example, asynchronous histogram does not exist? Just create a MetricProducer and emit some histogram points (e.g., I might use a single-scope MetricProducer to output https://pkg.go.dev/runtime/metrics (issue).

dashpole · 2022-08-18T02:24:41Z

As I read this PR, a MetricProducer could be bound to just one MetricExporter. Does the MetricProducer user lose access to multi-exporter support?

That wasn't my intention. A MetricReader can have a single MetricProducer, and a single MetricExporter. But, like a MeterProvider can have multiple MetricReaders, a MetricProducer can be registered with many MetricReaders, each with an exporter. Let me know if I can make that clearer.

I'll chew on the other feedback and see what I can come up with.

specification/metrics/sdk.md

dashpole · 2022-08-18T20:07:51Z

@jmacd re: #2722 (comment)

I think it should be possible to achieve your first ask. Instead of registering a MetricProducer with a MetricReader, we could pass a MetricProducer to a MeterProvider. The MeterProvider would add the bridge's metrics to metrics from OTel instruments' metrics, and provide a single batch of metrics to exporters. This would have the added benefit of simplifying the user experience, since they don't need to create multiple readers (one for the SDK, one for the bridge).

For the second section, I think the fundamental ask is to reduce the scope of the interface from a full batch of metrics (i.e. one ResourceMetrics in Go) to metrics for a single scope (i.e. []Metrics in Go). Doing that would also simplify the user experience, since the Resource used by the MeterProvider will be automatically used for bridged metrics, rather than needing to be supplied separately.

I've made those updates in the latest commit.

Co-authored-by: jack-berg <34418638+jack-berg@users.noreply.github.com>

specification/metrics/sdk.md

Co-authored-by: Andrew Hayworth <ahayworth@gmail.com>

bogdandrutu

Few things:

This PR title should be updated that this works on the sdk.MeterProvider not on the api, because we don't want to expose our data model implementation to the API layer.
For me it does not make too much sense to have MeterProvider support additional MetricProducers. I think I see MeterProvider as any other MetricProducer, so the MetricReader that we have should be able to read from any MetricProducer including our MeterProvider. Then the MetricProducer becomes the interface that any source (meter provider is one) can implement to connect with the export pipeline lead by the MetricReader.

If this was discussed, please document in the PR description for new reviewers to understand.

dashpole · 2022-09-19T18:13:20Z

Thanks @bogdandrutu. I updated the title. I also pasted the notes from the 8/30 discussion in the PR description where we discussed the high-level design.

bogdandrutu · 2022-09-19T18:25:27Z

My vote is to support Option 2:

Hard to explain that views are not there. We clearly separate that views are only for sdk.MeterProvider.
exporter configuration for the MeterProvider would be shared.
- This is true for the option 2 as well, since the MetricReader may be shared, isn't it?
MeterProvider will implement the same interface as any other producer.
Indeed user must configure Resource maybe twice, but that is easier in my opinion compare with explaining that the resource will be overwritten. (see "Resource MUST be provided by the MeterProvider, and any resource information provided by the bridge MUST be overridden.")

dashpole · 2022-09-19T19:10:14Z

My vote is to support Option 2:

Hard to explain that views are not there. We clearly separate that views are only for sdk.MeterProvider.

Agreed. That was one of the downsides discussed. But given that we already have to explain that aggregation/ aggregation temporality configuration doesn't apply, it seems reasonable.

exporter configuration for the MeterProvider would be shared.

This is true for the option 2 as well, since the MetricReader may be shared, isn't it?

You are correct that you can share the metric readers still with option 2. It just means you have to pass the metric readers (and resource) as options to both the sdk and to bridges separately.

MeterProvider will implement the same interface as any other producer.

This is still possible with the current design, but I considered it an implementation detail.

Indeed user must configure Resource maybe twice, but that is easier in my opinion compare with explaining that the resource will be overwritten. (see "Resource MUST be provided by the MeterProvider, and any resource information provided by the bridge MUST be overridden.")

This may only apply to java (see #2722 (comment)), but I haven't checked other languages. The benefit of this is that you always end up grouping the metrics together under a single resource, which was important, if I'm remembering the discussion correctly.

jmacd · 2022-09-21T22:38:15Z

specification/metrics/sdk.md

+`InstrumentationScope()`, and with the `MeterProvider`'s configured resource if
+it is possible for those to conflict.


What does "if it is possible for those to conflict." mean?

I meant it to mean: "If a batch of metric points from a MetricProducer can conflict with the MeterProvider's resource". This would be possible if a "batch of metric points" includes resource information. That wouldn't be the case in Go, but would be in Java given their current definition of "batch of metric points".

I could remove "if it is possible for those to conflict", or I could change it to "...configured resource. If a resource or scope is already present in a batch of metric points from a MetricProducer, that MUST be overridden with the MetricProducer's Instrumentation Scope and the MeterProvider's resource".

Thanks. I see this was also discussed in #2722 (comment). Now that I understand the Java SDK has Resource in its data object, I understand this point, which helps explain @bogdandrutu's remarks in #2722 (comment).

jmacd · 2022-09-22T18:26:24Z

It appears there is lingering debate over how to require that Resources are-or-are-not-definitely-the-same between the bridged metrics and those from other Meters (and the same concern for Scopes, practically speaking). Here's what is written in the Resource SDK spec

When used with distributed tracing, a resource can be associated 
with the [TracerProvider](https://github.com/open-telemetry/opentelemetry-specification/blob/main/specification/trace/api.md#tracerprovider) 
when the TracerProvider is created. That association cannot be changed later. 
When associated with a TracerProvider, all Spans produced by any Tracer 
from the provider MUST be associated with this Resource.

Analogous to distributed tracing, when used with metrics,
a resource can be associated with a `MeterProvider`.
When associated with a [`MeterProvider`](../metrics/api.md#meterprovider),
all metrics produced by any `Meter` from the provider will be
associated with this `Resource`.

It's that "analogous" MUST we are defending here. It appears the stable Java SDK has a loophole, and now we have wrinkles in the specification because of it. I recommend @dashpole remove all the wrinkle about Resources "if it is possible for those to conflict", leave the specification stating that each producer has an instrumentation scope, and let the Java SDK document its loophole (i.e., it should document that users are required to use the same resource twice or else be out-of-spec).

I expect that to make the Producer interface the same as the internal representation of a Meter has; I expect that means the Producer will be invoked to produce a batch of Metric objects just as one Meter would do; therefore I expect the MetricProducer to produce the same data objects that any other Meter would, because the Scope and Resource are already determined at that level in the data hierarchy.

@bogdandrutu can you clarify if whether you have any reservations not covered above?

jmacd · 2022-09-22T18:33:40Z

MetricProducer to produce the same data objects that any other Meter would, because the Scope and Resource are already determined at that level in the data hierarchy.

p.s., I'm not opposed to allowing MetricProducers to return more than one scope. A MetricProducer returning multiple Scopes equals multiple single-Scope MetricProducers, IMO, what's important to me is that a MetricProducer doesn't require the MetricReader to sort data points by Scope to successfully export the data.

jmacd · 2022-09-26T16:12:06Z

Would @ahayworth @pirgeo @MadVikingGod and @bogdandrutu in particular please read through the above dialog and see if you agree to approve the PR in its current form? The four of you have commented without approving.

Recall that this is absolutely a barrier to finishing the OTel-Go OpenCensus bridge, which is at least partlyt responsible for stalling development on Collector observability improvements. Thanks.

ahayworth · 2022-09-26T16:40:21Z

Would @ahayworth [...] please read through the above dialog and see if you agree to approve the PR in its current form? The four of you have commented without approving.

I have no strong feelings on the PR; I happened to notice a typo previously (which was the only thing in my comment). ❤️

MadVikingGod · 2022-09-26T16:55:48Z

specification/metrics/sdk.md

+If a meter is created which produces an
+[`InstrumentationScope`](../glossary.md#instrumentation-scope), which matches
+the InstrumentationScope of a [MetricProducer](#metricproducer), or if multiple
+[MetricProducers](#metricproducer) have the same InstrumentationScope the SDK
+SHOULD emit a warning.


This seems like 2 separate warnings/errors.

If an instrument is created, after applying views, such that it's scope conflicts with a MetricProducer. If we create a meter that conflicts but never create an instrument that does, because of scope rename or not creating any instruments, is there an warning?

When registering MetricProducers if they were to collide then there should be an error/warning. This should be a new norm forming sentence.

I agree this is two separate statements, but it doesn't say anything about per-instrument warnings. I believe we should emit warnings for the conflicting scopes, but allow the instruments to be correctly registered into each. IOW the presence of MetricProducers could produce scope warnings, not instrument warnings. The only way to get instrument warnings should be to have conflicts within a scope (IMO).

MadVikingGod · 2022-09-26T17:12:30Z

I'm ok with the current PR, as I think it offers us a way to short-term migrate from OC to otel without the re-instrument everything step. I think this will be implementable as written.

I still have a preference for approach 2, for two reasons.

I think it will be simpler to implement in go by creating a wrapped reader.
I think it makes it easier to explain what is happening in the following contrived example. If you have 2x readers, and 1x MetricProducer that produces delta metrics, neither reader would get the entire stream of data from the producer. As it is written now the MetricProducer will be shared by the MeterProvider among all Readers. In approach 2 the user would have to explicitly reuse the producer between 2 or more readers.

Because of 2 I would also recommend that we have some warning on any producers we create that can make delta metrics. They shouldn't be used with multiple readers.

jmacd · 2022-09-26T17:53:30Z

I would also recommend that we have some warning on any producers we create that can make delta metrics.

I would be happy to add that MetricProducers MUST NOT use delta temporality, "for they are expected to be stateless" with respect to the MetricReader.

jmacd · 2022-09-26T18:12:20Z

@dashpole I had been confused (and I suspect others have bene) by the option-numbering scheme mentioned in the PR description, especially considering the numbered-list appearing in (and later quoted) #2722 (comment).

@MadVikingGod's points are well taken in arguing for the bridge to be implemented as a wrapped MetricReader. Implementing the bridge inside the Reader is both simpler and allows the correct use of Delta Temporality. The downside of that approach is that warnings will not be realized until the first time they are read. This is completely fine, in my opinion.

@dashpole I recommend closing this PR, revising the contents of this PR, and opening a new one where we do not list numbered alternatives.

dashpole · 2022-09-26T20:59:43Z

I would be happy to add that MetricProducers MUST NOT use delta temporality, "for they are expected to be stateless" with respect to the MetricReader.

It seems like metric reader could still be stateless even with a stateless bridge/reader/exporter if both the bridge and exporter prefer delta. If there was a client that natively produced delta temporality metrics, I don't think we'd want to disallow bridging that to compatible exporters, right?

dashpole · 2022-09-26T21:13:57Z

I opened #2838, which removes "if it is possible for those to conflict" language, splits the conflicting-scope warning sentence into two, and removes the numbered options from the description.

dashpole force-pushed the metric_producer_bridge branch 2 times, most recently from 255ae16 to fdf9fb4 Compare August 10, 2022 21:32

This was referenced Aug 11, 2022

Add back the OpenCensus example code open-telemetry/opentelemetry-go#2806

Closed

Add back the OpenCensus bridge code open-telemetry/opentelemetry-go#2808

Closed

dashpole force-pushed the metric_producer_bridge branch 2 times, most recently from 48512ce to cc4d99c Compare August 11, 2022 17:48

dashpole mentioned this pull request Aug 16, 2022

[PoC] Reimplement OpenCensus bridge open-telemetry/opentelemetry-go#3093

Closed

dashpole commented Aug 16, 2022

View reviewed changes

specification/metrics/sdk.md Outdated Show resolved Hide resolved

dashpole force-pushed the metric_producer_bridge branch from cc4d99c to cd5470d Compare August 16, 2022 18:26

dashpole changed the title ~~OpenCensus Metrics bridge using MetricProducer~~ Add MetricProducer to allow MetricReaders to collect from third-party metric sources Aug 16, 2022

dashpole mentioned this pull request Aug 16, 2022

Add OpenCensus metric bridge specification #2732

Closed

dashpole force-pushed the metric_producer_bridge branch 2 times, most recently from 42b8ad0 to 170351b Compare August 16, 2022 19:23

dashpole mentioned this pull request Aug 16, 2022

OpenCensus Compatibility for GA #1175

Closed

7 tasks

dashpole marked this pull request as ready for review August 16, 2022 19:30

dashpole requested review from a team as code owners August 16, 2022 19:30

github-actions bot assigned jmacd Aug 16, 2022

pirgeo reviewed Aug 17, 2022

View reviewed changes

specification/metrics/sdk.md Outdated Show resolved Hide resolved

specification/metrics/sdk.md Outdated Show resolved Hide resolved

specification/metrics/sdk.md Show resolved Hide resolved

MrAlias reviewed Aug 17, 2022

View reviewed changes

specification/metrics/sdk.md Outdated Show resolved Hide resolved

dashpole force-pushed the metric_producer_bridge branch from c1480d7 to f7c6065 Compare August 18, 2022 01:58

MrAlias approved these changes Aug 18, 2022

View reviewed changes

specification/metrics/sdk.md Outdated Show resolved Hide resolved

specification/metrics/sdk.md Outdated Show resolved Hide resolved

dashpole requested a review from a team as a code owner August 18, 2022 20:06

dashpole force-pushed the metric_producer_bridge branch from 2cb7a75 to a68a576 Compare August 18, 2022 20:11

dashpole mentioned this pull request Aug 18, 2022

Add MetricProducer interface, and MeterProvider Option open-telemetry/opentelemetry-go#3100

Closed

dashpole and others added 2 commits September 13, 2022 17:41

Update specification/metrics/sdk.md

fd3c9c8

Co-authored-by: jack-berg <34418638+jack-berg@users.noreply.github.com>

emit warnings for duplicate scopes

4877d9d

dashpole force-pushed the metric_producer_bridge branch from 49b316e to 4877d9d Compare September 13, 2022 18:28

jmacd approved these changes Sep 13, 2022

View reviewed changes

ahayworth reviewed Sep 13, 2022

View reviewed changes

specification/metrics/sdk.md Outdated Show resolved Hide resolved

dashpole and others added 2 commits September 14, 2022 14:12

Update specification/metrics/sdk.md

79948c5

Co-authored-by: Andrew Hayworth <ahayworth@gmail.com>

Merge branch 'main' into metric_producer_bridge

739dd1a

bogdandrutu reviewed Sep 19, 2022

View reviewed changes

dashpole changed the title ~~Add MetricProducer to allow MeterProviders to incorporate metric data from third-party sources~~ Add MetricProducer to allow sdk.MeterProviders to incorporate metric data from third-party sources Sep 19, 2022

tsloughter mentioned this pull request Sep 19, 2022

MetricProducers open-telemetry/opentelemetry-erlang#462

Open

jmacd reviewed Sep 21, 2022

View reviewed changes

MadVikingGod reviewed Sep 26, 2022

View reviewed changes

MadVikingGod approved these changes Sep 26, 2022

View reviewed changes

dashpole closed this Sep 26, 2022

dashpole mentioned this pull request Sep 26, 2022

Add MetricProducer as a source of external metric data #2838

Closed

dashpole mentioned this pull request Nov 15, 2022

Define MetricProducer as a third-party provider of metric data to MetricReaders #2951

Merged

This was referenced Jul 20, 2023

Change metric.Producer to be an Option on Reader open-telemetry/opentelemetry-go#4346

Merged

MetricProducers are passed via config to MetricReaders instead of RegisterProducer #3613

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MetricProducer to allow sdk.MeterProviders to incorporate metric data from third-party sources #2722

Add MetricProducer to allow sdk.MeterProviders to incorporate metric data from third-party sources #2722

dashpole commented Aug 10, 2022 •

edited

pirgeo left a comment

dashpole commented Aug 17, 2022

jmacd commented Aug 17, 2022

dashpole commented Aug 18, 2022

dashpole commented Aug 18, 2022

bogdandrutu left a comment

dashpole commented Sep 19, 2022

bogdandrutu commented Sep 19, 2022

dashpole commented Sep 19, 2022

jmacd Sep 21, 2022

dashpole Sep 22, 2022

jmacd Sep 22, 2022

jmacd commented Sep 22, 2022

jmacd commented Sep 22, 2022

jmacd commented Sep 26, 2022

ahayworth commented Sep 26, 2022

MadVikingGod Sep 26, 2022

jmacd Sep 26, 2022

MadVikingGod commented Sep 26, 2022

jmacd commented Sep 26, 2022

jmacd commented Sep 26, 2022

dashpole commented Sep 26, 2022

dashpole commented Sep 26, 2022

		`InstrumentationScope()`, and with the `MeterProvider`'s configured resource if
		it is possible for those to conflict.

Add MetricProducer to allow sdk.MeterProviders to incorporate metric data from third-party sources #2722

Add MetricProducer to allow sdk.MeterProviders to incorporate metric data from third-party sources #2722

Conversation

dashpole commented Aug 10, 2022 • edited

Changes

Discussion from the spec sig on 8/30:

pirgeo left a comment

Choose a reason for hiding this comment

dashpole commented Aug 17, 2022

jmacd commented Aug 17, 2022

dashpole commented Aug 18, 2022

dashpole commented Aug 18, 2022

bogdandrutu left a comment

Choose a reason for hiding this comment

dashpole commented Sep 19, 2022

bogdandrutu commented Sep 19, 2022

dashpole commented Sep 19, 2022

jmacd Sep 21, 2022

Choose a reason for hiding this comment

dashpole Sep 22, 2022

Choose a reason for hiding this comment

jmacd Sep 22, 2022

Choose a reason for hiding this comment

jmacd commented Sep 22, 2022

jmacd commented Sep 22, 2022

jmacd commented Sep 26, 2022

ahayworth commented Sep 26, 2022

MadVikingGod Sep 26, 2022

Choose a reason for hiding this comment

jmacd Sep 26, 2022

Choose a reason for hiding this comment

MadVikingGod commented Sep 26, 2022

jmacd commented Sep 26, 2022

jmacd commented Sep 26, 2022

dashpole commented Sep 26, 2022

dashpole commented Sep 26, 2022

dashpole commented Aug 10, 2022 •

edited