WIP: Add Labels.Unique() API, remove label encoder from ungrouped batcher #641

jmacd · 2020-04-15T04:11:08Z

This is an optimization for export pipelines that do not encode labels as a string. My real goal was simply to remove the label encoder dependency from the integrator (still called batcher), to reduce cycles and dependencies between components.

… batcher

MrAlias · 2020-04-15T16:57:17Z

sdk/export/metric/metric.go

@@ -329,42 +313,10 @@ type Record struct {
 // presented in sorted order by the SDK.
 type Labels interface {
 	Iter() LabelIterator
+	Unique() interface{}


Can this be changed to return a string? Or, another concrete type?

The interface{} type when used in this way circumvents type checking that helps prevent runtime errors.

This can't be changed because the underlying type used by the real SDK is a variable-sized array, the same one used as a map key for variable-size label sets. The "Simple" implementation uses a string, and the constraint is that you can't use both at the same time--that's why I moved "Simple" into the test package.

I've added comments on the interface methods here.

This has a bad smell to it and is making me think there is a mistake in the design.

the two implementations do not return a consistent value, uniqueness is only guaranteed for a single implementation.

This seems like the antithesis of what a Go interface is used for. Instead of the Labels interface acting as a contract between the implementer and the caller of the declared methods it becomes a direct circumvention of the typing system. Instead of the compiler asserting the passed type is compatible, it is now the the callers responsibility to assert the passed interface is from a single implementation and thus compatible.

may be used as a map key

Additionally, it is only in documentation that the type returned needs to be a comparable value. This again is moving additional responsibility of the compiler to the user.

I get that this can work, but I feel it is building something users are going to have safety issues and frustration with down the road.

I'm wondering if we have over-optimized the Labels implementation in the default SDK and it has now put us into a corner because of this.

To be clear, I can move past this concern if we think this is need to be merged to unblock stuff. But I think it is an issue and we should give it some serious thought.

There is a prior PR about this: #523

It was an optimization in the *sdk/metric.SDK, and there is only one SDK in use at a time. The SDK goes to great lengths to encode a unique interface{} value for use in the sync.Map of current records, and I think it's reasonable for the integrator to use this. If an exporter is receiving data from multiple SDKs, which I think of as an irregular configuration, then it would have to be defensive about this kind of thing.

When testing, it is appropriate to use the simple implementation of the export metric.Labels, and still correct. It would only be incorrect to mix metric.Labels value from a test and a real SDK in the same map, then you'd have two entries, unexpectedly with the same identity.

Funny story, opening up that prior PR, GitHub showed me the pending review I had started but didn't finish because the PR was merged before I could. It contained similar concerns around the premature optimization and complexity added by using the reflect package to dynamically declare variable-length arrays.

I think this approach can be tractable if you maintain all uses of the code, like inside the SDK. However, when we are exporting this implementation in this way we lose all guarantees on use and conformity. We cannot ensure users will understand and appropriately follow the restrictions and guidelines outlined here. In fact, I think it should be expected that users are going to use these things in ways that we never anticipated much less the ways we anticipate to be wrong.

You consistently seem to have a better grasp on the project structure direction and specifically the end goals of Labels than I do. I'm wondering if I'm missing a big-picture thing here that is driving us in this direction. From my perspective, it seems like we could build a better user experience by going about this in a different way. Maybe we can talk more about this in the next SIG meeting.

One question: are the interfaces and types in sdk/export/metric supposed to be reused/implemented by other SDKs? If not, then having an exporter gathering data from multiple SDKs would be possible. Unless we are talking about multiple instances of the same SDK type.

If we would want to differentiate the unique representations across SDKs, then we could have something like this:

// in sdk/export/metric/metric.go var sdkIDCounter int64 = 0 // NewIDForSDK should be called by each SDK at the creation time, blah blah func NewIDForSDK() int64 { return atomic.AddInt64(&labelEncoderIDCounter, 1) } // UniqueRepr is a unique representation of the labels, // can be used as a map key, and so on and so forth type UniqueRepr struct { id int64 raw interface{} } // NewUniqueRepr yadda yadda, id should come from NewIDForSDK() func NewUniqueRepr(id int64, data interface{}) UniqueRepr { return UniqueRepr{ id: id, data: data, } } type Labels interface { … Unique() UniqueRepr } // in sdk/metric/sdk.go func (ls *labels) Unique() export.UniqueRepr { return export.NewUniqueRepr(ls.meter.id, ls.ordered) } func New(batcher export.Batcher, opts ...Option) *SDK { … m := &SDK{ …, id: export.NewIDForSDK(), } … }

I think that even if we abandon the idea of SDK ID, hiding the interface{} inside a struct as an unexported member still would be better than returning interface{}. As such, it wouldn't give the exporters any idea about guessing the real type behind interface{}. Maybe even we could rename UniqueRepr to MapKey or something to give a clear message about the purpose of the type.

Your suggestion to wrap the interface{} is a great help.

As noted below, I think we can drop the idea of "SDK ID" and instead move the "label set" type into the API. Then the batchers and exporters can be guaranteed that the unique value is really unqiue.

I'll hold off on this PR for now.

jmacd · 2020-04-19T04:32:32Z

I thought over the objection to this PR, looked at the benchmark changes from #523, and I'm still in favor of the basioc approach. The argument to me is that the benchmarks are significantly better, and the operation being optimized is in the path of every directly-accessed instrument. Assuming that the SDK's sync.Map maintains directly-accessed instrument records at least one interval after they are collected, then any re-use of a directly-accessed instrument stands to win from this PR, because although the SDK's unique value has to be computed on every metric event, the label encoding can be cached across intervals.

I like @krnowak's suggestion that wrapping the interface{} so that users cannot access it is one step forward, but it doesn't avoid the problem of multiple SDKs sending to an export pipeline with alternate implementations of Unique(). In the related discussion about label sets and resource sets in #640, there was a suggestion that we could move a label set type into the API and have it shared by metrics and SDK resources. In this scenario, we can then force any SDK that uses the export API to use a standard label set type, which means we can ensure that the Unique() value is unique independent of the SDK.

I propose to shelve this PR for now, while refactoring label sets as described above in a replacement for #640. After the refactoring, this PR can be re-applied (using @krnowak's suggestion) without a uniqueness problem.

There are two separate matters that this brings up, which I'd like to address:
(1) The SDK is not actually retaining direct-access records across >1 interval (iiuc), so label-encoding caching is probably not working like we want.
(2) One of the motivations here was to reduce the dependency on labelset injection, particularly because there is only one label encoding that gets the benefit of caching. I would like to change the label set to cache 2 or 3 label encodings, so that we won't face a performance cliff if there are >1 exporters configured. In the prototype in OTEP 78 I had refactored the label set in a similar way, both placing it in the API and supporting multiple cached encodings.

jmacd · 2020-04-21T07:40:43Z

See #651

Add (export.Labels).Unique() API, remove label encoder from ungrouped…

c3cacc2

… batcher

jmacd requested review from krnowak, lizthegrey, MrAlias, paivagustavo, rghetia and tedsuo as code owners April 15, 2020 04:11

jmacd added the area:metrics Part of OpenTelemetry Metrics label Apr 15, 2020

jmacd added 3 commits April 14, 2020 21:13

Missing file

52a697c

Move IteratorToSlice to the test package

7a5a21c

More test

a7ea472

jmacd mentioned this pull request Apr 15, 2020

Add resource support to dogstatsd exporter open-telemetry/opentelemetry-go-contrib#13

Merged

MrAlias reviewed Apr 15, 2020

View reviewed changes

Add comments on the Labels API

2a7d3c2

Aneurysm9 approved these changes Apr 15, 2020

View reviewed changes

jmacd closed this Apr 17, 2020

Upstream

e815db9

jmacd reopened this Apr 19, 2020

jmacd changed the title ~~Add Labels.Unique() API, remove label encoder from ungrouped batcher~~ WIP: Add Labels.Unique() API, remove label encoder from ungrouped batcher Apr 19, 2020

This was referenced Apr 19, 2020

Checkpoint only after Update; Keep records in the sync.Map longer #647

Merged

Do not cache reflect.ValueOf() in metrics Labels #649

Merged

New api/label package, common label set impl #651

Merged

jmacd closed this Apr 21, 2020

jmacd mentioned this pull request Apr 23, 2020

Use the label.Set.Equivalent value instead of an encoding in the batcher #658

Merged

jmacd deleted the jmacd/moreunique branch May 11, 2020 16:28

jmacd restored the jmacd/moreunique branch May 11, 2020 16:28

jmacd deleted the jmacd/moreunique branch May 11, 2020 16:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Add Labels.Unique() API, remove label encoder from ungrouped batcher #641

WIP: Add Labels.Unique() API, remove label encoder from ungrouped batcher #641

jmacd commented Apr 15, 2020

MrAlias Apr 15, 2020

jmacd Apr 15, 2020

MrAlias Apr 15, 2020 •

edited

MrAlias Apr 15, 2020

jmacd Apr 15, 2020

MrAlias Apr 15, 2020

krnowak Apr 16, 2020 •

edited

jmacd Apr 19, 2020

jmacd commented Apr 19, 2020

jmacd commented Apr 21, 2020

WIP: Add Labels.Unique() API, remove label encoder from ungrouped batcher #641

WIP: Add Labels.Unique() API, remove label encoder from ungrouped batcher #641

Conversation

jmacd commented Apr 15, 2020

MrAlias Apr 15, 2020

Choose a reason for hiding this comment

jmacd Apr 15, 2020

Choose a reason for hiding this comment

MrAlias Apr 15, 2020 • edited

Choose a reason for hiding this comment

MrAlias Apr 15, 2020

Choose a reason for hiding this comment

jmacd Apr 15, 2020

Choose a reason for hiding this comment

MrAlias Apr 15, 2020

Choose a reason for hiding this comment

krnowak Apr 16, 2020 • edited

Choose a reason for hiding this comment

jmacd Apr 19, 2020

Choose a reason for hiding this comment

jmacd commented Apr 19, 2020

jmacd commented Apr 21, 2020

MrAlias Apr 15, 2020 •

edited

krnowak Apr 16, 2020 •

edited