Define where to handle invalid span contexts #233

toumorokoshi · 2019-10-22T03:53:37Z

Joining a conversation that's currently happening across two PRs:

It's important for us to standardize on what httptextformatters will return back when they are unable to parse the span context from their respective headers. If the choice is to return an invalid span context, then we should figure out what is responsible for handling that condition, and what the correct behavior is.

Proposal

As a start, I'm proposing:

if a formatter is unable to retrieve a valid spancontext, then an invalid spancontext should be returned.
this should be handled in the propagator shim: https://github.com/open-telemetry/opentelemetry-python/blob/master/opentelemetry-api/src/opentelemetry/propagators/__init__.py#L26 and return back tracer.CURRENT_SPAN.

Here's alternatives and why I'm arguing against them:

Handling invalid spans in the integrations

Handling this in the integrations would require the same boilerplate code to handle and convert the invalid span into the CURRENT_SPAN constant. I worry about someone missing that implementation detail resulting in incongruent behavior.

Handling invalid spans in the SDK

I'm not 100% clear on what this would look like, but it would probably require making propagators and API interface and moving implementation code into the SDK. Not including this behavior would mean that the API alone is not enough to implement correct w3c tracecontext propagation (as invalid span contexts would not be converted to new ones).

I'm having trouble finding where it was stated that the API should propagate tracecontext by defaults. Maybe @c24t or @reyang can point me in the right direction?

Oberon00 · 2019-10-22T15:50:36Z

I think the API will not be required to propagate context see open-telemetry/opentelemetry-specification#208 (comment)

c24t · 2019-10-23T00:30:54Z

I'm having trouble finding where it was stated that the API should propagate tracecontext by defaults. Maybe @c24t or @reyang can point me in the right direction?

open-telemetry/opentelemetry-specification#208 is probably the best source, this isn't really documented in the spec.

I think the API will not be required to propagate context

I think it's too soon to call this issue. The discussion in the specs issue is better, but as I understand it there's no way for us to do all of:

enable context propagation in applications via extensions alone (e.g. ext-grpc for gRPC, ext-wsgi/ext-requests for HTTP)
not require libraries to depend on the SDK
implement context propagation in the SDK only

From #228 (review):

I think we should rather fix the handling of INVALID_SPANCONTEXT in the SDK. Moving generate_spancontext to the API would break open-telemetry/oteps#58

@Oberon00 can you spell this out? Why does moving ID generation into the API make that OTEP impossible to implement?

Returning INVALID_SPAN from the formatter and converting to CURRENT_SPAN in the propagator sounds like a great solution to me, but it sounds like I'm missing some of the implications here.

Does it matter that integrations wouldn't be able to distinguish between invalid and unspecified spans?

Oberon00 · 2019-10-23T08:58:44Z

Because a vendor-SDK might encode custom information in the span context, maybe even in the Trace or Span ID. Thinking about it, moving the generate_span_id might not be as bad as I first thought because any vendor SDK would still have to deal with incoming span contexts that were not created by it anyway.

Anyway, my suggestion would be: Handle INVALID_SPANCONTEXT just like None when it is passed as parent. Maybe even disallow None as parent and only allow INVALID_SPANCONTEXT.

toumorokoshi · 2019-10-23T17:38:12Z

That's a good idea, it makes sure that things are handled in a standard way without adding additional logic to integrations. Although it does come at the cost of not being able to disambiguate no parent vs invalid parent, but I feel like that isn't a big deal.

@c24t @mauriciovasquezbernal any concerns with handling this in Tracer.start_span (or those collection of methods?)

mauriciovasquezbernal · 2019-10-23T19:49:40Z

I don't have concerns at this point. Just a question, propagators must then return INVALID_SPANCONTEXT when something fails. Is an SpanContext with trace_id and span_id 0 valid?

Just asking because the b3 propagator returns the later, a check like context is INVALID_SPAN_CONTEXT is False then.

Fixes open-telemetry#233. The SDK tracer will now create spans with invalid parents as brand new spans, similar to not having a parent at all. Adding this behavior to the Tracer ensures that integrations do not have to handle invalid span contexts in their own code, and ensures that behavior is consistent with w3c tracecontext (which specifies invalid results should be handled by creating new spans).

toumorokoshi · 2019-10-24T04:37:41Z

@mauriciovasquezbernal regarding B3 and how they handle context propagation... I'm following this github repo and I don't see anything explicitly calling out how to handle invalid contexts: https://github.com/openzipkin/b3-propagation

Should we be handling this? or is there a more authoritative source?

looking closely at the spec again, I believe our current b3 propagation is invalid because it does not propagate the ParentSpanContext header. I'll file a ticket around that, but I don't think B3 propagation is something that we should fix in this PR.

c24t · 2019-10-24T07:16:22Z

Maybe even disallow None as parent and only allow INVALID_SPANCONTEXT

Even if we handle both cases the same way, I'd still prefer to allow null here because it makes for a more intuitive API. It's clear that null means "no parent context" in this case.

any concerns with handling this in Tracer.start_span (or those collection of methods?)

I think it's possible we'll discover some reason later that we have to distinguish between these in the SDK, or handle invalid contexts farther up the stack, but I don't have any concerns now.

Fixes open-telemetry#233. The SDK tracer will now create spans with invalid parents as brand new spans, similar to not having a parent at all. Adding this behavior to the Tracer ensures that integrations do not have to handle invalid span contexts in their own code, and ensures that behavior is consistent with w3c tracecontext (which specifies invalid results should be handled by creating new spans).

The SDK tracer will now create spans with invalid parents as brand new spans, similar to not having a parent at all. Adding this behavior to the Tracer ensures that integrations do not have to handle invalid span contexts in their own code, and ensures that behavior is consistent with w3c tracecontext (which specifies invalid results should be handled by creating new spans). Setting the parent to none on spans if the parent context is invalid, reducing logic to handle that situation in downstream processing like exporters.

* fix: ts-mocha allow recursively loading files

toumorokoshi added the discussion Issue or PR that needs/is extended discussion. label Oct 22, 2019

toumorokoshi added this to the Alpha v0.2 milestone Oct 22, 2019

This was referenced Oct 22, 2019

ext/wsgi: use current span when extracting fails #226

Closed

Feature/tracecontext integration test #228

Merged

toumorokoshi mentioned this issue Oct 24, 2019

SDK Tracer treats invalid span parent like null (fixes #233) #235

Merged

toumorokoshi closed this as completed in #235 Oct 24, 2019

srikanthccv pushed a commit to srikanthccv/opentelemetry-python that referenced this issue Nov 1, 2020

fix: ts-mocha allow recursively loading files (open-telemetry#233)

af5d88a

* fix: ts-mocha allow recursively loading files

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define where to handle invalid span contexts #233

Define where to handle invalid span contexts #233

toumorokoshi commented Oct 22, 2019

Oberon00 commented Oct 22, 2019

c24t commented Oct 23, 2019 •

edited

Oberon00 commented Oct 23, 2019

toumorokoshi commented Oct 23, 2019

mauriciovasquezbernal commented Oct 23, 2019

toumorokoshi commented Oct 24, 2019

c24t commented Oct 24, 2019

Define where to handle invalid span contexts #233

Define where to handle invalid span contexts #233

Comments

toumorokoshi commented Oct 22, 2019

Proposal

Handling invalid spans in the integrations

Handling invalid spans in the SDK

Oberon00 commented Oct 22, 2019

c24t commented Oct 23, 2019 • edited

Oberon00 commented Oct 23, 2019

toumorokoshi commented Oct 23, 2019

mauriciovasquezbernal commented Oct 23, 2019

toumorokoshi commented Oct 24, 2019

c24t commented Oct 24, 2019

c24t commented Oct 23, 2019 •

edited