Define behavior for response header context propagation #50

SergeyKanzhelev · 2018-01-17T21:28:52Z

There are scenarios when services need to return correlation information in response. Scenarios include:

Sampling flag (+ sampling score) for delegated sampling
Tenant ID/identity of the service so caller knows where to query telemetry from

Spec need to define which headers SDK may expect and service should use for http response. Also behavior for service mash and proxies needs to be defined so headers would not be lost.

codefromthecrypt · 2018-01-17T23:56:51Z

one thing interesting about this is whether or not downstream need to be considered for the data propagated back. The two you mentioned could be resolved locally, ex not depending on the result of any potential downstream calls. The complexity of sending back sampling flag, trace ID (if for example none was provisioned), tenant ID is not hard, but some of these data imply either taking roles others are doing independently not (ex tenant id is often a separate header and unrelated to tracing). I guess you mention different types of data because you are asking if broadly speaking, trace-context and correlation-context are to be propagated back, such that they can be used for other middleware concerns such as knowing the tenant ID. Is that right?

SergeyKanzhelev · 2018-01-18T00:04:10Z

Yes, that was the main point. We need information that is required to build a trace to be returned back in response in certain scenarios. First scenario is for sampled flag that is part of Trace-Context header and second is about solution specific tenant id that is part of Trace-Context-Ex.

And we need to make sure those headers will be preserved and passed further in response for proxies and balancers. But we cannot demand it from all services.

Also - does it ever make sense to return trace-id in response headers? Or we can limit spec to only allow Trace-Context-Ex header to be returned.

codefromthecrypt · 2018-01-18T00:18:07Z

Yes, that was the main point. We need information that is required to build a trace to be returned back in response in certain scenarios. First scenario is for sampled flag that is part of Trace-Context header and second is about *solution specific* tenant id that is part of Trace-Context-Ex.

thx for clearing it up

And we need to make sure those headers will be preserved and passed further in response for proxies and balancers. But we cannot demand it from all services.

ok so the implementor can't blindly overwrite or depend on local state. it has to be prepared to consider and parse an incoming trace context

Also - does it ever make sense to return trace-id in response headers?

was trying to figure this out. For example, there are middleware discussions of passing trace ID back which occurred in the past openzipkin/b3-propagation#4

Or we can limit spec to only allow Trace-Context-Ex header to be returned.

If anything is passed back there will be similar code impact which will break I'd say most libraries out there. Using the work "allow" is interesting as it is like a MAY, which is less casualties short term.

SergeyKanzhelev · 2018-01-18T00:41:22Z

Ok. So scenario for trace-id I understood from the linked issue is: "let's return trace-id if we started a new one". This may be valid when service do not trust caller to define a trace, but trust enough to expose the newly generated one. I think this may be preferable mode of operation for some cloud services. So it would be great to allow for this implementation in a standard.

So I'd say let's just state that both headers can appear in response. And implementors can do what they need. If you do not know what to do with them and have 1:1 mapping between incoming and outgoing span - pass context along.

SergeyKanzhelev · 2018-01-18T00:41:56Z

It may be good enough for a spec. We can mention specific use cases as a best practices.

wu-sheng · 2018-01-18T01:14:43Z

response header context propagation

@SergeyKanzhelev @adriancole This is an ongoing design in SkyWalking about this concept. But not relate to tenant id or new generated trace-id.

In our case, it supposes to be used like this:

Pass back the server side service name(operation name of EntrySpan). e.g. in Spring HTTP mapping, server side URL can be defined like prod/{userId}/{orderId}, at the same time, client side use prod/123/o345 as URL. You can see there are different in both side. In SkyWalking analysis module, it asks for no blocking/waiting other trace segment in order to improve performance. But as different URLs used, the service dependency is not right.

yurishkuro · 2018-01-18T01:49:31Z

What would be the semantics with server returning a trace ID if the client also sent a trace ID in the request? Will the spec require that ID to be the same? What about span ID?

One use case someone raised recently was about using the returned trace+span IDs to validate that the response is actually to the right request. To quote the source, "I've experienced Java bugs before where we were sending responses on the wrong file descriptors which causes all sorts of trouble." Arguably this could be delegated to be the RPC framework responsibility, at the expense of RPC framework injecting yet another unique ID header.

SergeyKanzhelev · 2018-01-18T18:59:18Z

@wu-sheng are you thinking of operation name in response as a Trace-Context-Ex property or Correlation-Context property? (With the guidance that trace context is used for tracing essential properties and very limited in size). I can see it as a scenario to allow Correlation-Context header in response.

@yurishkuro scenario I suggested is when service do not trust client to send random enough trace-id or has it's reasons to restart the trace. @bogdandrutu was talking a lot about Google services doing it. In this scenario returning different trace-id will make it possible to link items. Scenario of RPC calls request/response matching may be also valid. However I do not think the spec should require the match.

yurishkuro · 2018-01-18T19:51:45Z

However I do not think the spec should require the match.

I think that's fine, we just need to be explicit about it, not leave undefined behavior/expectations.

SergeyKanzhelev · 2018-01-23T16:49:08Z

@yurishkuro @adriancole I;'d appreciate review of PR #51

bogdandrutu · 2018-02-01T18:45:28Z

I know some other things that we may want to have in the response:

server latency (when you cross trusting boundaries) you may want to share this with your client, e.g. one cloud service can return this.

@SergeyKanzhelev yes we do have this problem about trusting the incoming trace-id.

bripkens · 2018-02-01T19:00:12Z

server latency (when you cross trusting boundaries) you may want to share this with your client, e.g. one cloud service can return this.

This could be achieved via Server-Timing: https://w3c.github.io/server-timing/

Actually, Server-Timing could be (ab-)used to implement some of the use cases mentioned here.

SergeyKanzhelev · 2018-02-26T21:29:36Z

Also mention populating of Access-Control-Expose-Headers header

AloisReitbauer · 2018-05-02T08:36:40Z

This relates to whether we can reuse server timing #69

SergeyKanzhelev mentioned this issue Jan 18, 2018

Define response headers behavior - old #51

Closed

SergeyKanzhelev mentioned this issue Feb 6, 2018

Refactor spec towards trace-parent and trace-state #57

Closed

SergeyKanzhelev added this to the experiment milestone May 2, 2018

AloisReitbauer added the trace-context label May 2, 2018

AloisReitbauer modified the milestones: 1. experiment, 3. FPWD Aug 6, 2018

AloisReitbauer mentioned this issue Aug 7, 2018

Do we need a response header? #148

Closed

SergeyKanzhelev mentioned this issue Aug 7, 2018

response headers rationale #150

Merged

SergeyKanzhelev closed this as completed Aug 8, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define behavior for response header context propagation #50

Define behavior for response header context propagation #50

SergeyKanzhelev commented Jan 17, 2018

codefromthecrypt commented Jan 17, 2018 via email

SergeyKanzhelev commented Jan 18, 2018

codefromthecrypt commented Jan 18, 2018 via email

SergeyKanzhelev commented Jan 18, 2018

SergeyKanzhelev commented Jan 18, 2018

wu-sheng commented Jan 18, 2018 •

edited

Loading

yurishkuro commented Jan 18, 2018

SergeyKanzhelev commented Jan 18, 2018

yurishkuro commented Jan 18, 2018

SergeyKanzhelev commented Jan 23, 2018

bogdandrutu commented Feb 1, 2018

bripkens commented Feb 1, 2018

SergeyKanzhelev commented Feb 26, 2018

AloisReitbauer commented May 2, 2018

Define behavior for response header context propagation #50

Define behavior for response header context propagation #50

Comments

SergeyKanzhelev commented Jan 17, 2018

codefromthecrypt commented Jan 17, 2018 via email

SergeyKanzhelev commented Jan 18, 2018

codefromthecrypt commented Jan 18, 2018 via email

SergeyKanzhelev commented Jan 18, 2018

SergeyKanzhelev commented Jan 18, 2018

wu-sheng commented Jan 18, 2018 • edited Loading

yurishkuro commented Jan 18, 2018

SergeyKanzhelev commented Jan 18, 2018

yurishkuro commented Jan 18, 2018

SergeyKanzhelev commented Jan 23, 2018

bogdandrutu commented Feb 1, 2018

bripkens commented Feb 1, 2018

SergeyKanzhelev commented Feb 26, 2018

AloisReitbauer commented May 2, 2018

wu-sheng commented Jan 18, 2018 •

edited

Loading