More ECS field on spans: "destination" #115

roncohen · 2019-07-10T12:54:08Z

There are several areas that could benefit from having more data on spans. There's a lot we can do, but i suggest we start with simple things. destination is a good first candidate. This builds on and supersedes this proposal which suggests to introduce the peer namespace.

I suggest we automatically set destination.address as the remote address for spans of type ext. Also for db spans, this could be relevant if the data is available. There are probably other we can think of. Generally, i think we shouldn't constrain ourselves to certain types of spans, but set the destination fields everywhere they make sense. Would be great to get your ideas.

outdated proposal 1

destination.address is the raw address. That is, we use whatever we have. We can set destination.port if we can derive it, e.g. for http libraries where the user did not specify a port we can assume it's port 80 for http and 443 for https. Down the line we can look into also supplying destination.ip or destination.domain, but i don't think it's necessary as a start. The rest like destination.bytes and destination.packets are less relevant in this context.

If users create spans manually and set any of the following, we should also set the ECS fields:

peer.ipv4 -> destination.ip
peer.ipv6 -> destination.ip
peer.port -> destination.port
peer.hostname -> destination.domain

@elastic/apm-agent-devs Please have a look. When there are no more objections or additional clarification needed I'll post the checkbox matrix

outdated proposal 2

from @beniwohli:

For databases and similar, we derive the address from the connection string. Drop port if it is the standard port for the given service
For HTTP requests, we derive the address from the URL
- {url.protocol}://{url.domain}:{url.port}, drop port component if it is the standard port for the given protocol

Agents send context.destination.address and context.destination.port, with the exact same meanings as defined by ECS:

context.destination.address: should hold either an IP (v4 or v6) or a host/domain name. The server can copy the value to destination.ip or destination.domain, depending on the value, rather than duplicating that logic in each agent, which requires no local context
context.destination.port: should hold a port number; agents should report default ports

For database queries, the destination information can be extracted from the connection string. For outgoing HTTP requests, the information should be extracted from the URI's authority.

We will leave translating OpenTracing peer.* tags to destination. field(s) to a later time.

Example 1: client request to https://elastic.co/foo/bar

context.destination.address: elastic.co
context.destination.port: 443 (default HTTPS port)

Example 2: client request to http://[::1]:8080/

context.destination.address: ::1
context.destination.port: 8080

Example 3: query to postgresql://postgres123.local/dbinst

context.destination.address: postgres123.local
context.destination.port: 5432 (default postgres port)

Example 4: query to user:pass@tcp(1.2.3.4:1234)/dbname (MySQL)

context.destination.address: 1.2.3.4
context.destination.port: 1234

Agent	Link to agent issue
.NET	elastic/apm-agent-dotnet#611
Go	elastic/apm-agent-go#676
Java	elastic/apm-agent-java#934
Node.js	elastic/apm-agent-nodejs#1684
Python	elastic/apm-agent-python#618
Ruby	elastic/apm-agent-ruby#613
RUM	elastic/apm-agent-rum-js#490
APM Server	elastic/apm-server#2917

The text was updated successfully, but these errors were encountered:

hmdhk · 2019-07-18T12:33:43Z

For the RUM agent the raw address might be a relative url! Since ECS doesn't include relative urls as valid addresses we need to generate address using the page origin. capturing destination.ip is not feasible for the RUM agent.

@roncohen , re. destination.port and destination.domain should we parse these from address on the agent side or can this be done on the APM server. The logic would be the same either way, so we have the opportunity to implement it only once (i.e. in APM server).

watson · 2019-08-02T12:27:54Z

In OpenTracing, the peer.* tags are used a lot. As you mention in elastic/apm-server#813 we should just use destination for outgoing spans, which I'm generally ok with, but how do we know when a user sets peer.*, if it's outgoing on incoming?

felixbarny · 2019-08-02T12:44:41Z

Is it safe to say that when setting peer.* on a transaction, it's incoming when setting on a span, it's outgoing?

axw · 2019-08-05T02:19:16Z

That depends: do we consider the message broker to be the destination or the source for a consumer span? If it's always the destination then yes, but otherwise we might also need to consider the span.kind tag.

SergeyKleyman · 2019-08-22T10:35:06Z

Can we add source.ip/port and destination.ip/port for transactions in addition to spans?
For example for [SIEM integration] Collect authentication related information #128 we are planning to add authentication attempts info to transaction events and it would make things easier if other properties (such as source.ip/port and destination.ip/port) where available in transaction events as well.

SergeyKleyman · 2019-08-22T10:39:11Z

@axw By span.kind tag do you mean something like incoming and outgoing?
But wouldn't event with kind incoming be transaction and not a span? Which is okay since as I mentioned above we should add source.ip/port and destination.ip/port to transaction as well.

axw · 2019-08-23T01:00:29Z

@SergeyKleyman according to https://github.com/opentracing/specification/blob/master/semantic_conventions.md, span.kind can be "producer" or "consumer" for messaging. Message consumption might be represented as a transaction (e.g. for JMS MessageListener style APIs), but you could also have a span for receiving a message from a queue.

axw · 2019-10-23T07:25:44Z

elastic/apm-agent-go#664 is a POC which adds context.destination.address for database spans. If this looks OK, we should press on and get the intake API updated.

@roncohen if we're capturing the destination address for HTTP spans, we should be taking into account proxies in order to ensure we record the peer network address, for SIEM, right?

beniwohli · 2019-10-23T12:04:25Z

Another POC, elastic/apm-agent-python#618. This adds the destination.address for psycodb2 (aka Python PostgreSQL driver), redis and Elasticsearch. I prioritized services that are used by opbeans-python.

The branch can be tested with

scripts/compose.py start master --with-opbeans-python --opbeans-python-agent-branch=db-destination --opbeans-python-agent-repo=beniwohli/apm-agent-python

roncohen · 2019-10-23T12:24:55Z

@axw great! not sure how SIEM deals with proxies. Perhaps there's a separate set of proxy fields?

beniwohli · 2019-11-13T13:06:24Z

Summarizing some discussion from the weekly meeting:

For databases and similar, we derive the address from the connection string.
- proposal: drop port if it is the standard port for the given service
For HTTP requests, we derive the address from the URL
- proposal: {url.protocol}://{url.domain}:{url.port}, drop port component if it is the standard port for the given protocol)

Summarizing open questions:

normalized (destination.protocol, destination.domain, destination.port) vs. denormalized (destination.address)
translate opentracing peer.* tags to destination. field(s)?

Proposal: go with the denormalized form, and leave opentracing translation for the next iteration.

@elastic/apm-agent-devs please comment if you disagree with the above proposals, or 👍 if you're good.

mikker · 2019-11-13T13:21:35Z

I don't really want to maintain a list of default ports if I can help it. Why not just send it? 4-5 digits are not going to add much to the gzipped payloads.

beniwohli · 2019-11-13T13:28:53Z

Sorry, should have added some arguments to the proposals :D this isn't really about saving space, but about trying to harmonize data. If one service is configured to connect to mysql://some-host:3306 and another one to mysql://some-host, some-host will appear as two separate MySQL services in the service map. Unless we do some kind of merging based on the default port in the UI, but I think the agents are in a better position to know default ports of services they instrument.

roncohen · 2019-11-18T08:51:42Z

thanks for pushing this forward @beniwohli! I've updated the description to your suggestion and added the check box matrix.

@elastic/apm-agent-devs Please create your individual issues and link them in the list.

simitt · 2019-11-18T08:55:39Z

@beniwohli not sure you are the right person to ask for, but could you summarize the fields that will be necessary on the Intake API, so we can create the according server issue and plan for it.

beniwohli · 2019-11-18T09:15:22Z

@simitt as far as I can tell, only context.destination.address for now. @roncohen probably knows if they need to be indexed/stored based on what the service map requires.

roncohen · 2019-11-18T10:16:31Z

right, thanks @simitt. context.destination.address is 👍 . Nothing else for now.

felixbarny · 2019-11-18T13:47:09Z

IIUC, it's quite important that agents end up sending the same context.destination.address if they connect to the same service, right?

I think we should update the spec (https://github.com/elastic/apm/blob/master/docs/agent-development.md) and add some examples or acceptance tests for things like Postgres, Elasticsearch, MongoDB, Redis, etc.

wolframhaussig · 2019-11-18T13:54:21Z

Will context.destination.address support more than 1 value? Background of my question is #107 - Proposal: add Database Link to span context. We connect to a database and jump to a second one using a db link. Therefore we would have 2 target DBs

simitt · 2019-11-18T14:25:06Z

And just to clarify this should be added only to the context for spans right? Or are there any use cases where a destination would also be available for a transaction or error?

axw · 2019-11-19T05:32:37Z

@wolframhaussig it would not, and this is only intended to hold network addresses

@simitt yes, only spans

axw · 2019-11-19T06:35:12Z

Seeing as the agents are only producing one field, the server will presumably need to explode the address into ip/domain/port.

For IPv6 addresses, I think we should format them as in URLs, i.e. by surrounding the IPv6 address component in square brackets, regardless of whether there's a port included.

e.g. given http://[::1]:80, we should report [::1], and given http://[::1]:8080, report [::1]:8080

beniwohli · 2019-11-19T09:19:24Z

Seeing as the agents are only producing one field, the server will presumably need to explode the address into ip/domain/port.

Can you expand on that? AFAIK, it's not necessary for the service map to have the components of the address split up, or is it?

axw · 2019-11-19T09:30:29Z

Can you expand on that? AFAIK, it's not necessary for the service map to have the components of the address split up, or is it?

Not directly, but I presume we'll be storing the information in ECS destination.* fields, which do not permit the hostname/IP and port to be stored together. The service maps code would consult those fields to produce the labels. @roncohen is that accurate?

simitt · 2019-11-19T09:54:42Z

From an APM/SIEM integration points of view we want to at least extract the destination.ip from the destination.address (elastic/apm-server#2917)

SergeyKleyman · 2019-11-19T10:09:30Z

@axw

Could you please clarify the following?

e.g. given http://[::1]:80, we should report [::1], and given http://[::1]:8080, report [::1]:8080

I was under impression that span.context.destination.address should include protocol as well so http://[::1]:80 should be reported as http://[::1]:80. Although in this case naming this field span.context.destination.address seems a little bit wrong - maybe span.context.destination.url or span.context.destination.link but we can discuss the name of the field after agreeing on its intended content.

axw · 2019-11-20T02:59:33Z

I had a chat with Ron offline. There's a few issues here.

For service maps, we really want to be able to aggregate on the address and port as one field. That doesn't work if we stick to ECS definitions, as destination.address is expected to hold either an IP or a domain name, while destination.port holds the port number.

Later on we'll likely want to add other types of destination labels for service maps, such as queue or topic names when sending to a message queue or bus. We were deferring this, but given that we'll need a separate field to combine address and port, it probably makes sense for that field to hold these types of labels too.

We've been talking about omitting the port number from the context due to a service maps requirement. This data will also be used for SIEM integration (#115 (comment)); the service maps requirement should not impact SIEM.

Due to the interference caused by these, I'd like to refocus this issue specifically on recording network destination information for SIEM, and create a separate issue for capturing a more abstract, logical service destination label. That label is where we'll consider omitting ports and so on.

So for this specific issue, I'd like to wind back the proposal half way: agents send context.destination.address and context.destination.port, with the exact same meanings as defined by ECS:

context.destination.address: should hold either an IP (v4 or v6) or a host/domain name. The server can copy the value to destination.ip or destination.domain, depending on the value, rather than duplicating that logic in each agent, which requires no local context.
context.destination.port: should hold a port number; agents will not omit default ports.

Example 1: client request to `https://elastic.co/foo/bar`

context.destination.address: elastic.co
context.destination.port: 443

Example 2: client request to `http://[::1]:8080/`

context.destination.address: ::1
context.destination.port: 8080

Example 3: query to `postgresql://postgres123.local/dbinst`

context.destination.address: postgres123.local
context.destination.port: 5432 (default postgres port)

Example 4: query to `user:pass@tcp(1.2.3.4:1234)/dbname` (MySQL)

context.destination.address: 1.2.3.4
context.destination.port: 1234

axw · 2019-11-22T02:47:49Z

The 👀 have it, I've updated the description. I'll follow up with a separate issue for the logical destination soon.

axw · 2019-12-06T06:36:37Z

Note: for IPv6 addresses, surrounding the address with square brackets is only relevant where you have (or may have) the port alongside, e.g. [::1]:80, http://[::1]:80, http://[::1] (square brackets still required because the port is optional). The square brackets are there to disambiguate the IPv6 address, which contains colons, from the proceeding port, which is separated by a colon.

When recording an IPv6 address separately from a port, as we are in the case of context.destination.address, then it should be recorded in its canonical form: without square brackets, e.g. ::1.

graphaelli · 2020-05-07T16:47:58Z

this is done

roncohen mentioned this issue Jul 10, 2019

Standardize on where to store span peer info elastic/apm-server#813

Closed

roncohen mentioned this issue Jul 29, 2019

Servicemap POC elastic/kibana#42120

Closed

simitt mentioned this issue Aug 8, 2019

[SIEM integration] Collect authentication related information #128

Open

15 tasks

roncohen mentioned this issue Aug 23, 2019

Service Map #137

Closed

6 tasks

beniwohli mentioned this issue Oct 23, 2019

capture destination host for instrumentations of services elastic/apm-agent-python#618

Merged

8 tasks

axw mentioned this issue Oct 31, 2019

Populate span.db.link/span.db.instance elastic/apm-agent-nodejs#1482

Closed

felixbarny added this to the 7.6 milestone Nov 18, 2019

felixbarny mentioned this issue Nov 18, 2019

Add destination.* fields elastic/apm-agent-java#934

Closed

mikker mentioned this issue Nov 18, 2019

Add destination ECS fields to spans elastic/apm-agent-ruby#613

Closed

hmdhk mentioned this issue Nov 18, 2019

Add destination.service to spans for service maps feature elastic/apm-agent-rum-js#490

Closed

2 tasks

axw mentioned this issue Nov 19, 2019

Add span.context.destination.address elastic/apm-agent-go#676

Closed

SergeyKleyman mentioned this issue Nov 19, 2019

Add span.context.destination.* properties for SIEM integration elastic/apm-agent-dotnet#611

Closed

2 tasks

axw mentioned this issue Nov 25, 2019

RFC: destination service name #174

Closed

hmdhk mentioned this issue Dec 4, 2019

Provide ECS destination field on spans elastic/apm-agent-rum-js#513

Closed

axw mentioned this issue Dec 6, 2019

feat(rum-core): enrich span context with desination metadata elastic/apm-agent-rum-js#515

Merged

3 tasks

simitt mentioned this issue Dec 10, 2019

Add support for destination on Intake API elastic/apm-server#2917

Closed

axw mentioned this issue Dec 12, 2019

Destination service name (part 2) #180

Closed

felixbarny mentioned this issue Jan 8, 2020

Enforce specs for cross-agent features #192

Closed

This was referenced Mar 19, 2020

add destination fields to all spans for Service Maps and SIEM elastic/apm-agent-nodejs#1684

Closed

feat: enrich spans with destination info elastic/apm-agent-nodejs#1685

Merged

graphaelli closed this as completed May 7, 2020

trentm mentioned this issue Nov 3, 2020

"redis@2.x" TAV test failures elastic/apm-agent-nodejs#1850

Closed

SergeyKleyman mentioned this issue Mar 1, 2021

Add span.context.destination.address/port properties for SIEM integration elastic/apm-agent-php#360

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More ECS field on spans: "destination" #115

More ECS field on spans: "destination" #115

roncohen commented Jul 10, 2019 •

edited by lreuven

Loading

hmdhk commented Jul 18, 2019

watson commented Aug 2, 2019 •

edited

Loading

felixbarny commented Aug 2, 2019

axw commented Aug 5, 2019

SergeyKleyman commented Aug 22, 2019

SergeyKleyman commented Aug 22, 2019

axw commented Aug 23, 2019

axw commented Oct 23, 2019

beniwohli commented Oct 23, 2019

roncohen commented Oct 23, 2019

beniwohli commented Nov 13, 2019

mikker commented Nov 13, 2019 •

edited

Loading

beniwohli commented Nov 13, 2019 •

edited

Loading

roncohen commented Nov 18, 2019

simitt commented Nov 18, 2019

beniwohli commented Nov 18, 2019

roncohen commented Nov 18, 2019

felixbarny commented Nov 18, 2019

wolframhaussig commented Nov 18, 2019

simitt commented Nov 18, 2019

axw commented Nov 19, 2019

axw commented Nov 19, 2019 •

edited

Loading

beniwohli commented Nov 19, 2019

axw commented Nov 19, 2019 •

edited

Loading

simitt commented Nov 19, 2019

SergeyKleyman commented Nov 19, 2019

axw commented Nov 20, 2019 •

edited

Loading

axw commented Nov 22, 2019

axw commented Dec 6, 2019

graphaelli commented May 7, 2020

More ECS field on spans: "destination" #115

More ECS field on spans: "destination" #115

Comments

roncohen commented Jul 10, 2019 • edited by lreuven Loading

Example 1: client request to https://elastic.co/foo/bar

Example 2: client request to http://[::1]:8080/

Example 3: query to postgresql://postgres123.local/dbinst

Example 4: query to user:pass@tcp(1.2.3.4:1234)/dbname (MySQL)

hmdhk commented Jul 18, 2019

watson commented Aug 2, 2019 • edited Loading

felixbarny commented Aug 2, 2019

axw commented Aug 5, 2019

SergeyKleyman commented Aug 22, 2019

SergeyKleyman commented Aug 22, 2019

axw commented Aug 23, 2019

axw commented Oct 23, 2019

beniwohli commented Oct 23, 2019

roncohen commented Oct 23, 2019

beniwohli commented Nov 13, 2019

mikker commented Nov 13, 2019 • edited Loading

beniwohli commented Nov 13, 2019 • edited Loading

roncohen commented Nov 18, 2019

simitt commented Nov 18, 2019

beniwohli commented Nov 18, 2019

roncohen commented Nov 18, 2019

felixbarny commented Nov 18, 2019

wolframhaussig commented Nov 18, 2019

simitt commented Nov 18, 2019

axw commented Nov 19, 2019

axw commented Nov 19, 2019 • edited Loading

beniwohli commented Nov 19, 2019

axw commented Nov 19, 2019 • edited Loading

simitt commented Nov 19, 2019

SergeyKleyman commented Nov 19, 2019

axw commented Nov 20, 2019 • edited Loading

Example 1: client request to https://elastic.co/foo/bar

Example 2: client request to http://[::1]:8080/

Example 3: query to postgresql://postgres123.local/dbinst

Example 4: query to user:pass@tcp(1.2.3.4:1234)/dbname (MySQL)

axw commented Nov 22, 2019

axw commented Dec 6, 2019

graphaelli commented May 7, 2020

roncohen commented Jul 10, 2019 •

edited by lreuven

Loading

watson commented Aug 2, 2019 •

edited

Loading

mikker commented Nov 13, 2019 •

edited

Loading

beniwohli commented Nov 13, 2019 •

edited

Loading

axw commented Nov 19, 2019 •

edited

Loading

axw commented Nov 19, 2019 •

edited

Loading

axw commented Nov 20, 2019 •

edited

Loading

Example 1: client request to `https://elastic.co/foo/bar`

Example 2: client request to `http://[::1]:8080/`

Example 3: query to `postgresql://postgres123.local/dbinst`

Example 4: query to `user:pass@tcp(1.2.3.4:1234)/dbname` (MySQL)