Tracing: Consider adding a flush API. #126

songy23 · 2018-06-22T23:18:24Z

Proposed by @reyang :

I propose to have a top level Tracing.flush() instead. The reasons are:

shutdown hooks are nondeterministic, if there are other shutdown hooks trying to use OpenCensus, their trace data might get dropped (due to the order of shutdown hooks execution).

shutdown hook requires RuntimePermission("shutdownHooks") which could be a problem for certain scenarios.

give user explicit control so they can flush the trace when appropriate.

/cc @bogdandrutu

sebright · 2018-06-23T00:14:42Z

Related: census-instrumentation/opencensus-java#1141

semistrict · 2018-06-24T11:34:36Z

Go does not have shutdown hooks. Most of the exporters already support a Flush method, but not having this in an interface means that we need always downcast in generic code. Would be nice to see this concept formalized.

While we are at it, does it make sense to make the concept of an internal buffer API-visible? Like being able to control the sizing and auto-flushing policy?

songy23 · 2018-06-25T16:35:12Z

While we are at it, does it make sense to make the concept of an internal buffer API-visible?

I'm not sure if this is necessary, given that users already have control over the exporting interval.

Like being able to control the sizing and auto-flushing policy?

Say some user has a config of buffer size 10 and exporting every 10 seconds, what happened when the buffer was full at 3rd second? Do we reset the exporting time?

reyang · 2018-06-25T17:39:56Z

If the buffer was full before the exporting period, I suggest we discard the incoming traces/metrics until the buffer got flushed. And we might need a log entry indicating there are missing entries (with start time and recovery time).

semistrict · 2018-06-25T18:06:50Z

Say some user has a config of buffer size 10 and exporting every 10 seconds, what happened when the buffer was full at 3rd second? Do we reset the exporting time?

Not exactly sure but probably. Most of the Go exporters use google.golang.org/api/support/bundler under the hood.

songy23 · 2018-06-25T18:53:53Z

If the buffer was full before the exporting period, I suggest we discard the incoming traces/metrics until the buffer got flushed.

We're trying to avoid down-sampling. As @Ramonza mentioned, looks like the current logic is to push the buffered data and reset start time once buffer is full.

Not exactly sure but probably. Most of the Go exporters use google.golang.org/api/support/bundler under the hood.

I see, then I think we should warn user that if they have a small buffer but yet huge amount of data at some point of time, the exporting interval would be much shorter and they need to be aware of the increasing traffic.

semistrict · 2018-06-25T19:42:25Z

Another question that occurred to me regarding the proposed Tracing.flush(): why would this be part of the tracing API and not just an Exporter method?

reyang · 2018-06-25T21:25:21Z

@Ramonza I'm fine if we decided to make it an Exporter method instead of Tracing.flush(). Having it in Tracing seems natural to me personally. (e.g. most people could tell what Tracing.flush() or Trace.flush() mean literally)

reyang · 2018-06-25T21:28:19Z

@songy23 if there are too much data going into the buffer, will we eventually block the writers? I'm concerned about the performance impact.

songy23 · 2018-06-25T22:08:13Z

@reyang I don't think we would block the writer currently. In Java the buffer will just keep growing until next export.

semistrict · 2018-06-26T12:31:17Z

The tracing API is targeted to instrumenting code (libraries or applications). Instrumentation shouldn't care about explicit flushing. Flushing has to do with exporting, so I think it logically belongs there.

If we make this part of the exporter API, I think it's more of a language-specific concern rather than a cross-language specs issue.

reyang · 2018-06-26T17:24:59Z

I believe we'll support the model where people manipulating exporters from the runtime configuration instead of hard-coding. In this case, having to call exporter API to flush seems to be a very strange design.
Technically I understand that flush belongs to exporting, my point is that having flush at Tracing level makes more sense to the user. Take one example, we have file.flush instead of disk.flush/network.flush.

semistrict · 2018-06-26T17:33:58Z

I think this is language-specific but it doesn't seem strange to me for exporters to have a flush method. Indeed, most of them in Go already do (it just isn't required by the Exporter interface).

So there's a question of whether, in addition to this per-Exporter flush, we need some kind of global flush that flushes all the registered exporters. In this case, we probably want to also flush stats exporters (since they have exactly the same requirements for buffered items to be submitted before program termination).

briantq · 2018-08-28T22:47:04Z

I ran into issues in #1391 that I cannot flush traces at will. In a multi threaded environment (new Thread per request), I want to be able to write any span data before the thread is killed. Leaving a Thread hung for 5 seconds defeats the purpose of performance. This one issue alone prevents me from using OpenCensus, as I have to choose between getting output or killing my apps performance I have worked hard to optimize.

My vote is to add flush to the exporter method (Tracing.getExportComponent().flush()), though I am fine with it anywhere as long as I can flush my data prior to losing it.

songy23 added the trace label Jun 22, 2018

knyar mentioned this issue Jul 26, 2018

Make it easier to manually report accumulated data census-instrumentation/opencensus-go#846

Merged

songy23 mentioned this issue Aug 27, 2018

No Trace output when running example with Logging Exporter census-instrumentation/opencensus-java#1387

Closed

g-easy mentioned this issue Oct 17, 2018

What is the recommended way to gracefully shutdown? census-instrumentation/opencensus-cpp#226

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracing: Consider adding a flush API. #126

Tracing: Consider adding a flush API. #126

songy23 commented Jun 22, 2018 •

edited

Loading

sebright commented Jun 23, 2018

semistrict commented Jun 24, 2018

songy23 commented Jun 25, 2018

reyang commented Jun 25, 2018

semistrict commented Jun 25, 2018

songy23 commented Jun 25, 2018

semistrict commented Jun 25, 2018

reyang commented Jun 25, 2018

reyang commented Jun 25, 2018

songy23 commented Jun 25, 2018

semistrict commented Jun 26, 2018

reyang commented Jun 26, 2018

semistrict commented Jun 26, 2018

briantq commented Aug 28, 2018

Tracing: Consider adding a flush API. #126

Tracing: Consider adding a flush API. #126

Comments

songy23 commented Jun 22, 2018 • edited Loading

sebright commented Jun 23, 2018

semistrict commented Jun 24, 2018

songy23 commented Jun 25, 2018

reyang commented Jun 25, 2018

semistrict commented Jun 25, 2018

songy23 commented Jun 25, 2018

semistrict commented Jun 25, 2018

reyang commented Jun 25, 2018

reyang commented Jun 25, 2018

songy23 commented Jun 25, 2018

semistrict commented Jun 26, 2018

reyang commented Jun 26, 2018

semistrict commented Jun 26, 2018

briantq commented Aug 28, 2018

songy23 commented Jun 22, 2018 •

edited

Loading