Run internal metrics through the Vector pipeline #230
Just noting, this will be closed once we fully define the remaining events within Vector and implement them. The remaining work is being defined in #3192.
I've changed my mind. Let's close this and represent the remaining work with other issues.
That's OK, but this issue is still referenced in https://vector.dev/docs/administration/monitoring/#metrics. Also, how should I check whether Vector is alive or not? :)
@freeseacher, we're actively working on exposing internal metrics. This will likely be surfaced in two ways:
Details/ETA TBA, but it's a priority and something we're working on now. We'll update issues with the relevant details as it happens.
But the issue remains closed, yep?
This issue is specific to running internal metrics through the Vector pipeline, which is limited in scope vs. the overarching goal of enabling fuller observability of a Vector instance. #3225 and #3211 tackle it more comprehensively; I'd recommend subscribing to those issues to get better visibility into how this effort is coming along.
I think this issue is somewhat different than the work represented in #3225 and #3211. In my mind, it'd still be useful to pipe Vector's internal metrics to arbitrary sinks so that people can publish them to Datadog, Prometheus, statsd, etc. @binarylogic, do we have this work represented somewhere else? I couldn't find it. I'd advocate for reopening this issue otherwise.
AFAIK, we currently have an undocumented `internal_metrics` source. It's undocumented because there was (and still is) quite a lot of churn w.r.t. internal events/metrics. @binarylogic created an issue ~2 weeks ago to make sure we document this, and it's currently planned to be tackled within our current sprint (next two weeks). Note that this is basically a third way to expose internal events/metrics, in addition to the ones @leebenson mentioned we're currently working on. In essence, the plumbing exists for us to emit metrics, and an undocumented source exists to expose those metrics; now we're working on 1) documenting that source, and 2) adding more ways to expose those metrics.
Here's an example config:

```toml
data_dir = "/var/lib/vector/"

[sources.source0]
max_length = 102400
type = "stdin"

[sources.source1]
type = "internal_metrics"

[transforms.transform0]
inputs = ["source0"]
drop_field = true
drop_invalid = false
type = "json_parser"

[sinks.sink0]
healthcheck = true
inputs = ["transform0"]
type = "blackhole"
print_amount = 1000

[sinks.sink0.buffer]
type = "memory"
max_events = 500
when_full = "block"

[sinks.sink1]
healthcheck = true
inputs = ["source1"]
type = "console"
encoding.codec = "json"

[sinks.sink1.buffer]
type = "memory"
max_events = 500
when_full = "block"
```
It outputs these events (to the `console` sink):

```json
{"name":"events_processed","timestamp":"2020-09-15T13:45:38.461847Z","tags":{"component_kind":"transform","component_type":"json_parser"},"kind":"absolute","counter":{"value":1.0}}
{"name":"bytes_processed","timestamp":"2020-09-15T13:45:38.461881Z","tags":{"component_kind":"sink","component_type":"blackhole"},"kind":"absolute","counter":{"value":5.0}}
{"name":"events_processed","timestamp":"2020-09-15T13:45:38.461884Z","tags":{"component_kind":"sink","component_type":"blackhole"},"kind":"absolute","counter":{"value":1.0}}
{"name":"processing_error","timestamp":"2020-09-15T13:45:38.461886Z","tags":{"component_kind":"transform","component_type":"json_parser","error_type":"failed_parse"},"kind":"absolute","counter":{"value":1.0}}
```
Linking to other issues that took the place of this one: |
Currently metrics are exposed, by default, on port `8888` to be scraped by Prometheus. We should just run internal metrics through our own pipeline via a `diagnostic` source type, allowing the user to connect it to the output of their choice.
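A minimal sketch of what that could look like, assuming an `internal_metrics`-style source wired to a Prometheus-exporter sink; the sink type name and `address` option below are illustrative assumptions, not a final API:

```toml
# Hypothetical config sketch: route Vector's own metrics through the
# pipeline to a sink of the user's choice, instead of a hard-wired
# port-8888 exporter. The `prometheus` sink type and its `address`
# option are assumptions for illustration.
[sources.self_metrics]
type = "internal_metrics"

[sinks.self_prom]
type = "prometheus"
inputs = ["self_metrics"]
address = "0.0.0.0:9598"
```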