
New grpc source #573

Open
derekperkins opened this issue Jul 3, 2019 · 14 comments
Labels
- `domain: external docs`: Anything related to Vector's external, public documentation
- `domain: sources`: Anything related to Vector's sources
- `sink: vector`: Anything `vector` sink related
- `source: vector`: Anything `vector` source related
- `type: task`: Generic non-code related tasks

Comments

@derekperkins commented Jul 3, 2019

We currently use the Stackdriver Go client compiled into our applications because I hate the extra stdout/stderr parsing overhead. I already have a structured log in hand, so why would I serialize it to JSON, print it to stdout, and have it parsed back in, with a significant chance of parsing errors? I'd prefer to be able to ship the structured log directly to Vector and then have it routed to the appropriate sinks.

@derekperkins (Author)

As some prior art, here is what the Stackdriver proto looks like.
https://github.com/googleapis/googleapis/blob/master/google/logging/v2/logging.proto

@LucioFranco (Contributor)

We already have protobuf definitions for our internal event type: https://github.com/timberio/vector/blob/master/proto/event.proto. It looks like we could easily expose a gRPC service using https://github.com/tower-rs/tower-grpc that uses our internal protobuf structure.
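Concretely, a unary ingest service over that event type might look something like the sketch below. The service and message names here are illustrative assumptions for discussion, not actual definitions from the repo:

```proto
syntax = "proto3";

package vector;

import "event.proto";

// Hypothetical ingest service: a client pushes a batch of events
// and receives a simple acknowledgement.
service Vector {
  rpc PushEvents(PushEventsRequest) returns (PushEventsResponse);
}

message PushEventsRequest {
  repeated event.EventWrapper events = 1;
}

message PushEventsResponse {}
```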

@derekperkins (Author)

That would be great!

LucioFranco added a commit that referenced this issue Jul 10, 2019
This adds an initial gRPC source that provides non-TLS HTTP/2
access. It accepts a single unary RPC call that takes a batch of
`EventWrapper`s from `event.proto`. These get translated directly
into `Event`.

Closes #573

Signed-off-by: Lucio Franco <luciofranco14@gmail.com>
@binarylogic binarylogic changed the title Provide gRPC logging interface New grpc source Aug 5, 2019
@binarylogic (Contributor)

This is basically complete, but it raised a number of design questions that we need to answer before we proceed to merge this. See #617 (review).

Once we obtain consensus on this we can reopen that PR, adjust, and then merge.

@fcantournet
This would be very interesting as a sink for Envoy access logs too, replacing the file-based approach (which entails dealing with log rotation, mounting a hostPath in the container, etc.).

@binarylogic (Contributor) commented Jan 1, 2020

I've been thinking about this more, especially as we start to ramp up our marketing efforts around application monitoring and observability. I'm going to respond to this comment.

What does the full user experience here look like? Will we be generating and distributing clients that speak this protocol or is that something users should deal with themselves?

No, to start, clients will deal with this themselves. We have discussed repurposing the Timber libraries as Vector in-app extensions, but that is not a near-term change.

Do we really want to push the details of our internal data representation up into user's applications? Given the multitude of existing formats and protocols for shipping this kind of data around, do we really want to commit to designing, maintaining, and evolving our own?

This is where I think we deviate from the initial implementation. I do not think we should expose our exact internal data model, we should, instead, expose a raw event protobuf (the highest form of our data model). The act of deriving metrics, traces, exceptions, etc from events should be done within Vector itself.

This fits nicely into some of our upcoming marketing efforts around application monitoring and observability. Ideally, applications would emit raw events that describe their internal state and that's it. Deriving specific observability data types from this data should happen external to the application. If an application emits other data types they can use the source designed for that data type. For example, if the application exposes metrics data through a prometheus endpoint then Vector can scrape that with the prometheus source.
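As an illustration of that split, a Vector config along these lines could accept raw events and derive a counter metric inside Vector itself (a sketch; the component names are made up, and exact option names may vary by Vector version):

```toml
# App ships raw structured events straight to Vector.
[sources.app_events]
type = "vector"
address = "0.0.0.0:6000"

# Derive a counter from the raw events inside Vector itself.
[transforms.request_counts]
type = "log_to_metric"
inputs = ["app_events"]

  [[transforms.request_counts.metrics]]
  type = "counter"
  field = "message"
  name = "events_total"

# Expose the derived metrics for scraping.
[sinks.prom]
type = "prometheus_exporter"
inputs = ["request_counts"]
address = "0.0.0.0:9598"
```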

Should we really have this protocol and the protobuf/tcp "vector" protocol for which we've already built sources and sinks?

Yes. The vector sources/sinks are meant to be opaque to the user. They are not designed for any other external clients, this source is.

For example, could we implement an SSF source instead? Maybe something from OpenMetrics or OpenTelemetry? Or is this how we should have built the vector source/sink in the first place and this should replace it?

I actually think these are too opinionated for this specific source. Raw events are the true normalized form of all of these data types. If we did decide to support SSF or OpenTelemetry we should support them explicitly through specific sources or decoding transforms.

Final thoughts

The reason I've been thinking about this is that I find it superior to the traditional app -> disk (file) -> agent design. This allows the application to talk directly to Vector. The use of the disk as a buffer then becomes a design decision the user makes in how they configure Vector's buffers.
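For example, opting back into disk buffering is a per-sink setting rather than an application concern. A sketch, assuming a downstream `vector` sink (see Vector's buffer docs for the exact keys and minimum sizes):

```toml
[sinks.downstream]
type = "vector"
inputs = ["app_events"]
address = "aggregator.internal:6000"

  # Buffer events on disk instead of in memory, and apply
  # backpressure rather than dropping when the buffer fills.
  [sinks.downstream.buffer]
  type = "disk"
  max_size = 268435488  # bytes
  when_full = "block"
```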


@lukesteensen let me know what you think.

@lukesteensen (Member)

I do not think we should expose our exact internal data model, we should, instead, expose a raw event protobuf (the highest form of our data model). The act of deriving metrics, traces, exceptions, etc from events should be done within Vector itself.

This makes sense to me. I'm curious what kind of schema you have in mind for these events. I assume it would be a pretty loose key/value type of structure with various types of values available. Would we support higher level things like metrics, or just simple scalar values?

Yes. The vector sources/sinks are meant to be opaque to the user. They are not designed for any other external clients, this source is.

I wonder if it would make sense to rename them in that case (e.g. native, internal, forward).

This allows the application to talk directly to Vector.

This is always the sticky bit in my mind. Direct, "smart" logging to some probably-network endpoint puts a pretty large burden on the logging library to handle things like load shedding, never blocking, safe buffering, etc, etc. Going to stdout or local disk is pretty well understood and supported in all languages, so I'd just want to be careful that whatever we recommend is similarly unlikely to cause an application outage for users.

@binarylogic (Contributor)

Going to stdout or local disk is pretty well understood and supported

Agree, I've just been seeing a lot of advice to avoid the disk which got me thinking about direct app -> vector communication, especially since Vector has configurable disk buffers. I still agree stdout is the best method. It's simple, decoupled, 12-factor, etc.

@LucioFranco (Contributor)

Stackdriver seems to do some funky `Any` magic for their format, which we could adapt so we don't have to expose our proto internals, but I'm not sure how worthwhile a gRPC sink is then. https://github.com/googleapis/googleapis/blob/master/google/logging/v2/log_entry.proto#L72

@binarylogic binarylogic added type: feature A value-adding code addition that introduces new functionality. and removed type: new feature labels Jun 16, 2020
@binarylogic binarylogic added domain: sources Anything related to Vector's sources needs: approval Needs review & approval before work can begin. needs: requirements Needs a list of requirements before work can begin labels Aug 7, 2020
@ckdarby commented Feb 2, 2022

@binarylogic Did the gRPC input fizzle out?

@jszwedko (Member)

@ckdarby The vector source is actually a gRPC source now. We haven't publicly documented the interface, but it should be pretty stable since we need to maintain compatibility between each minor Vector version (though may break compatibility over 2+ versions). It's possible we'll just document that and provide additional stability guarantees. You can see the proto definition here: https://github.com/vectordotdev/vector/blob/master/proto/vector.proto (which imports https://github.com/vectordotdev/vector/blob/master/lib/vector-core/proto/event.proto).
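A minimal config using that gRPC-based protocol might look like the following (a sketch based on the comment above; the address is arbitrary):

```toml
[sources.vector_in]
type = "vector"
address = "0.0.0.0:6000"
version = "2"  # the gRPC-based protocol described above
```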

@ckdarby commented Feb 11, 2022

@jszwedko Does that also mean vector sink is a gRPC sink as well 🤯?

@jszwedko (Member)

@jszwedko Does that also mean vector sink is a gRPC sink as well 🤯?

It is indeed 😄 This is with `version = "2"` configured on the component. `version = "1"` is a raw TCP-based protocol.
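Paired together, the sink and source sides might be configured like this (a sketch; the addresses are arbitrary, and it appears the v2 sink address takes a full URI rather than a bare host:port):

```toml
# Sending side
[sinks.vector_out]
type = "vector"
inputs = ["my_source"]
address = "http://aggregator.internal:6000"
version = "2"

# Receiving side (on the aggregator)
[sources.vector_in]
type = "vector"
address = "0.0.0.0:6000"
version = "2"
```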

@jszwedko (Member) commented Aug 1, 2022

Repurposing this to be an issue for documenting the gRPC interface supported by the vector source and sink, which I believe covers the use-case here.

@jszwedko jszwedko added sink: vector Anything `vector` sink related source: vector Anything `vector` source related domain: external docs Anything related to Vector's external, public documentation and removed needs: approval Needs review & approval before work can begin. needs: requirements Needs a list of requirements before work can begin labels Aug 1, 2022
@jszwedko jszwedko added type: task Generic non-code related tasks and removed type: feature A value-adding code addition that introduce new functionality. labels Aug 1, 2022
7 participants