
feat(new sink): Initial rabbitmq sink implementation #1376

Closed

Jeffail wants to merge 2 commits into master from AlyHKafoury-rabbitmq-sink

Conversation

@Jeffail (Contributor) commented Dec 16, 2019

Supersedes #1078

I've refactored the config fields but there are two remaining problems I think we ought to discuss:

  1. If the connection is lost, the library we're using doesn't handle background reconnects the way librdkafka does. This means we need to implement our own mechanism (see the sketch after this list), otherwise users will be forced to restart the service.

  2. When it's known at poll time that an event has failed to send, we need to ensure that it is reattempted indefinitely. This ties into #1107 (Implement end-to-end record acknowledgement).
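For illustration, a minimal sketch of the kind of background reconnect loop item 1 calls for, assuming tokio for the timer; Connection and connect are hypothetical placeholders for the real client types, and a production version would also need to observe a shutdown signal so the loop can't block shutdown:

use std::time::Duration;

// Hypothetical placeholders for the real client types.
struct Connection;
async fn connect() -> Result<Connection, std::io::Error> {
    Ok(Connection)
}

// Retry indefinitely with capped exponential backoff so a transient
// broker outage doesn't force users to restart the service.
async fn reconnect_forever() -> Connection {
    let mut delay = Duration::from_millis(500);
    loop {
        match connect().await {
            Ok(conn) => return conn,
            Err(err) => {
                eprintln!("connect failed: {}, retrying in {:?}", err, delay);
                tokio::time::sleep(delay).await;
                delay = (delay * 2).min(Duration::from_secs(30));
            }
        }
    }
}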

AlyHKafoury and others added 2 commits December 16, 2019 10:32
Signed-off-by: AlyHKafoury <aly.kafoury@gmail.com>
Signed-off-by: Ashley Jeffs <ash@jeffail.uk>
@binarylogic (Contributor)
Nice! We definitely need to resolve both of those issues. @LucioFranco, do you mind chiming in on the best way to do that? I feel like you have the best understanding of the underlying networking code.

> When it's known at poll time that an event has failed to send we need to ensure that it is reattempted indefinitely

This should currently be the default for all of our HTTP sinks.

Comment thread: src/sinks/rabbitmq.rs

fn new(config: RabbitMQSinkConfig, acker: Acker) -> crate::Result<Self> {
    let channel = Client::connect(&config.uri, config.connection_properties())
        .and_then(|client| client.create_channel())
        .wait()?;
Contributor
It seems surprising that this works; I would assume it needs to do some IO?

Comment thread: src/sinks/rabbitmq.rs

Ok(Async::Ready(Some(((), seqno)))) => {
    if self.pending_acks.remove(&seqno) {
        self.acker.ack(1);
        trace!("published message to rabbitmq");
Contributor

It might make sense to add the seqno to these logs; you can use a span to do that.
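For illustration, a minimal sketch of attaching the sequence number via a span, assuming the tracing crate; the span name and the wrapping function are hypothetical:

use tracing::{trace, trace_span};

fn handle_ack(seqno: u64) {
    // Events emitted while the guard is live inherit the span's
    // `seqno` field, so the trace line below carries it automatically.
    let span = trace_span!("rabbitmq_publish", seqno = seqno);
    let _enter = span.enter();
    trace!("published message to rabbitmq");
}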

@Jeffail (Contributor, Author) commented Dec 17, 2019

Intention: This is very relevant to any streaming type of sink we might add in the future, so we ought to work out a solution here that we're happy to reuse.

Correct behavior regarding connection loss is to indefinitely attempt to re-establish it, but we need to be sure we don't block shutdown or any other mechanisms.

Regarding failed message sends, at a minimum we need to reattempt the message indefinitely (assuming the failure is temporary), and it makes sense to attempt this within the same mechanism as the connection recovery.

@LucioFranco (Contributor)

Ok, I took a long, deep look at the lapin library. I think our best bet for providing AMQP support is via this library. The library itself is a bit odd: it provides a futures interface but doesn't actually hook into any of the tokio primitives. It seems like it was originally created before things like tokio existed, but after mio. At a high level, the library provides its own reactor/driver and its own executor. After reading through the source code some more, I am happy with using this executor because 1) it only spawns one extra thread, and 2) if the client/channel types are dropped, the sender to the executor will drop and the executor will be cleaned up. This means that we will not leak threads if we decide to reconnect.

That said, we should move forward with the current library, but we should change how we implement the sink. I suggest that we drop the Sink trait implementation and instead use a tower::Service based one. The benefits of a tower implementation are twofold. 1) It is simpler: we only need to implement one future that ensures we are connected, dispatches the basic_publish, and ensures we got the ack. This can be done with a single future implementation that re-connects if basic_publish fails with the relevant error type. 2) We can take advantage of tower-retry and the retry policies we have already implemented.

So what I suggest is this:

  1. We implement a tower::Service that wraps basic_publish and holds an Option<Channel> (I think in this case we also want to hold onto the connection). When we get a tower request we first make sure that we already have a channel; if not, we create a new one, which is also how reconnection happens. Then we dispatch the request via basic_publish. If this returns an error we peek at it, and if it indicates the TCP connection is broken we clear out the option, because this channel is no longer usable. Then we return the error, which lets upstream tower services retry. (See the sketch after this list.)

  2. Layer the tower services so that we only allow one concurrent basic_publish and retry, in order to maintain ordering and sequencing.

  3. Implement a StreamingServiceSink, something similar to BatchServiceSink, that instead of batching items will submit them to the service in a streaming fashion.
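A hypothetical sketch of the service described in item 1. Channel, PublishError, connect_channel, and publish are placeholder names standing in for the real lapin API, not actual lapin calls, and clearing the stored channel is only illustrated in comments since it needs shared state in practice:

use std::future::Future;
use std::pin::Pin;
use std::task::{Context, Poll};
use tower::Service;

// Placeholders for the real lapin types.
#[derive(Clone)]
struct Channel;
#[derive(Debug)]
struct PublishError {
    connection_broken: bool,
}

async fn connect_channel() -> Result<Channel, PublishError> {
    Ok(Channel)
}
async fn publish(_ch: Channel, _payload: Vec<u8>) -> Result<(), PublishError> {
    Ok(())
}

struct RabbitMQService {
    // `None` means the previous channel died and we must reconnect
    // before the next publish.
    channel: Option<Channel>,
}

impl Service<Vec<u8>> for RabbitMQService {
    type Response = ();
    type Error = PublishError;
    type Future = Pin<Box<dyn Future<Output = Result<(), PublishError>> + Send>>;

    fn poll_ready(&mut self, _cx: &mut Context<'_>) -> Poll<Result<(), Self::Error>> {
        Poll::Ready(Ok(()))
    }

    fn call(&mut self, payload: Vec<u8>) -> Self::Future {
        let channel = self.channel.clone();
        Box::pin(async move {
            // Ensure we are connected before dispatching the publish.
            let ch = match channel {
                Some(ch) => ch,
                None => connect_channel().await?,
            };
            match publish(ch, payload).await {
                Ok(()) => Ok(()),
                Err(e) if e.connection_broken => {
                    // In the real service we would clear the stored channel
                    // here (via shared state), then return the error so an
                    // upstream retry layer re-drives the request.
                    Err(e)
                }
                Err(e) => Err(e),
            }
        })
    }
}

Item 2 would then fall out of tower's stock middleware: wrapping this service in a concurrency limit of one (e.g. tower's ConcurrencyLimit layer) plus a retry layer keeps publishes ordered while still retrying failures.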

This is just a high-level view of how I would go about implementing reconnect. Of course, we could continue with the current implementation, but we would repeat a lot of code that we have already implemented and tested.

@binarylogic (Contributor)

Nothing; this has been assigned to @LucioFranco to complete, although it is not high priority at the moment. Once @LucioFranco is done with the HTTP sink work, it is worth considering this next.

@binarylogic (Contributor)

Closing this for now since there are some substantial changes that we need to make to get this merged. We'll reopen when we get more user demand.

@binarylogic binarylogic closed this Mar 9, 2020
@binarylogic binarylogic deleted the AlyHKafoury-rabbitmq-sink branch April 24, 2020 20:37