
Every queue has a consume rate of at best 25 messages per second. How can I increase this? #63

Closed
ppinel opened this issue Oct 17, 2016 · 26 comments

@ppinel

ppinel commented Oct 17, 2016

Hello,

I am using SenecaJS with this plugin and RabbitMQ. While benchmarking to understand why each queue handled at best 25 messages per second, I found that it takes 40 ms for a message to go and come back.
For example, one microservice asks another microservice for an object (basically a MongoDB query).
The act callback fires 40 ms later, even though the MongoDB query only takes 2 ms to complete.
So now I understand why the RabbitMQ admin interface shows at best 25 messages handled per second per queue: 25 × 40 ms = 1 second.
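
The arithmetic behind that cap is worth spelling out: with a single in-flight message per queue at a time, throughput is bounded by the round-trip time.

```javascript
// Throughput bound implied by the numbers above: one in-flight request
// at a time with a 40 ms round-trip caps each queue at 1000 / 40 msg/s.
const rttMs = 40;
const maxPerSecond = 1000 / rttMs;
console.log(`${maxPerSecond} messages per second`); // → 25 messages per second
```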

I use the default config for the AMQP plugin and SenecaJS. All microservices and RabbitMQ are on the same server.
Each microservice has only one instance running.

Is this normal behaviour?
Is it possible to increase the rate just by tuning the parameters of seneca-amqp-transport, SenecaJS and/or RabbitMQ?

@nfantone
Collaborator

nfantone commented Oct 17, 2016

Hi, @ppinel. Thanks for taking the time to write about your issue.

I've tested this on my side and got wildly different numbers from the ones you brought up. For instance, running the microservices from /examples, both client.js and listener.js locally (with RabbitMQ 3.6.4 on localhost), I got up to 820+ msg/s.

See screenshot below:

screen shot 2016-10-17 at 5 03 41 pm

And while this isn't a thorough benchmark by any means, our results are so different that I'm inclined to say something in your topology or .act function is putting a lid on the throughput.

Are you connecting to the broker locally (ie.: using localhost)? Are you behind a proxy or a load balancer? Was the broker under heavy load when you measured your numbers? How many queues are declared on your vhost? How many messages are on those queues, on average? Is your callback doing anything else that could amount to a significant delay? How are you measuring the RTT?

Finally, could you run the examples and share your results? Or maybe (if this isn't too much to ask), could you profile your script and tell me exactly where exec time is spent? That would be very helpful.

Thanks again!


EDIT
I forgot to mention that I made one small modification to the examples/client.js script: I set the second setInterval argument to 0.

Also, all my tests were run using Node 6.8.1, seneca@3.2.1 and seneca-amqp-transport@2.1.0 on a MacBook Pro (early 2015), 3.1 GHz Intel Core i7, 16 GB 1867 MHz DDR3.
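
A self-contained sketch of that tweak (the example's act() call is replaced here by a simple counter, so this runs without Seneca):

```javascript
// setInterval with a 0 ms delay fires on (roughly) every event-loop
// iteration, maximizing the publish rate instead of pacing messages.
let sent = 0;
const timer = setInterval(() => {
  sent++; // in examples/client.js this is where the act() call happens
  if (sent >= 5) {
    clearInterval(timer);
    console.log(`sent ${sent} messages`);
  }
}, 0); // second argument set to 0, as described above
```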

@ppinel
Author

ppinel commented Oct 18, 2016

Thanks for your response.

Yes, I am connecting to the broker locally, without any load balancer or proxy. The broker wasn't under heavy load: it's my dev environment and nobody uses it but me.
There are 121 queues declared: 104 correspond to responses and 17 to actions.
Traffic on the queues is generally low. The microservices sit behind an API gateway and talk to each other through SenecaJS.

To understand why my action was taking so long, I set a variable to the current time at the beginning of the action and, at different points in the logic, printed the difference between now and that variable.
For example, at some point MicroserviceA acts on MicroserviceB. I print the time diff right before calling act, and again in the act callback.
I do the same on MicroserviceB to see how many milliseconds its logic takes to execute.
I end up with 2 ms on MicroserviceB and 40 ms on MicroserviceA, so I was wondering where the remaining 38 ms were going. When I load-tested, I hit a maximum of 25 messages per second.
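
The instrumentation described above can be sketched without Seneca; fakeAct below is a hypothetical stand-in for a remote action:

```javascript
// Record a timestamp before the "act" call and compute the delta in its
// callback — the same round-trip measurement described in the comment.
function timeRoundTrip(requestFn, done) {
  const start = process.hrtime.bigint();
  requestFn(() => {
    const elapsedMs = Number(process.hrtime.bigint() - start) / 1e6;
    done(elapsedMs);
  });
}

// A fake "act" that invokes its callback immediately (no broker involved).
const fakeAct = cb => cb();

let measured = -1;
timeRoundTrip(fakeAct, ms => { measured = ms; });
console.log(`RTT: ${measured.toFixed(3)} ms`);
```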

I am not saying it's SenecaJS's, RabbitMQ's or anyone's fault. I just want to understand these results so I can improve whatever needs improving.
So your quick answer is much appreciated.

I will run the tests ASAP and get back to you with the results.

EDIT:

I am looking here for stats; maybe it's not the right place.
screen shot 2016-10-18 at 15 56 49

@nfantone
Collaborator

nfantone commented Oct 18, 2016

Awesome. Your setup LGTM. Could you also include the output of RabbitMQ's web admin? In particular, I'm interested in the "Acknowledge" rate.

@ppinel
Author

ppinel commented Oct 18, 2016

Here is the example running on my server. I am lost oO

screen shot 2016-10-18 at 16 14 02

screen shot 2016-10-18 at 16 13 51

@ppinel
Author

ppinel commented Oct 18, 2016

On my macbook pro:
screen shot 2016-10-18 at 16 40 03
screen shot 2016-10-18 at 16 39 24

@ppinel
Author

ppinel commented Oct 18, 2016

I just reinstalled RabbitMQ on the server and still have the same issue.
I updated Node to 6.8.1; same issue.

@nfantone
Collaborator

nfantone commented Oct 18, 2016

@ppinel All right. The numbers on your MacBook look a lot more like what I'd expect. What other differences can you think of between your local env and your server?

Consider:

  • seneca, seneca-amqp-transport versions (latest).
  • Node version (6.8.1 in both).
  • RabbitMQ version (3.6.1 in both).
  • Is your server running inside a VM or a container? (dedicated server)

Also, you did change the setInterval argument to 0 in both cases, right?

@ppinel
Author

ppinel commented Oct 18, 2016

Yes, I changed setInterval in both cases; otherwise messages come in too slowly and I never reach the mysterious limit.
RabbitMQ is 3.6.1 on my server too, and the seneca and seneca-amqp-transport versions are the same.

The server is from the French hosting provider OVH:
https://www.soyoustart.com/fr/offres/e3-ssd-2.xml or
https://www.soyoustart.com/fr/offres/e3-ssd-3.xml
So it's a dedicated server.

@ppinel
Author

ppinel commented Oct 18, 2016

It could be at the OS layer. Is there any limit you had to change, or have you seen anyone with issues on Debian?
I am looking at the Overview tab and nothing has a crazy value (fd, sockets, memory, etc.).

@nfantone
Collaborator

nfantone commented Oct 18, 2016

I'm perplexed. And out of ideas.

Could you please read and check this out? Use the rabbitmqctl command on your server to check whether the connection is being stalled.

@nfantone
Collaborator

It could be at the OS layer. Is there any limit you had to change, or have you seen anyone with issues on Debian?

Although rare, it could be. Check the Process statistics panel on Overview -> (More about this node) and take a look at the thresholds in each graph. For example, mine look like this:

screen shot 2016-10-18 at 5 23 02 pm

Also, is your RabbitMQ node Disc or RAM based?

@nfantone nfantone changed the title Every queue have a consume rate with at best 25 messages per second. How can I increase this? Every queue has a consume rate of at best 25 messages per second. How can I increase this? Oct 18, 2016
@ppinel
Author

ppinel commented Oct 18, 2016

The first one is while running the example; the second one is from before.

I can't tell whether my RabbitMQ node is disc or RAM based. How can I check?

screen shot 2016-10-18 at 18 40 01

screen shot 2016-10-18 at 18 39 03

@nfantone
Collaborator

Ok, so no problems there.

I can't tell whether my RabbitMQ node is disc or RAM based. How can I check?

There's a tag under the node's name on its description page. It also appears on the Overview tab.

screen shot 2016-10-18 at 5 49 35 pm

Did you manage to check the flow status of your connection?

@ppinel
Author

ppinel commented Oct 18, 2016

Ok, disc everywhere.
I also tested on a fresh AWS EC2 instance running Debian; same issue.

screen shot 2016-10-18 at 18 52 05

@ppinel
Author

ppinel commented Oct 18, 2016

What's the 48MB low watermark?
screen shot 2016-10-18 at 18 54 54

@ppinel
Author

ppinel commented Oct 18, 2016

I checked the Connections tab during a test:
screen shot 2016-10-18 at 18 58 22

@nfantone
Collaborator

nfantone commented Oct 18, 2016

What's the 48MB low watermark?

The minimum free disk space the node needs to keep running; when free space drops below it, RabbitMQ raises an alarm and blocks publishers.


Ok, so no throttling on the connection. This looks like a non-arbitrary fixed cap of sorts: 25 is too round a number to be a coincidence.

Anything in the RabbitMQ log? It should be under /var/log.

@ppinel
Author

ppinel commented Oct 18, 2016

Also, it's the same number across servers, even on the AWS EC2 instance.
I created a Stack Overflow question: http://stackoverflow.com/questions/40114532/every-queue-has-a-consume-rate-of-at-best-25-messages-per-second-how-can-i-incr

@ppinel
Author

ppinel commented Oct 18, 2016

Thank you for your time, I'll post an update here as soon as I have the solution.

@nfantone
Collaborator

Thanks! Yes, if you come up with a solution, please share it. I'm intrigued now.

@ppinel
Author

ppinel commented Oct 19, 2016

I hardcoded the noDelay option in amqplib to true and now get 930 messages delivered per second with the example.
I understand why the delay is useful, but maybe I could just decrease it.

screen shot 2016-10-19 at 09 25 39

@ppinel
Author

ppinel commented Oct 19, 2016

I found out how to set the noDelay option: `{ amqp: { socketOptions: { noDelay: true } } }`.
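
For reference, a sketch of the options object from the line above; how exactly it is handed to the plugin is an assumption here and may differ between plugin versions:

```javascript
// Options sketch (taken from the comment above): the object used when
// configuring seneca-amqp-transport. Treat the exact wiring as an
// assumption, not the canonical plugin API.
const transportOptions = {
  amqp: {
    socketOptions: {
      noDelay: true // sets TCP_NODELAY on the AMQP connection's socket
    }
  }
};
console.log(transportOptions.amqp.socketOptions);
```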

@nfantone
Collaborator

I had no idea about that noDelay flag. And I assume that if it's kept false, the delay in socket communication is OS dependent? That would explain the behaviour you saw on Debian.

You learn something new every day. Thanks for your teachings 🐐! Perhaps I can include this in a FAQ section in the README.md or the wiki.

Feel free to close the issue if you think there's nothing else you can do to improve your message rate.

@ppinel
Author

ppinel commented Oct 19, 2016

Yes, it's OS dependent. It would be great if this flag were documented.
Also, this might not be the right solution for everyone.
Thanks again for your help!

@ppinel ppinel closed this as completed Oct 19, 2016
@nfantone
Collaborator

nfantone commented Oct 19, 2016

@ppinel After reading the whole thread on that Google Groups forum, I'm inclined to say that your solution with noDelay: true is very reasonable. And it is platform dependent, as you mentioned above. Nagle's algorithm throttles communication at the TCP level to avoid network congestion by coalescing small packets rather than sending many tiny segments. More on that here.

Glad you could work things out. Good work.
