Question: is there any reason why the max stream length is set to 10,000? #110

Closed
dirkgroenen opened this issue Jan 13, 2020 · 2 comments

@dirkgroenen

First of all, thanks for all the hard work going into this new version! 👏

We currently have a Bull setup that we use to retrieve and process large datasets in the background. After running it in production for a month or two, we noticed our Redis instance was consuming over 2 GB of memory. On closer inspection, most of this was caused by the event stream, which keeps references to processed jobs that have large data and returnvalue properties.

We're currently thinking about reducing streams.events.maxLen to a (much) lower value, but before doing so I would like to know what the potential impact could be.
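For reference, this is roughly how we would lower it (a minimal sketch; the queue name and connection details are placeholders):

```ts
import { Queue } from 'bullmq';

// Cap the events stream at 250 entries instead of the default 10000.
// 'dataset-processing' and the connection details are placeholders.
const queue = new Queue('dataset-processing', {
  connection: { host: '127.0.0.1', port: 6379 },
  streams: {
    events: {
      maxLen: 250,
    },
  },
});
```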

I've done some tests and looked around in the code, and I can't seem to find any reason why we couldn't lower it to something like 250. Can you elaborate on the reason behind the default value of 10,000, and on whether setting it to, for example, 250 could harm anything? Thanks!

manast commented Jan 13, 2020

In legacy Bull we used PubSub for delivering global events. This works well but has some important limitations:

  • you do not get any guarantee that your event listeners will receive all the events; due to network issues, for example, you may lose some events.
  • it is almost impossible to build a UI that uses the events to update the status of the jobs.

These two problems are solved by using streams. As long as you have an eventId that represents your last received event, you can replay events until you catch up to real time. Soon all the getters will also return the last eventId, so that you can update a UI relying on the events.
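As a rough sketch of what catching up looks like on the consumer side (the queue name, connection details, and stored event id are placeholder assumptions):

```ts
import { QueueEvents } from 'bullmq';

// After a disconnect, resume from the last event id you persisted;
// everything newer is replayed from the Redis stream before the
// listener continues in real time.
const lastSeenId = '1578907000000-0'; // hypothetical stored event id

const events = new QueueEvents('dataset-processing', {
  connection: { host: '127.0.0.1', port: 6379 },
  lastEventId: lastSeenId,
});

events.on('completed', ({ jobId, returnvalue }) => {
  // update the UI state for jobId here
});
```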

In your case, if you do not care about global events you can set this value to 1. If you do care, then just find a value that is dimensioned for your queues.

The thinking here can be something like: how many seconds of events would I like to keep so that I can handle a network partition or similar?
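As a back-of-the-envelope example (all numbers are illustrative assumptions, not recommendations):

```ts
// Keep roughly one minute of events so a briefly disconnected
// listener can replay everything it missed.
const jobsPerSecond = 50;  // your measured throughput
const eventsPerJob = 4;    // e.g. added, active, progress, completed
const windowSeconds = 60;  // outage you want to survive

const maxLen = jobsPerSecond * eventsPerJob * windowSeconds; // 12000
```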

@dirkgroenen

Thanks for your explanation @manast. 👍
