
feat: initial implementation endpoint based workers #7

Merged · 12 commits · Feb 25, 2024

Conversation

@manast (Contributor) commented Feb 24, 2024:

This PR implements a new approach for handling queues and workers, based on a standard RESTful HTTP API and a webhook-based API for processing jobs. More documentation will be available at https://docs.bullmq.net/ over the following days.
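As a hypothetical sketch of the idea (paths, ports, and payload fields here are assumptions, not the actual API documented at docs.bullmq.net): a worker is registered through the proxy's HTTP API, and the proxy then delivers each job to the worker's webhook endpoint.

```ts
// Register a worker for a queue via the proxy's HTTP API
// (endpoint path and body shape are illustrative only).
const res = await fetch('http://localhost:8080/workers', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    queue: 'my-queue',
    endpoint: { url: 'https://my-service.example.com/process', timeout: 3000 },
  }),
});

// From here on, the proxy POSTs each job's payload to the registered
// endpoint and settles the job based on the HTTP response it gets back.
console.log(res.status);
```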

@roggervalf (Collaborator) left a comment:

LGTM

(Resolved review threads on src/validators/workers.validators.ts and README.md, now outdated.)
"@taskforcesh/message-broker": "https://github.com/taskforcesh/message-broker.git#master",
"bullmq": "latest",
"bullmq": "^5.3.2",
"chalk": "^5.3.0",

Reviewer:

Any reason to use chalk? Faster options are available.

@manast (Contributor, Author):

Chalk is well maintained, so that's a plus. Usually I pick "pino", though, as a fast and versatile alternative to console.log; maybe it is the best alternative here as well... 🤔

Reviewer:

+1 on pino, it is awesome. It doesn't work in browsers, but from my understanding you only need it on the server.

@manast (Contributor, Author):

We can change it in a separate PR, as it will not imply a breaking change.
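For reference, a minimal pino setup looks like this (a generic sketch, unrelated to this PR's code):

```ts
import pino from 'pino';

// Structured JSON logging; the level is configurable per environment.
const logger = pino({ level: process.env.LOG_LEVEL || 'info' });

logger.info({ queue: 'my-queue' }, 'worker started');
logger.error({ err: new Error('boom') }, 'job failed');
```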

Review thread on the LRU cache `get` method:

```ts
get(key: string) {
  if (this.cache.has(key)) {
    const value = this.cache.get(key);
    if (value) {
      this.cache.delete(key); // bump recency: re-insert moves key to the end
      this.cache.set(key, value);
      return value;
    }
  }
}
```

Reviewer:

Delete and reset on every get is not great from a perf perspective. If you want a faster implementation, you can check out how it's done in https://github.com/kibertoad/toad-cache/blob/main/src/LruMap.js

@manast (Contributor, Author) commented Feb 25, 2024:

Yeah, it seems like delete+set would be slow, but I have been surprised so many times in the past where the seemingly inefficient solution happens to be fast due to JS runtime internals... I would rather run some benchmarks first before changing this to a more complex solution. It could also be the case that this solution allows 1M get calls per second, meaning that a faster "get" will not lead to a noticeably faster "addJob".

@kibertoad commented Feb 25, 2024:

I've measured the difference between toad-cache and tinylru. tinylru does an explicit set for LRU bumping on every get, while toad-cache doesn't. The perf difference is 3.8K ops/sec vs 4.2K ops/sec. I expect that an explicit delete on top of that would make it significantly slower; delete is a pretty expensive operation in JavaScript: https://stackoverflow.com/questions/27397034/why-is-delete-slow-in-javascript

@manast (Contributor, Author):

Maybe "bun" is super fast, but when testing this:

```ts
import { LRUCache } from './src/cache';

const cache = new LRUCache<string>(3);

const start = Date.now();
for (let i = 0; i < 1_000_000; i++) {
  cache.put(`key${i}`, `value${i}`);
}

console.log(`Put: ${Date.now() - start}ms`);

for (let i = 0; i < 1_000_000; i++) {
  cache.get(`key${i}`);
}

// Note: measured from the same start timestamp, so this includes the put loop.
console.log(`Get: ${Date.now() - start}ms`);
```

I get these results:

```
$ bun bench-cache.ts
Put: 284ms
Get: 428ms
```

So around 2.3M ops/sec

@manast (Contributor, Author):

On an Intel i7 from 2018.

@manast (Contributor, Author):

If I increase the cache size to 3000:

```
$ bun bench-cache.ts
Put: 291ms
Get: 482ms
```

Reviewer:

Nice! Yeah, looks like bun is super optimized for this type of use case.

Review thread on worker shutdown:

```ts
// Gracefully close all workers
process.on('exit', async () => {
  for (const queueName in workers) {
    await workers[queueName].close();
  }
});
```

Reviewer:

Would that work fine if all workers share the same connection?

@manast (Contributor, Author):

Yes. The shared connection will not be closed, but all workers also have a dedicated connection for blocking calls, and this dedicated connection needs to be closed by calling worker.close().

Reviewer:

How come? worker.close() does this:

```ts
.finally(() => client.disconnect())
.finally(() => this.connection.close())
```

client here seems to be the shared connection.

(That's actually something that I addressed in taskforcesh/bullmq#2449, having encountered exactly this in our app: workers closing the shared connection.)

@manast (Contributor, Author):

But client in this context is actually a duplicate of the client that you pass to the constructor, so it is safe to close it...

```ts
const client =
  this.blockingConnection.status == 'ready'
    ? await this.blockingConnection.client
    : null;
```

@manast (Contributor, Author):

```ts
this.blockingConnection = new RedisConnection(
  isRedisInstance(opts.connection)
    ? (<Redis>opts.connection).duplicate({ connectionName })
    : { ...opts.connection, connectionName },
  false,
  true,
  opts.skipVersionCheck,
);
```

@manast (Contributor, Author):

In fact, not just safe: you must close it to avoid a leak.

Reviewer:

Oh, I see, so it effectively always creates a new connection for each worker.

Is there a reason for that? Are there limitations of shared connections that wouldn't work for BullMQ?

@manast (Contributor, Author):

Because we need connections that block, and a blocked connection cannot be shared, as no commands will be sent until the connection is unblocked.
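To illustrate the point (a minimal ioredis sketch, not code from this PR): while a blocking command is in flight, its connection cannot carry any other command, which is why each worker duplicates the client for blocking calls, and why that duplicate must be closed explicitly.

```ts
import Redis from 'ioredis';

const shared = new Redis();
const blocking = shared.duplicate(); // same options, separate socket

async function main() {
  // BLMOVE parks this socket until an element arrives or 5s elapse;
  // no other command can be sent on it in the meantime.
  const jobPromise = blocking.blmove('queue:wait', 'queue:active', 'RIGHT', 'LEFT', 5);

  // The shared connection stays free for regular commands.
  await shared.set('status', 'ok');
  console.log(await jobPromise);

  // Each duplicate owns its own socket and must be closed separately.
  await blocking.quit();
  await shared.quit();
}

main();
```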

Review thread on the job dispatch code (excerpt):

```ts
  }, workerEndpoint.timeout || 3000);

  try {
    const response = await fetch(workerEndpoint.url, {
```

Reviewer:

Are you sure you don't want to use a lightweight wrapper like wretch to write less boilerplate? Or even undici, which is also faster?

@manast (Contributor, Author):

Not really. I actually prefer fewer dependencies, unless the savings in code are really meaningful.
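For context, the plain-fetch timeout pattern the excerpt above hints at looks roughly like this (a generic sketch; `workerEndpoint` and `job` are stand-ins, not the PR's exact code):

```ts
const controller = new AbortController();
const timer = setTimeout(
  () => controller.abort(),
  workerEndpoint.timeout || 3000,
);

try {
  const response = await fetch(workerEndpoint.url, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(job),
    signal: controller.signal, // abort the request when the timer fires
  });
  // ...handle response status and body
} finally {
  clearTimeout(timer); // always clear the pending timer
}
```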

Review thread on worker initialization:

```ts
init: (redisClient: Redis | Cluster) => {
  // Load workers from Redis and start them
  debugEnabled && debug('Loading workers from Redis...');
  const stream = redisClient.hscanStream(workerMetadataKey, { count: 10 });
```

Reviewer:

Maybe something similar to what we used for queue discovery could be helpful here too:

```ts
public static async getActiveQueueIds(redis: Redis): Promise<string[]> {
  await redis.zremrangebyscore(
    QUEUE_IDS_KEY,
    '-inf',
    Date.now() - daysToMilliseconds(RETENTION_QUEUE_IDS_IN_DAYS),
  )
  const queueIds = await redis.zrange(QUEUE_IDS_KEY, 0, -1)
  return queueIds.sort()
}

public async start(): Promise<void> {
  await this.redis.zadd(QUEUE_IDS_KEY, Date.now(), this.config.queueId)
}
```

@manast (Contributor, Author) commented Feb 25, 2024:

For the proxy we need to store a complete JSON object for every queue, so I don't know if a ZSET is a good structure for this; it is a nice property of hashes that you can update a worker's options without needing to iterate through all the ZSET items.
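Concretely (a hypothetical sketch reusing `redisClient` and `workerMetadataKey` from the excerpt above; the queue name and options are made up): updating one worker's options in a hash rewrites a single field and leaves every other entry untouched.

```ts
// Rewrite one worker's metadata in place; no scan or iteration needed.
await redisClient.hset(
  workerMetadataKey,
  'my-queue', // field: the queue name
  JSON.stringify({
    endpoint: { url: 'https://my-service.example.com/process', timeout: 3000 },
  }),
);
```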

Reviewer:

Would hscan work well on a big Redis store? Wouldn't it iterate over the whole thing? Maybe some key lookup map would be helpful?

@manast (Contributor, Author):

It would only iterate over the given hash, and since it is scanning, it does not keep Redis busy. Also, this operation is only needed when restarting the proxy. The number of workers should not be very big either...
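A sketch of consuming that stream (the handler body is an assumption; ioredis' hscanStream emits flat [field, value, field, value, ...] chunks):

```ts
const stream = redisClient.hscanStream(workerMetadataKey, { count: 10 });

stream.on('data', (entries: string[]) => {
  // Each chunk holds alternating hash fields and values; only this
  // one hash is walked, a few entries per HSCAN round trip.
  for (let i = 0; i < entries.length; i += 2) {
    const queueName = entries[i];
    const workerMetadata = JSON.parse(entries[i + 1]);
    // ...start a worker for queueName using workerMetadata
  }
});

stream.on('end', () => debugEnabled && debug('All workers loaded'));
```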
