Elasticache Jobs not always starting #658

dolpm · 2021-07-23T00:03:29Z

I am using Bull to handle a checkout flow. Oddly enough, when using elasticache my queued jobs do not always start. It is a pretty basic configuration (shown below). They are always added, and in the case that they do not start, a user can just re-start the checkout flow and once again there is around a 50/50 that it will begin.

My development environment (redislabs) works just fine...

For elasticache I am directly connecting to a single shard. I have also tried by connecting using cluster-mode and the same issue is occurring.... I use this version of IORedis on a bunch of other applications on this shard without issue.

Versions:
Bullmq: v1.40.0
IORedis: v4.27.6
Elasticache engine version: v5.0.6

Update: getting this error after some time

manast · 2021-07-23T08:36:04Z

Seems to me like there is some kind of connection issue. Can you provide more information on what you mean with "restart the checkout flow" ?

dolpm · 2021-07-23T12:49:14Z

In the context of Bull, restarting the checkout flow will mean that a new task gets added to the queue.

This code gets called and is successful every time BUT it is a toss up whether the task will actually start:

Nothing is blocking the queue, though. All tasks will run immediately IF they are able to start (even after another doesn't start).

I agree that is a connection issue, however, I have tried a bunch of different connections that have the same result.

Cluster implementation:

manast · 2021-07-23T13:51:33Z

I still do not understand, "restart the checkout flow" === adding a job? So you mean, after adding a new job it may start processing again older jobs?

dolpm · 2021-07-23T14:18:14Z

No. It will not start processing older jobs - that is why it is confusing.... Sometimes jobs that get added to the queue just never start.

Lets say a user is checking out an item, however the process breaks. The job will be added to the queue but never start processing.

After waiting for it to finish, they decide to try again. Lets say it is successful this time. In this case, a new job will be added to the queue and this new job will process immediately as if the original never existed in the first place.

The original job that didn't start doesn't block the queue at all.

manast · 2021-07-23T14:52:47Z

But the Worker class is independent from the Queue class for adding jobs so it does not make any sense. I suggest you the following, write the simplest code that you can that just adds and process dummy jobs (that do not do anything). Since you say it is 50% chance of this happening you should be able to reproduce it easily. If you do not succeed then you know the issue is in the more complex code.

dolpm · 2021-07-23T21:29:14Z

Will do. Is it possible that this has something to do w/ re-using the connection for the queue and worker?

dolpm · 2021-07-23T21:45:28Z

works well until it doesn't.

manast · 2021-07-23T22:10:21Z

Can you provide the code that reproduces it?

dolpm · 2021-07-23T22:17:00Z

/* eslint-disable import/prefer-default-export */
import { Queue, Worker } from 'bullmq';
import IORedis from 'ioredis';

import processCheckout from './worker';

const QUEUE_NAME = 'checkouts';
const QUEUE_PREFIX = '{checkouts}';

let connection;
if (
  process.env.NODE_ENV === 'production' ||
  process.env.NODE_ENV === 'staging'
) {
  connection = new IORedis.Cluster([{ host: process.env.REDIS_URI }], {
    slotsRefreshTimeout: 1500,
    scaleReads: 'all',
    redisOptions: {
      showFriendlyErrorStack: true,
      lazyConnect: true
    }
  });
} else {
  connection = new IORedis(process.env.REDIS_URI);
}

const CheckoutQueue = new Queue(QUEUE_NAME, {
  prefix: QUEUE_PREFIX,
  connection
});

/*
const worker = new Worker(QUEUE_NAME, processCheckout, {
  prefix: QUEUE_PREFIX,
  connection
});
*/

const testWorker = new Worker(
  QUEUE_NAME,
  async (job) => {
    console.log('job started:', job.id, '\n');
  },
  {
    prefix: QUEUE_PREFIX,
    connection
  }
);

setInterval(async () => {
  const job = await CheckoutQueue.add(new Date().toString(), {}, {});
  console.log('job added:', job.id);
}, 5000);

dolpm · 2021-07-24T16:08:46Z

Ended up following the issue back to webpack.

dolpm closed this as completed Jul 24, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Elasticache Jobs not always starting #658

Elasticache Jobs not always starting #658

dolpm commented Jul 23, 2021 •

edited

manast commented Jul 23, 2021

dolpm commented Jul 23, 2021

manast commented Jul 23, 2021

dolpm commented Jul 23, 2021 •

edited

manast commented Jul 23, 2021 •

edited

dolpm commented Jul 23, 2021

dolpm commented Jul 23, 2021

manast commented Jul 23, 2021

dolpm commented Jul 23, 2021

dolpm commented Jul 24, 2021

Elasticache Jobs not always starting #658

Elasticache Jobs not always starting #658

Comments

dolpm commented Jul 23, 2021 • edited

manast commented Jul 23, 2021

dolpm commented Jul 23, 2021

manast commented Jul 23, 2021

dolpm commented Jul 23, 2021 • edited

manast commented Jul 23, 2021 • edited

dolpm commented Jul 23, 2021

dolpm commented Jul 23, 2021

manast commented Jul 23, 2021

dolpm commented Jul 23, 2021

dolpm commented Jul 24, 2021

dolpm commented Jul 23, 2021 •

edited

dolpm commented Jul 23, 2021 •

edited

manast commented Jul 23, 2021 •

edited