
Handle "LOADING Redis is loading the dataset in memory" #358

Open
shaharmor opened this issue Aug 15, 2016 · 20 comments
@shaharmor
Collaborator

Hi,

When a slave first connects to a master it needs to load the entire DB, which takes time.
Any command sent to that slave during this time will receive a LOADING Redis is loading the dataset in memory response.

I think we should handle this and retry the command (maybe even on a different node within the same slot).
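For illustration, a minimal application-level sketch of that kind of retry (the key name, delay, and attempt count are made up; this is a workaround at the caller, not a change inside ioredis):

const Redis = require('ioredis');

const cluster = new Redis.Cluster([{ host: '127.0.0.1', port: 6379 }]);

// Retry a read a few times if the target node is still loading its dataset.
async function getWithLoadingRetry(key, attempts = 3) {
  for (let i = 0; i < attempts; i++) {
    try {
      return await cluster.get(key);
    } catch (err) {
      if (!/^LOADING/.test(err.message) || i === attempts - 1) throw err;
      await new Promise((resolve) => setTimeout(resolve, 200)); // brief back-off, then retry
    }
  }
}

The proposal here would move that retry inside ioredis itself, ideally directing it at another node serving the same slot.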

@luin thoughts?

@shaharmor
Collaborator Author

It's possible that during a failover to a slave, the old master will sync from the new master and return this error, which makes the whole failover mechanism not so failsafe.

@luin
Collaborator

luin commented Aug 15, 2016

ioredis already supports detecting loading in standalone mode: https://github.com/luin/ioredis/blob/master/lib/redis.js#L420-L428. Seems we just need to wait for the "ready" event of the new redis node here: https://github.com/luin/ioredis/blob/master/lib/cluster/connection_pool.js#L58-L63
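For reference, a minimal standalone sketch of that detection (host and port are placeholders): with enableReadyCheck left at its default, the client keeps re-running the INFO-based check and only emits "ready" once the server no longer reports that it is loading.

const Redis = require('ioredis');
const redis = new Redis({ host: '127.0.0.1', port: 6379 }); // enableReadyCheck defaults to true

redis.on('connect', () => console.log('TCP connection established'));
// "ready" fires only after the ready check passes, i.e. the node has
// finished loading its dataset into memory.
redis.on('ready', () => console.log('node finished loading; safe to send commands'));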

@shaharmor shaharmor added bug and removed question labels Aug 15, 2016
@shaharmor
Collaborator Author

@luin something like this?

redis = new Redis(_.defaults({
  retryStrategy: null,
  readOnly: readOnly
}, node, this.redisOptions, { lazyConnect: true }));

var _this = this;
// Register the node in the pool only after the ready check has passed,
// i.e. once the node has finished loading its dataset.
redis._readyCheck(function (err) {
  // TODO: handle error
  _this.nodes.all[node.key] = redis;
  _this.nodes[readOnly ? 'slave' : 'master'][node.key] = redis;

  redis.once('end', function () {
    delete _this.nodes.all[node.key];
    delete _this.nodes.master[node.key];
    delete _this.nodes.slave[node.key];
    _this.emit('-node', redis);
    if (!Object.keys(_this.nodes.all).length) {
      _this.emit('drain');
    }
  });

  _this.emit('+node', redis);

  redis.on('error', function (error) {
    _this.emit('nodeError', error);
  });
});

Also, how should we handle an error in the _readyCheck function?

@luin
Collaborator

luin commented Sep 9, 2016

Hmm...I just checked the code, and it seems that when a node has not finished loading data from disk, the commands sent to it will be added to its offline queue instead of being sent to Redis immediately.
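For illustration, a minimal standalone sketch of that behaviour (the key name is made up): with enableOfflineQueue left at its default of true, a command issued before "ready" sits in the offline queue and is only sent once the node is ready.

const Redis = require('ioredis');
// enableOfflineQueue defaults to true; shown explicitly for clarity.
const redis = new Redis({ host: '127.0.0.1', port: 6379, enableOfflineQueue: true });

// Issued right after construction, while the node may still be loading:
// the command is buffered in the offline queue and flushed on "ready".
redis.get('some-key', function (err, value) {
  if (err) return console.error(err);
  console.log(value);
});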

@shaharmor
Collaborator Author

So that means this should already be fixed? I've seen this happen in production, so it's definitely an issue.

Could it be that it only happens to slaves, or when using scaleReads?

@shaharmor
Collaborator Author

It's also possible that it happens if the slave was connected at some point but then got restarted for some reason.

@luin
Collaborator

luin commented Sep 9, 2016

That's strange. Whether the node is a slave or a master doesn't affect the offline queue support. Are you able to reproduce the issue? Or maybe enable the debug log?
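(For reference, ioredis logs through the debug module, so the log can usually be enabled by starting the app with the DEBUG environment variable set; app.js below is a placeholder for your entry point.)

DEBUG=ioredis:* node app.js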

@kishorpawar

I found this issue when I did the following:

  1. I accidentally ran FLUSHALL in redis-cli (I had been trying to press Ctrl-D).
  2. Without stopping redis-server, I copied a backed-up RDB file over dump.rdb and restarted redis-server. It turned out the copy did not actually take effect.
  3. I stopped redis-server, then copied the backed-up RDB file over dump.rdb and started redis-server. This time the copy worked.
  4. Started redis-cli.
  5. Ran KEYS * and got (error) LOADING Redis is loading the dataset in memory (a quick way to confirm this state is shown below).
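For anyone hitting step 5: INFO is one of the commands Redis still answers while loading, and its persistence section keeps reporting loading:1 until the RDB file has been read back into memory, so something like the following can confirm the state (default port assumed):

redis-cli -p 6379 INFO persistence | grep ^loading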

@kaidiren

@shaharmor So how did you deal with it in the end?

@stale

stale bot commented Oct 23, 2017

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 7 days if no further activity occurs, but feel free to re-open a closed issue if needed.

@stale stale bot added the wontfix label Oct 23, 2017
@stale stale bot closed this as completed Oct 30, 2017
@shaharmor
Collaborator Author

Hey @luin, I just encountered this issue again, and I think we should see how we can fix it.

@shaharmor shaharmor reopened this Feb 13, 2018
@stale stale bot removed the wontfix label Feb 13, 2018
@stale

stale bot commented Mar 15, 2018

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 7 days if no further activity occurs, but feel free to re-open a closed issue if needed.

@stale stale bot added the wontfix label Mar 15, 2018
@stale stale bot closed this as completed Mar 22, 2018
@Eywek

Eywek commented Apr 9, 2019

Hello,

Any news on this? I got the same error on ioredis v4.0.10

@alavers
Contributor

alavers commented Apr 5, 2020

@Eywek @shaharmor Do you have any more details on how you reproduce this issue?

Is it possible you're connected to a slave that has begun a resync, e.g. if the master it was pointing to performed a failover? A Redis slave returns -LOADING errors during a resync, which might explain how you encounter them without a connection reset.

What happens if you implement a reconnectOnError that returns 2 when a LOADING error is encountered?

@xiandong79

Any update?

@alavers
Contributor

alavers commented Apr 15, 2020

I have a hypothesis that an error handler like this:

    reconnectOnError: function(err) {
      if (err.message.includes("LOADING")) {
        return 2;
      }
    }

might solve this problem (returning 2 tells ioredis to reconnect and then resend the failed command) and, if so, should perhaps be made default ioredis behavior. But I haven't built a repeatable way to reproduce this issue.

@bartpeeters

We were able to reproduce this issue by setting up an AWS ElastiCache cluster with the following config:

  • 3 shards, 1 replica per shard
  • Engine: Clustered Redis
  • Engine Version Compatibility: 3.2.10
  • Auto-failover: enabled

We filled this cluster with about 700 MB of data.
Then we set up an ioredis application that continuously sent redis.get calls, all with keys belonging to hash slots of one of our shards.
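For reference, a rough sketch of that kind of load generator (it reuses the cluster config shown further down; the key pattern mirrors the theKey names seen in the errors, and the real test additionally restricted keys to one shard's hash slots):

const Redis = require('ioredis');

const cluster = new Redis.Cluster(
  [{ host: 'bart-test.rmoljo.clustercfg.euw1.cache.amazonaws.com', port: 6379 }],
  { enableReadyCheck: true, scaleReads: 'slave' }
);

// Continuously read keys; LOADING replies surface in the catch handler.
setInterval(() => {
  const key = 'theKey' + Math.floor(Math.random() * 100000);
  cluster.get(key).catch((err) => {
    console.log(`got error during get key ${key}, error: ${err}`);
  });
}, 10);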

We deleted the replica node in the chosen shard; no gets failed.

But when we added a node back to this shard, we got multiple errors like:

got error during get key theKey93923, error: ReplyError: LOADING Redis is loading the dataset in memory

We used the following config for ioredis:

const Redis = require('ioredis');

const redis = new Redis.Cluster(
  [
    {
      host: 'bart-test.rmoljo.clustercfg.euw1.cache.amazonaws.com',
      port: 6379,
    },
  ],
  {
    enableReadyCheck: true,
    scaleReads: 'slave',
  }
);

Using @alavers's snippet did indeed solve the issue:

const redis = new Redis.Cluster(
  [
    {
      host: 'bart-test.rmoljo.clustercfg.euw1.cache.amazonaws.com',
      port: 6379,
    },
  ],
  {
    enableReadyCheck: true,
    scaleReads: 'slave',
    redisOptions: {
      reconnectOnError: function (err) {
        if (err.message.includes("LOADING")) {
          console.log('got one of dem loading ones');
          return 2;
        }
      },
    },
  }
);

We see the log message

got one of dem loading ones

and not a single error.

Note that we were only able to reproduce it if we used the option scaleReads: 'slave'.

We also tried the exact same scenario with a Redis Cluster on our dev PC and were unable to reproduce it that way.
ioredis kept sending requests to the master while the new replica node was LOADING the Redis dataset into memory.
No idea why the behaviour differs between an ElastiCache and a non-ElastiCache Redis Cluster.

@bartpeeters

Should we make @alavers's error handler:

    reconnectOnError: function(err) {
      if (err.message.includes("LOADING")) {
        return 2;
      }
    }

the default ioredis behaviour, given that we were able to reproduce the issue (see the comment above)?

If yes, we could open a PR for this.

@michel-el-hajj

Sometimes this means you simply have too much data in Redis, and on a restart Redis has to load all of that data back into memory. This leads to a long loading phase during which every query is blocked. If the data isn't important, delete the saved data on the server and restart Redis again. FLUSHALL won't help, because it just queues up behind the loading; you need to delete the data files directly.

@hktalent

@shaharmor On macOS (Homebrew-installed Redis):

# WARNING: permanently deletes the on-disk dataset (RDB/AOF files)
rm -rf /usr/local/var/db/redis/*
# restart the Homebrew Redis service
brew services restart redis
# flush database 4 once the server is back up
redis-cli -n 4 FLUSHDB
