redis may lose latency records generated by expire-cycle #4453

Closed · soloestoy opened this issue Nov 20, 2017 · 7 comments


soloestoy (Collaborator) commented Nov 20, 2017

I have an instance with only a few dozen key-value pairs, all in db0, and some of them are very big, e.g. a list containing millions of elements.

The big keys had expire times set, so when the background expire cycle deleted them, clients saw high latency. I expected to find latency records for the expire-cycle event, but there were none.
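
For context, this is the kind of check I mean, assuming latency monitoring is enabled via latency-monitor-threshold (the empty reply shows the symptom: no expire-cycle samples are recorded):

```
127.0.0.1:6379> CONFIG SET latency-monitor-threshold 100
OK
127.0.0.1:6379> LATENCY HISTORY expire-cycle
(empty list or set)
```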

I checked the function activeExpireCycle (in src/expire.c) and found some bugs. Here they are:

```c
void activeExpireCycle(int type) {
    ...
    int j, iteration = 0;
    for (j = 0; j < dbs_per_call; j++) {
        do {
            /* If there is nothing to expire try next DB ASAP. */
            if ((num = dictSize(db->expires)) == 0) {
                db->avg_ttl = 0;
                break;
            }
            /* When there are less than 1% filled slots getting random
             * keys is expensive, so stop here waiting for better times...
             * The dictionary will be resized asap. */
            if (num && slots > DICT_HT_INITIAL_SIZE &&
                (num*100/slots < 1)) break;
            ...
            /* We can't block forever here even if there are many keys to
             * expire. So after a given amount of milliseconds return to the
             * caller waiting for the other active expire cycle. */
            iteration++;
            if ((iteration & 0xf) == 0) { /* check once every 16 iterations. */
                long long elapsed = ustime()-start;

                latencyAddSampleIfNeeded("expire-cycle",elapsed/1000);
                if (elapsed > timelimit) timelimit_exit = 1;
            }
            if (timelimit_exit) return;
            /* We don't repeat the cycle if there are less than 25% of keys
             * found expired in the current DB. */
        } while (expired > ACTIVE_EXPIRE_CYCLE_LOOKUPS_PER_LOOP/4);
    }
}
```
  1. From the function we can see that we call latencyAddSampleIfNeeded every 16 iterations, but unfortunately there are two breaks before iteration++, so iteration may never reach 16, which also means timelimit_exit is never set to 1. So we should move iteration++ above the breaks.

  2. Checking elapsed only once every 16 iterations is a good idea to save CPU time, but if server.dbnum is smaller than 16, iteration may again never reach a multiple of 16 before the outer loop ends. So we should add a condition like (iteration % server.dbnum) == 0 to protect against that.

  3. I think it is not necessary to call latencyAddSampleIfNeeded every 16 iterations: calling it when we hit the time limit is enough, plus once after the expire cycle when we don't hit the limit (see the sketch after this list).
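
Taken together, the three points above suggest a control flow along these lines (a condensed sketch only, with the same elisions as the excerpt above, not a literal patch):

```c
void activeExpireCycle(int type) {
    ...
    int j, iteration = 0;
    for (j = 0; j < dbs_per_call && !timelimit_exit; j++) {
        do {
            /* (1) Count the iteration first, so no break can skip it. */
            iteration++;

            /* If there is nothing to expire try next DB ASAP. */
            if ((num = dictSize(db->expires)) == 0) {
                db->avg_ttl = 0;
                break;
            }
            ...
            /* (2) Check the time limit every 16 iterations, and also every
             * time we have visited all DBs once, so that a server.dbnum
             * smaller than 16 cannot keep the check from ever firing. */
            if ((iteration & 0xf) == 0 ||
                (iteration % server.dbnum) == 0)
            {
                if (ustime()-start > timelimit) {
                    timelimit_exit = 1;
                    break;
                }
            }
        } while (expired > ACTIVE_EXPIRE_CYCLE_LOOKUPS_PER_LOOP/4);
    }

    /* (3) Record a single latency sample at cycle exit, whether or not
     * the time limit was reached. */
    latencyAddSampleIfNeeded("expire-cycle",(ustime()-start)/1000);
}
```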

antirez (Contributor) commented Nov 21, 2017

Excellent analysis, thanks! What about this slightly modified fix?

```diff
diff --git a/src/expire.c b/src/expire.c
index a02fe566..b001ea5e 100644
--- a/src/expire.c
+++ b/src/expire.c
@@ -140,7 +140,7 @@ void activeExpireCycle(int type) {
     if (type == ACTIVE_EXPIRE_CYCLE_FAST)
         timelimit = ACTIVE_EXPIRE_CYCLE_FAST_DURATION; /* in microseconds. */
 
-    for (j = 0; j < dbs_per_call; j++) {
+    for (j = 0; j < dbs_per_call && timelimit_exit == 0; j++) {
         int expired;
         redisDb *db = server.db+(current_db % server.dbnum);
 
@@ -155,6 +155,7 @@ void activeExpireCycle(int type) {
             unsigned long num, slots;
             long long now, ttl_sum;
             int ttl_samples;
+            iteration++;
 
             /* If there is nothing to expire try next DB ASAP. */
             if ((num = dictSize(db->expires)) == 0) {
@@ -207,18 +208,18 @@ void activeExpireCycle(int type) {
             /* We can't block forever here even if there are many keys to
              * expire. So after a given amount of milliseconds return to the
              * caller waiting for the other active expire cycle. */
-            iteration++;
             if ((iteration & 0xf) == 0) { /* check once every 16 iterations. */
                 long long elapsed = ustime()-start;
-
-                latencyAddSampleIfNeeded("expire-cycle",elapsed/1000);
                 if (elapsed > timelimit) timelimit_exit = 1;
+                break;
             }
-            if (timelimit_exit) return;
             /* We don't repeat the cycle if there are less than 25% of keys
              * found expired in the current DB. */
         } while (expired > ACTIVE_EXPIRE_CYCLE_LOOKUPS_PER_LOOP/4);
     }
+
+    long long elapsed = ustime()-start;
+    latencyAddSampleIfNeeded("expire-cycle",elapsed/1000);
 }
```

It has the following advantages:

  1. No duplication of the latencyAddSampleIfNeeded() call. We do it just once, at cycle exit.
  2. We avoid the (iteration % dbs_per_call) == 0 conditional. We log the latency event at exit anyway, and if the internal while loop never loops enough times to reach 16 iterations, we don't care about entering the check at all: we'll exit anyway once we have run all the DBs scheduled for this cycle.

Anyway it can happen that we are unlucky and expire successive very big keys one after the other until we reach 16 iterations, and only then realize we are doing too much work, but this is kind of a tradeoff... Makes sense?

antirez (Contributor) commented Nov 21, 2017

P.S. @soloestoy, if you agree with the above, could you update your PR? I would like to merge your code to retain attribution for the discovery & fix.

soloestoy (Collaborator, Author) commented

@antirez I agree with the above and have updated my PR. It's a great honor, thank you! ^_^

P.S. we should put the break inside the {} together with timelimit_exit = 1.

antirez (Contributor) commented Nov 21, 2017

My honor and pleasure :-) Yep, indeed, the break goes inside the conditional; sorry for the mistake.
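
For clarity, the corrected hunk would read along these lines (a sketch of the intended placement, not the verbatim commit):

```c
if ((iteration & 0xf) == 0) { /* check once every 16 iterations. */
    long long elapsed = ustime()-start;
    if (elapsed > timelimit) {
        timelimit_exit = 1;
        break;
    }
}
```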

antirez (Contributor) commented Nov 21, 2017

Btw I tested a (fixed) version while writing a regression test for this, and apparently it works as advertised. However, I'm not sure whether to commit the test: being time-based, it's the kind of test that will randomly fail forever. We already have a few of those and it's terrible :-)

antirez (Contributor) commented Nov 24, 2017

Merged, @soloestoy, thank you. I remember that when we met in Hangzhou you also showed me different fixes related to object frequency. Are those already PRed? Thanks.

soloestoy (Collaborator, Author) commented

@antirez yes, I think we can use LFU to find hot keys; see PR #4392, and some other PRs: #4418, #4210.
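
For reference, once an LFU maxmemory policy is configured, the per-key frequency counter can be inspected directly; a typical session on Redis 4.0+, where OBJECT FREQ requires an lfu eviction policy (key name and counter value here are illustrative):

```
127.0.0.1:6379> CONFIG SET maxmemory-policy allkeys-lfu
OK
127.0.0.1:6379> OBJECT FREQ somekey
(integer) 5
```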

antirez closed this as completed Mar 3, 2018