Fix dictScan(): It can't scan all buckets when dict is shrinking. #4907

youjiali1995 · 2018-05-08T09:05:44Z

The only difference is when iterating over larger table, cursor is increased in reverse:

        /* Iterate over indices in larger table that are the expansion
         * of the index pointed to by the cursor in the smaller table */
        do {
            /* Emit entries at cursor */
            if (bucketfn) bucketfn(privdata, &t1->table[v & m1]);
            de = t1->table[v & m1];
            while (de) {
                next = de->next;
                fn(privdata, de);
                de = next;
            }

            /* Increment the reverse cursor not covered by the smaller mask.*/
            v |= ~m1;
            v = rev(v);
            v++;
            v = rev(v);

            /* Continue while bits covered by mask difference is non-zero */
        } while (v & (m0 ^ m1));

When jumping out of the loop, the highest bit of (v & m0) is increased, so move following code to if block:

        /* Set unmasked bits so incrementing the reversed cursor
         * operates on the masked bits */
        v |= ~m0;

        /* Increment the reverse cursor */
        v = rev(v);
        v++;
        v = rev(v);

antirez · 2018-05-15T08:19:45Z

Thanks for this submission. Before looking at this patch I invoke the spirit of @pietern that wrote the original code, because I will be more comfortable changing this code (that so much helped Redis) after his nod :-D

antirez · 2018-06-01T14:53:59Z

Thanks @youjiali1995, I've checked the problem, verified that your solution is sounding, and created a simulation that allows us to study SCAN better in case of new problems (https://github.com/antirez/dict-scan-fuzz-tester). I'm merging your Pull Request in all the branches.

Please note that, fortunately, only SCAN is affected. HSCAN, ZSCAN and SSCAN should be immune to the problem because their dictionaries always expand from a smaller to a bigger one. The issue you found only affects dictScan() when going to smaller hash tables. Thank you for your contribution!

antirez · 2018-06-01T15:19:08Z

Just one final thing @youjiali1995, I would love to know how you have found this bug.

youjiali1995 · 2018-06-01T15:30:20Z

Because we use scan to clear data. But sometimes there are some keys left, and we can reproduce this problem. So I read the code and find the bug. @antirez

antirez · 2018-06-01T15:32:51Z

Thanks, awesome process to find the bug. I appreciate that instead of just posting an issue you went the extra mile and read the code to understand what was happening.

Fix dictScan(): It can't scan all buckets when dict is shrinking.

beijingzhangwei · 2021-11-16T15:33:36Z

@youjiali1995 咨询你个问题：字典扫描都是从小表开始，游标从零开始，缩容的时候，参考你们博客https://tech.meituan.com/2018/07/27/redis-rehash-practice-optimization.html 游标返回的范围不应该是小表的范围吗？看博客是示例应该0-7，为什么会出现返回20为游标的情况呢？（我知道如果人工指定游标参数的话会出现漏key情况）

sundb · 2021-11-17T03:20:02Z

@beijingzhangwei You should preferably use English to describe the problem.
Can you provide more information? How much does tablesize shrink from?
Are you sure it was still rehashing when you call scan?

beijingzhangwei · 2021-11-17T03:39:19Z

@youjiali1995 Refer to your company blog https://tech.meituan.com/2018/07/27/redis-rehash-practice-optimization.html
Shouldn't the range returned by the cursor be the range of the small table? Looking at the blog, the example should be 0-7. Why does it return 20 as the cursor? (I know that if the cursor parameters are manually specified, there will be key leakage)

beijingzhangwei · 2021-11-17T03:41:27Z

@beijingzhangwei You should preferably use English to describe the problem. Can you provide more information? How much does tablesize shrink from? Are you sure it was still rehashing when you call scan?

ok，i use english. My question come from @youjiali1995 company blog, which describe the bug.

sundb · 2021-11-17T04:05:22Z

@beijingzhangwei Not sure I fully understand what you meant.
Cursor 20 was fetched before rehashing, so returning 20 is possible.
After that, the cursor will be 0-7.

beijingzhangwei · 2021-11-17T04:40:40Z

Thx. I get it. 发自我的iPhone

…

------------------ Original ------------------ From: sundb ***@***.***> Date: Wed,Nov 17,2021 0:05 PM To: redis/redis ***@***.***> Cc: beijingzw ***@***.***>, Mention ***@***.***> Subject: Re: [redis/redis] Fix dictScan(): It can't scan all buckets when dict is shrinking. (#4907) @beijingzhangwei Not sure I fully understand what you meant. Cursor 20 was fetched before rehashing, so returning 20 is possible. After that, the cursor will be 0-7. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe. Triage notifications on the go with GitHub Mobile for iOS or Android.

Fix dictScan(): It can't scan all buckets when dict is shrinking.

Fix dictScan(): It can't scan all buckets when dict is shrinking.

8d93f92

youjiali1995 mentioned this pull request May 8, 2018

dictScan() can't scan all buckets when dict is shrinking. #4906

Closed

trevor211 approved these changes May 14, 2018

View reviewed changes

antirez merged commit 86de089 into redis:unstable Jun 1, 2018

youjiali1995 deleted the fix-dictScan branch June 1, 2018 15:45

JackieXie168 pushed a commit to JackieXie168/redis that referenced this pull request Dec 17, 2018

Merge pull request redis#4907 from youjiali1995/fix-dictScan

acaeed0

Fix dictScan(): It can't scan all buckets when dict is shrinking.

pulllock pushed a commit to pulllock/redis that referenced this pull request Jun 28, 2023

Merge pull request redis#4907 from youjiali1995/fix-dictScan

5418bfb

Fix dictScan(): It can't scan all buckets when dict is shrinking.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix dictScan(): It can't scan all buckets when dict is shrinking. #4907

Fix dictScan(): It can't scan all buckets when dict is shrinking. #4907

youjiali1995 commented May 8, 2018 •

edited

antirez commented May 15, 2018

antirez commented Jun 1, 2018

antirez commented Jun 1, 2018

youjiali1995 commented Jun 1, 2018 •

edited

antirez commented Jun 1, 2018

beijingzhangwei commented Nov 16, 2021

sundb commented Nov 17, 2021 •

edited

beijingzhangwei commented Nov 17, 2021

beijingzhangwei commented Nov 17, 2021

sundb commented Nov 17, 2021

beijingzhangwei commented Nov 17, 2021 via email

Fix dictScan(): It can't scan all buckets when dict is shrinking. #4907

Fix dictScan(): It can't scan all buckets when dict is shrinking. #4907

Conversation

youjiali1995 commented May 8, 2018 • edited

antirez commented May 15, 2018

antirez commented Jun 1, 2018

antirez commented Jun 1, 2018

youjiali1995 commented Jun 1, 2018 • edited

antirez commented Jun 1, 2018

beijingzhangwei commented Nov 16, 2021

sundb commented Nov 17, 2021 • edited

beijingzhangwei commented Nov 17, 2021

beijingzhangwei commented Nov 17, 2021

sundb commented Nov 17, 2021

beijingzhangwei commented Nov 17, 2021 via email

youjiali1995 commented May 8, 2018 •

edited

youjiali1995 commented Jun 1, 2018 •

edited

sundb commented Nov 17, 2021 •

edited