Multi-key commands about lookupKeyWrite and lookupKeyRead #7475

yangbodong22011 · 2020-07-07T02:19:28Z

Hi,
In Redis, when we operate the key, we need lookupKeyWrite or lookupKeyRead, but under the multi-key command, this flag is not strict.

E.g:

void sunionDiffGenericCommand(client *c, robj **setkeys, int setnum,
                              robj *dstkey, int op) {
    ...

    for (j = 0; j < setnum; j++) {
        robj *setobj = dstkey ?
            lookupKeyWrite(c->db,setkeys[j]) :
            lookupKeyRead(c->db,setkeys[j]);
        ...
    }
}

According to whether dstkey exists, use lookupKeyWrite or lookupKeyRead respectively, but should only use lookupKeyRead?

void zunionInterGenericCommand(client *c, robj *dstkey, int op) {
    ...
    /* read keys to be used for input */
    src = zcalloc(sizeof(zsetopsrc) * setnum);
    for (i = 0, j = 3; i < setnum; i++, j++) {
        robj *obj = lookupKeyWrite(c->db,c->argv[j]);
        
        ...
    }
}

Similarly, should lookupKeyRead also be used here?

If the judgment is based on the read and write attributes of the command itself, should georadius be modified to lookupKeyWrite?

{"georadius",georadiusCommand,-6,
     "write @geo",
     0,georadiusGetKeys,1,1,1,0,0,0},

void georadiusGeneric(client *c, int flags) {
    /* Look up the requested zset */
    robj *zobj = NULL;
    if ((zobj = lookupKeyReadOrReply(c, key, shared.emptyarray)) == NULL ||  // command is "write @geo"
        checkType(c, zobj, OBJ_ZSET)) {
        return;
    }
}

I think it should be decided according to the specific operation of this key, so the above should be changed to lookupKeyRead.

The text was updated successfully, but these errors were encountered:

soloestoy · 2020-07-07T02:40:09Z

Makes sense, @oranagra what do you think?

oranagra · 2020-07-07T06:08:43Z

To be honest i never really understood that part and the reasoning behind it very well.

But the logic in sunionDiffGenericCommand makes it very clear what was the intention.
i.e. open the source keys with either read or write mode depending on if the command is a write or read one.

96ffb2fe97 (Pieter Noordhuis 2010-07-02 19:57:12 +0200  941)         robj *setobj = dstkey ?
96ffb2fe97 (Pieter Noordhuis 2010-07-02 19:57:12 +0200  942)             lookupKeyWrite(c->db,setkeys[j]) :
96ffb2fe97 (Pieter Noordhuis 2010-07-02 19:57:12 +0200  943)             lookupKeyRead(c->db,setkeys[j]);

In SUNION vs SUNIONSTORE is may be clear from the command flags, but in commands such as SORT and GEORADIUS we need to look for the STORE argument, which is what i did for SORT in this recent PR: #5390

I personally think we should continue in that path, and make the code consistent with itself, eventually we should be able to tell (and add statistics) of both considerations:

keys opened as part of read command vs write command.
keys opened for the purpose of reading, modifying, or overwriting them.

but most important is obviously not to have bugs 8-), so let's look at the differences between lookupKeyWrite and lookupKeyRead to see if something is missing.

putting aside key miss keyspace notification (which is a recent addition and may be just missing from lookupKeyWrite by mistake), and putting aside statistics.

It's a bit hard to read, but as far as i can tell, the only actual diff in behavior is that lookupKeyRead can return NULL on a replica if the current client is not the master, when the key already expired but not yet deleted by the master (so expireIfNeeded doesn't delete it). and even that only happens on pure READONLY commands.

So the only bugs that can happen by using the wrong lookup function are when there are reads executed directly to the replica, but for these (READONLY commands), we never even consider calling lookupKeyWrite.
And for non-READONLY commands, the only difference between calling lookupKeyRead and lookupKeyWrite would be statistics (and keymiss event). right?

@soloestoy @yossigo @guybe7 (@antirez) please validate my understanding / reasoning.
if we agree on that, i think there's a need for a small refactoring / renaming so that this is clearer by reading the code.

guybe7 · 2020-07-07T07:38:12Z

@oranagra TBH i don't really see the point of knowing which "keys opened as part of read command vs write command"...
IMHO what we need to keep track of is only "keys opened for the purpose of reading, modifying, or overwriting them".
i'm not a Redis user so maybe i'm missing something...
i opened a related issue a while back: #6842 - it refers specifically to the issue with expired keys on a replica

oranagra · 2020-07-07T09:23:42Z

@guybe7 i also don't see why we wanna know if the open was part of a read or write, but clearly it mattered to Pieter, and maybe to Salvatore (merged my #5390), so for now i prefer keep that indication and not use it, rather than refactor it out of the code base.

i forgot about #6842, but i now realize that the problem there is not actually the use of lookupKeyRead vs lookupKeyWrite, but rather the different behavior for non-READONLY commands (which looks like a bug), more details there..

soloestoy · 2020-07-07T09:44:05Z

I agree with @guybe7 , the judgement of using lookupKeyRead or lookupKeyWrite should depend on the purpose instead of the command, cause write command may read some keys and modify the others.

Also, we need remove the check of CMD_READONLY in lookupKeyReadWithFlags I think.

yossigo · 2020-07-07T19:04:53Z

Trying to sum it up:

lookupKeyRead() and lookupKeyWrite() indicate what we intend to do with the key and should not be affected by the command itself.
Expired keys should never be hidden from commands originating from a master or AOF. Typically these would always be write commands (especially now that Lua scripts don't replicate) but the criteria is their origin and not whether or not they're writing.
We may want to preserve the RO/RW context of the command as an intermediate step just in case we missed something, but we can do it explicitly rather than rely on the command flags which is not always accurate.
Does that make sense?

yangbodong22011 · 2020-07-08T14:07:00Z

For the code modification, I want to confirm:

lookupKeyRead or lookupKeyWrite depend on the purpose instead of the command.
remove the check of CMD_READONLY in lookupKeyReadWithFlags.

Please tell me any other suggestions.

…9572) Writable replicas now no longer use the values of expired keys. Expired keys are deleted when lookupKeyWrite() is used, even on a writable replica. Previously, writable replicas could use the value of an expired key in write commands such as INCR, SUNIONSTORE, etc.. This commit also sorts out the mess around the functions lookupKeyRead() and lookupKeyWrite() so they now indicate what we intend to do with the key and are not affected by the command calling them. Multi-key commands like SUNIONSTORE, ZUNIONSTORE, COPY and SORT with the store option now use lookupKeyRead() for the keys they're reading from (which will not allow reading from logically expired keys). This commit also fixes a bug where PFCOUNT could return a value of an expired key. Test modules commands have their readonly and write flags updated to correctly reflect their lookups for reading or writing. Modules are not required to correctly reflect this in their command flags, but this change is made for consistency since the tests serve as usage examples. Fixes #6842. Fixes #7475.

…edis#9572) Writable replicas now no longer use the values of expired keys. Expired keys are deleted when lookupKeyWrite() is used, even on a writable replica. Previously, writable replicas could use the value of an expired key in write commands such as INCR, SUNIONSTORE, etc.. This commit also sorts out the mess around the functions lookupKeyRead() and lookupKeyWrite() so they now indicate what we intend to do with the key and are not affected by the command calling them. Multi-key commands like SUNIONSTORE, ZUNIONSTORE, COPY and SORT with the store option now use lookupKeyRead() for the keys they're reading from (which will not allow reading from logically expired keys). This commit also fixes a bug where PFCOUNT could return a value of an expired key. Test modules commands have their readonly and write flags updated to correctly reflect their lookups for reading or writing. Modules are not required to correctly reflect this in their command flags, but this change is made for consistency since the tests serve as usage examples. Fixes redis#6842. Fixes redis#7475.

oranagra added the state:needs-design the solution is not obvious and some effort should be made to design it label Jul 21, 2020

oranagra added this to the Redis 6.2 milestone Jul 21, 2020

itamarhaber added this to To do in 6.2 Aug 25, 2020

oranagra removed this from To do in 6.2 Oct 19, 2020

oranagra mentioned this issue Oct 28, 2020

Add COPY command #7953

Merged

oranagra mentioned this issue Nov 11, 2020

Writable replica may use the data of expired keys #6842

Closed

oranagra mentioned this issue Feb 2, 2021

XINFO should use lookupKeyReadOrReply #8436

Merged

oranagra mentioned this issue Mar 17, 2021

COPY src should use lookupKeyRead #8663

Closed

oranagra mentioned this issue Sep 25, 2021

Avoid accessing the expire dict for keys without expire #9547

Open

zuiderkwast mentioned this issue Sep 30, 2021

Sort out the mess around writable replicas and lookupKeyRead/Write #9572

Merged

oranagra closed this as completed in #9572 Nov 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-key commands about lookupKeyWrite and lookupKeyRead #7475

Multi-key commands about lookupKeyWrite and lookupKeyRead #7475

yangbodong22011 commented Jul 7, 2020 •

edited

soloestoy commented Jul 7, 2020

oranagra commented Jul 7, 2020

guybe7 commented Jul 7, 2020

oranagra commented Jul 7, 2020

soloestoy commented Jul 7, 2020

yossigo commented Jul 7, 2020

yangbodong22011 commented Jul 8, 2020

Multi-key commands about lookupKeyWrite and lookupKeyRead #7475

Multi-key commands about lookupKeyWrite and lookupKeyRead #7475

Comments

yangbodong22011 commented Jul 7, 2020 • edited

soloestoy commented Jul 7, 2020

oranagra commented Jul 7, 2020

guybe7 commented Jul 7, 2020

oranagra commented Jul 7, 2020

soloestoy commented Jul 7, 2020

yossigo commented Jul 7, 2020

yangbodong22011 commented Jul 8, 2020

yangbodong22011 commented Jul 7, 2020 •

edited