[NEW] listpack migration - replace all usage of ziplist with listpack #8702

Closed
sundb opened this issue Mar 26, 2021 · 8 comments · Fixed by #9740

@sundb
Collaborator

sundb commented Mar 26, 2021

The problem

When elements are inserted or deleted in the middle of a ziplist, the ziplist may suffer a cascade of updates.
listpack is already used in streams; t_zset and quicklist should also use listpack instead of ziplist.
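For context on the cascade: each ziplist entry stores the length of the *previous* entry in its header, using 1 byte when that length is below 254 and 5 bytes otherwise, so growing one entry can force the next entry's header to grow, which can push that entry over the threshold in turn, and so on. A toy model of the effect (illustrative only, not the actual ziplist code):

```c
#include <stddef.h>
#include <stdio.h>

/* A ziplist entry encodes the previous entry's length in 1 byte when it is
 * below 254, and in 5 bytes otherwise. */
static size_t prevlen_size(size_t prev_entry_len) {
    return prev_entry_len < 254 ? 1 : 5;
}

int main(void) {
    /* Toy chain of entries: each holds 252 payload bytes plus a 1-byte
     * prevlen header, i.e. just below the 254 threshold. */
    size_t entry_len[8];
    for (int i = 0; i < 8; i++) entry_len[i] = 1 + 252;

    /* Grow the first entry (say, an insert made it larger). Every following
     * entry must now store a bigger prevlen, which pushes its own size over
     * the threshold, and the update cascades down the list. */
    entry_len[0] = 300;
    int cascaded = 0;
    for (int i = 1; i < 8; i++) {
        size_t new_len = prevlen_size(entry_len[i - 1]) + 252;
        if (new_len == entry_len[i]) break;  /* growth stopped propagating */
        entry_len[i] = new_len;
        cascaded++;
    }
    printf("cascading updates touched %d entries\n", cascaded);
    return 0;
}
```

In this worst case every entry after the insertion point has to be rewritten, which is exactly the O(n) behavior listpack avoids by storing each element's own length at its end rather than the previous element's length.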

Description of the feature

Changes:

  1. Keep quicklist compatible with the old ziplist storage format.
  2. Use listpack instead of ziplist in t_zset.c.
  3. Use listpack instead of ziplist in quicklist.
  4. Use listpack instead of ziplist in hash.
  5. RDB format handling (see the sketch below):
    • If deep sanitization is disabled, do not convert while reading the RDB, because converting from ziplist to listpack would slow down RDB loading; instead, convert when the quicklistNode is accessed.
    • If deep sanitization is enabled, convert from ziplist to listpack directly.
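A rough sketch of that load-time decision, using a toy node struct and hypothetical helpers (ziplist_to_listpack, deep_sanitize) rather than the real Redis API; note that the PRs eventually merged below ended up converting at RDB load time instead:

```c
#include <stdlib.h>

/* Toy model, not the real quicklistNode. */
typedef struct toy_node {
    unsigned char *payload;   /* either a ziplist or a listpack blob */
    int is_ziplist;           /* 1 while still in the old ziplist format */
} toy_node;

/* Hypothetical O(n) converter (stubbed out here). */
static unsigned char *ziplist_to_listpack(unsigned char *zl) { return zl; }

static int deep_sanitize = 0;   /* stand-in for the sanitize-dump-payload setting */

/* While loading the RDB: only pay the O(n) conversion up front when deep
 * sanitization is enabled, since that pass walks every entry anyway. */
static void load_node(toy_node *node) {
    if (deep_sanitize && node->is_ziplist) {
        node->payload = ziplist_to_listpack(node->payload);
        node->is_ziplist = 0;
    }
}

/* On access: convert lazily the first time the node is actually touched,
 * so plain RDB loading stays fast. */
static void before_access(toy_node *node) {
    if (node->is_ziplist) {
        node->payload = ziplist_to_listpack(node->payload);
        node->is_ziplist = 0;
    }
}
```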
@sundb sundb self-assigned this Mar 26, 2021
@oranagra oranagra added this to To do in 7.0 via automation Mar 26, 2021
@zuiderkwast
Contributor

zuiderkwast commented Mar 29, 2021

If we're going to take on this effort, we might as well go a step further and be even more efficient than listpack.

I figure we can save even more space by using the suffix not only to store a reverse-encoded length, but also to tag the value itself and hold payload bits. I have draft specs (and an incomplete proof of concept) here:

@sundb
Collaborator Author

sundb commented Mar 30, 2021

@zuiderkwast The reason for using listpack instead of ziplist is that ziplist may cause cascading updates on inserts and deletes in the middle, which is the biggest problem.
If the capacity is incremented by 1, 2 or 4, that can lead to a lot of wasted memory; ziplist and listpack are designed to save memory, so is that a loss or a gain?

@zuiderkwast
Contributor

Thanks for reading it. :-) I know cascading updates are the main potential problem, that there was a bug which was fixed, and that this is why listpack was designed. It doesn't seem to be a problem in practice anymore, though.

Listpack also stores numbers up to 127 in a slightly more compact way. The compactness can be explored further. That's my point. My "bidipack" is not a complete spec yet.

I'll clarify the capacity idea. It's not incremented by 1, 2 or 4 times (that was badly explained by me). It is the input to a capacity formula which, as you increment it by 1, yields the underlying allocation sizes used by jemalloc (8, 16, 32, 48, ..., 8 KiB, 10 KiB, 12 KiB, ..., 160 KiB, 192 KiB, 224 KiB, 256 KiB, ...), as per http://jemalloc.net/jemalloc.3.html#size_classes, i.e. four sizes before the allocation is doubled. There is no point in allocating anything in between these sizes, since the extra space would be unused anyway. These are the sizes returned by malloc_size() for any allocation.
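To make the size-class point concrete, here is a rough approximation of that rounding; the exact small-size classes differ a bit from real jemalloc, but it reproduces the "four classes per doubling" shape and the sample sizes listed above:

```c
#include <stddef.h>
#include <stdio.h>

/* Round a request up to a jemalloc-like size class: 16-byte spacing up to
 * 128 bytes, then each power-of-two range split into four classes
 * (160, 192, 224, 256, then 320, 384, 448, 512, ... up to 8 KiB, 10 KiB,
 * 12 KiB, ...). Illustration only, not jemalloc's actual table. */
static size_t size_class_roundup(size_t sz) {
    if (sz <= 8) return 8;
    size_t step = 16, limit = 128;
    while (sz > limit) { step *= 2; limit *= 2; }
    return (sz + step - 1) / step * step;
}

int main(void) {
    size_t samples[] = {5, 100, 130, 300, 8193, 200000};
    for (size_t i = 0; i < sizeof(samples) / sizeof(samples[0]); i++)
        printf("%zu -> %zu\n", samples[i], size_class_roundup(samples[i]));
    return 0;   /* 5->8, 100->112, 130->160, 300->320, 8193->10240, 200000->229376 */
}
```

Growing an allocation to any size between two such classes buys nothing, since malloc_size() reports the larger class anyway.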

The problem this allocation scheme is designed to solve is the extra memory moves caused by a realloc followed by a memmove on every insertion. It's a problem that has been mentioned, e.g. by @oranagra in this comment. This is not the main idea of bidipack, though, and it can be skipped completely.

@oranagra
Member

oranagra commented Mar 30, 2021

@zuiderkwast I guess you meant this comment

I didn't have time to read the entire thing yet, but from the portion I read I think that:

  1. The idea of doing pre-reallocation is a nice speedup, but it obviously comes at the expense of memory. For quicklist and stream it may be OK, since we'll trim that before creating a new node and the excess may be insignificant, but for hash and zset it's probably not a good idea.
  2. The idea of encoding the growth strategy inside the listpack is probably not useful; even if we add some global config for that strategy (unlikely), we're very unlikely to add a per-key config.
  3. Encoding the capacity may not be needed, since we can call malloc_size, or even just do a realloc and count on it being very fast when it's a NOP (see the sketch after this list). That's what I meant in that other comment, where IIRC I misunderstood that PR at the time (i.e. that we should benchmark and see if there's a reason to avoid these NOP calls).
  4. One of the reasons behind listpack was not only to avoid the cascading updates of ziplist, but also to simplify the code and make it less bug prone (we suspect there's still a bug in ziplist causing corruption, but we can't find it).
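On point 3, a minimal sketch of leaning on the allocator instead of storing a capacity field, using malloc_usable_size() (which glibc and jemalloc both provide; Redis wraps the same idea as zmalloc_size()). This is a plain-libc illustration, not the Redis code, and error handling is omitted:

```c
#include <malloc.h>   /* malloc_usable_size() on glibc/jemalloc */
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Append to a heap buffer without tracking capacity ourselves: ask the
 * allocator how large the block really is, and realloc only when that
 * usable size is exceeded. A realloc that stays within the same size
 * class is effectively a NOP. */
static unsigned char *buf_append(unsigned char *buf, size_t *len,
                                 const void *data, size_t n) {
    if (*len + n > malloc_usable_size(buf))
        buf = realloc(buf, *len + n);   /* error handling omitted */
    memcpy(buf + *len, data, n);
    *len += n;
    return buf;
}

int main(void) {
    size_t len = 0;
    unsigned char *buf = malloc(1);
    for (int i = 0; i < 1000; i++)
        buf = buf_append(buf, &len, "hello", 5);
    printf("len=%zu usable=%zu\n", len, malloc_usable_size(buf));
    free(buf);
    return 0;
}
```

Whether the extra malloc_usable_size()/realloc calls are worth avoiding is exactly the benchmarking question raised in point 3.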

@zuiderkwast
Contributor

Thanks for finding the relevant comment. I suppose many of the things in my draft are not needed, e.g. the alloc strategy (unless we want to optimize for append-only use, i.e. streams), but I wrote that description as a stand-alone idea, and malloc_size isn't widely used. Avoiding calls to malloc_size could be a small speedup, and avoiding a realloc when it does move data is good because it avoids moving the data twice, but all of that can be skipped for simplicity.

@huangzhw
Collaborator

@zuiderkwast Very interesting idea. Maybe a little complicated.

@zuiderkwast
Contributor

Thanks @huangzhw, I think we can pick only a small part of it, to make it simple. Or just use listpack since it's already battle-tested in streams.

It could be useful to store a version tag somewhere to be able to update stuff in the future and to support multiple encodings.
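A minimal sketch of what such a version/encoding tag could look like, as a single byte prefixed to the serialized blob (purely hypothetical layout, not an existing Redis format):

```c
#include <stdint.h>

/* Hypothetical one-byte header prepended to a packed blob, so future code
 * can tell which encoding follows and upgrade it if needed. */
enum blob_encoding {
    BLOB_ENC_LISTPACK = 1,
    BLOB_ENC_BIDIPACK = 2,   /* some future encoding */
};

static uint8_t blob_encoding_of(const unsigned char *blob) {
    return blob[0];          /* first byte is the version/encoding tag */
}

static const unsigned char *blob_payload(const unsigned char *blob) {
    return blob + 1;         /* actual packed data starts after the tag */
}
```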

@oranagra
Member

Obviously you can't change the encoding of listpacks in streams without breaking compatibility with old RDB files.

What we can do (either now or in the future) is use different RDB opcodes when we store the new format, e.g. create RDB_TYPE_LIST_QUICKLIST_LISTPACK and RDB_TYPE_STREAM_BIDIPACK (same as the RDB_TYPE_ZSET_2 we have now).

Then we can either convert all payloads at RDB load time (if it's just about adding a version header), or support both side by side (i.e. support both listpack and bidipack, or a listpack with a version header field, like we're going to do for supporting ziplist side by side with listpack).

Some formats, like hash, can simply have another (3rd or 4th) encoding; others, like quicklist, have an "encoding" (container) field per node (see QUICKLIST_NODE_CONTAINER_ZIPLIST), so we can support both listpack and bidipack.
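As a rough illustration of the side-by-side option keyed on that per-node container field (toy struct and stub helpers, not the actual quicklist.h; the real constants are the QUICKLIST_NODE_CONTAINER_* defines):

```c
#include <stddef.h>

/* Toy container tags; the listpack value is hypothetical, added only to show
 * how two packed encodings could coexist per node. */
enum {
    CONTAINER_PLAIN    = 1,   /* node holds one large, unencoded element */
    CONTAINER_ZIPLIST  = 2,   /* old packed format */
    CONTAINER_LISTPACK = 3,   /* new packed format (hypothetical tag) */
};

typedef struct toy_node { unsigned char *entry; int container; } toy_node;

/* Stub per-format helpers, standing in for ziplistLen()/lpLength(). */
static size_t ziplist_count(unsigned char *zl)  { (void)zl; return 0; }
static size_t listpack_count(unsigned char *lp) { (void)lp; return 0; }

/* Every reader branches on the container tag, so old and new payloads can
 * coexist until (or unless) they are converted. */
static size_t node_count(toy_node *n) {
    switch (n->container) {
    case CONTAINER_ZIPLIST:  return ziplist_count(n->entry);
    case CONTAINER_LISTPACK: return listpack_count(n->entry);
    default:                 return 1;   /* plain node: a single element */
    }
}
```

The other option, converting everything once at RDB load time under a new opcode, is what the merged PRs below ended up doing for hashes, zsets and quicklists.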

@oranagra oranagra removed this from Backlog in 7.0 Apr 22, 2021
oranagra added a commit that referenced this issue Aug 10, 2021
Part one of implementing #8702 (taking hashes first before other types)

## Description of the feature
1. Change ziplist encoded hash objects to listpack encoding.
2. Convert existing ziplists at RDB loading time, an O(n) operation (sketched after this commit message).

## Rdb format changes
1. Add RDB_TYPE_HASH_LISTPACK rdb type.
2. Bump RDB_VERSION to 10

## Interface changes
1. New `hash-max-listpack-entries` config is an alias for `hash-max-ziplist-entries` (same with `hash-max-listpack-value`)
2. OBJECT ENCODING will return `listpack` instead of `ziplist`

## Listpack improvements:
1. Support direct insert, replace integer element (rather than convert back and forth from string)
3. Add more listpack capabilities to match the ziplist ones (like `lpFind`, `lpRandomPairs` and such)
4. Optimize element length fetching, avoid multiple calculations
5. Use inline to avoid function call overhead.

## Tests
1. Add a new test for the RDB load-time conversion
2. Add listpack unit tests (based on the one in ziplist.c)
3. Add a few "corrupt payload: fuzzer findings" tests, and slightly modify existing ones.

Co-authored-by: Oran Agra <oran@redislabs.com>
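For reference, the O(n) load-time conversion in point 2 of the feature description amounts to walking the old ziplist and re-appending every element into a fresh listpack. A simplified sketch, assuming the Redis-internal ziplist/listpack APIs (ziplistIndex/ziplistGet/ziplistNext, lpNew/lpAppend, ll2string); exact signatures may differ between versions, and per the listpack improvements above the real code appends integers directly instead of round-tripping through a string:

```c
#include "ziplist.h"
#include "listpack.h"
#include "util.h"      /* ll2string() */
#include "zmalloc.h"   /* zfree() */

/* Convert one ziplist blob into a listpack blob; O(n) in the number of
 * entries. Sketch only -- not the code from the PR. */
unsigned char *ziplistToListpackSketch(unsigned char *zl) {
    unsigned char *lp = lpNew(0);
    unsigned char *p = ziplistIndex(zl, 0);
    while (p) {
        unsigned char *str;
        unsigned int slen;
        long long lval;
        ziplistGet(p, &str, &slen, &lval);
        if (str) {
            lp = lpAppend(lp, str, slen);                  /* string element */
        } else {
            char buf[32];                                  /* integer element */
            int blen = ll2string(buf, sizeof(buf), lval);
            lp = lpAppend(lp, (unsigned char *)buf, blen);
        }
        p = ziplistNext(zl, p);
    }
    zfree(zl);
    return lp;
}
```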
JackieXie168 pushed a commit to JackieXie168/redis that referenced this issue Sep 8, 2021
oranagra pushed a commit that referenced this issue Sep 9, 2021
Part two of implementing #8702 (zset), after #8887.

## Description of the feature
Replaced all uses of ziplist with listpack in t_zset, and optimized some of the code to improve performance.

## Rdb format changes
New `RDB_TYPE_ZSET_LISTPACK` rdb type.

## Rdb loading improvements:
1) Pre-expand the dict when validating listpack and ziplist data for duplicates.
2) Simplify the release of empty key objects during RDB loading.
3) Unify the ziplist and listpack data-verification methods for zset and hash, and move the code to rdb.c.

## Interface changes
1) New `zset-max-listpack-entries` config is an alias for `zset-max-ziplist-entries` (same with `zset-max-listpack-value`).
2) OBJECT ENCODING will return listpack instead of ziplist.

## Listpack improvements:
1) Add `lpDeleteRange` and `lpDeleteRangeWithEntry` functions to delete a range of entries from listpack.
2) Improve the performance of `lpCompare`; converting from string to integer is faster than converting from integer to string (sketched after this commit message).
3) Replace `snprintf` with `ll2string` to improve performance in converting numbers to strings in `lpGet()`.

## Zset improvements:
1) Improve the performance of `zzlFind` method, use `lpFind` instead of `lpCompare` in a loop.
2) Use `lpDeleteRangeWithEntry` instead of calling `lpDelete` twice to delete an element of a zset.

## Tests
1) Add some unit tests for the `lpDeleteRange` and `lpDeleteRangeWithEntry` functions.
2) Add a zset RDB loading test.
3) Add a benchmark test for `lpCompare` and `ziplistCompare`.
4) Add empty listpack zset corrupt dump test.
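The `lpCompare` point works because a listpack element may already be stored as an integer: parsing the external string once with `string2ll` and comparing two long longs is cheaper than rendering the stored integer back into a string and memcmp-ing. A hedged sketch of that comparison direction (not the actual listpack.c code; `string2ll` is the Redis util.c helper, forward-declared here):

```c
#include <stddef.h>
#include <string.h>

/* From the Redis tree (util.c): parse a string into a long long, returning
 * 1 on success and 0 if it is not a valid integer. */
int string2ll(const char *s, size_t slen, long long *value);

/* Compare one element against an external string. The element is either
 * stored as raw bytes (str != NULL) or as an already-decoded integer. */
static int elem_equals(const unsigned char *str, unsigned int slen, long long ival,
                       const unsigned char *s, unsigned int len) {
    if (str)   /* stored as a string: plain byte comparison */
        return slen == len && memcmp(str, s, len) == 0;

    /* Stored as an integer: convert the *input* string to an integer once,
     * instead of converting the stored integer to a string every time. */
    long long sval;
    if (!string2ll((const char *)s, len, &sval)) return 0;   /* not a number */
    return sval == ival;
}
```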
@oranagra oranagra added this to Backlog in 7.0 via automation Nov 1, 2021
@oranagra oranagra moved this from Backlog to To Do in 7.0 Nov 1, 2021
@oranagra oranagra linked a pull request Nov 7, 2021 that will close this issue
@oranagra oranagra removed this from To Do in 7.0 Nov 14, 2021
oranagra added a commit that referenced this issue Nov 24, 2021
Part three of implementing #8702, following #8887 and #9366 .

## Description of the feature
1. Replace the ziplist container of quicklist with listpack.
2. Convert existing quicklist ziplists at RDB loading time, an O(n) operation.

## Interface changes
1. New `list-max-listpack-size` config is an alias for `list-max-ziplist-size`.
2. Replace `debug ziplist` command with `debug listpack`.

## Internal changes
1. Add `lpMerge` to merge two listpacks (same as `ziplistMerge`).
2. Add `lpRepr` to print listpack info; it is used in debugCommand and `quicklistRepr` (same as `ziplistRepr`).
3. Replace `QUICKLIST_NODE_CONTAINER_ZIPLIST` with `QUICKLIST_NODE_CONTAINER_PACKED` (following #9357).
    It represents that a quicklistNode is a packed node, as opposed to a plain node.
4. Remove the `createZiplistObject` method, which is never used.
5. Calculate the listpack entry size using an overhead overestimation in `quicklistAllowInsert` (sketched after this commit message).
    We prefer an overestimation, which would at worst lead to a few bytes below the lowest limit of 4k.

## Improvements
1. Call `lpShrinkToFit` after converting a ziplist to a listpack, which was missed in #9366.
2. Optimize `quicklistAppendPlainNode` to avoid copying data with memcpy.

## Bugfix
1. Fix a crash in `quicklistRepr` when the ziplist is compressed, introduced in #9366.

## Test
1. Add a unit test for `lpMerge`.
2. Modify the old quicklist ziplist corrupt dump test.

Co-authored-by: Oran Agra <oran@redislabs.com>
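On point 5 of the internal changes above, the overestimation works because a listpack entry costs at most a few encoding-header bytes plus a few back-length bytes on top of its payload, so a small constant keeps the node-size check cheap and only ever errs toward splitting slightly early, i.e. a few bytes under the 4 KiB lower limit. A hedged sketch of such a check (constant and names are illustrative, not quicklist.c):

```c
#include <stddef.h>

/* Overestimated per-entry overhead: up to ~5 bytes of encoding header plus
 * up to ~5 bytes of reverse-encoded length, rounded up a little. */
#define ENTRY_OVERHEAD_ESTIMATE 11

/* Would adding an element with `sz` payload bytes keep this node within the
 * configured byte limit (e.g. 4096 when list-max-listpack-size is -1)? */
static int node_allow_insert(size_t node_bytes, size_t sz, size_t limit_bytes) {
    size_t estimated = node_bytes + sz + ENTRY_OVERHEAD_ESTIMATE;
    /* Overestimating never lets a node exceed the limit; at worst it starts
     * a new node a few bytes before strictly necessary. */
    return estimated <= limit_bytes;
}
```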
hwware pushed a commit to hwware/redis that referenced this issue Dec 20, 2021