Change skip list P value to 1/e, which improves search times #3889
Conversation
It's been a few months, is there anything I can do to have this PR considered? I've tried IRC already and got some positive feedback but nobody with write access noticed.
Hello, very interesting! This is the first time I've seen this; sorry, and thanks for pinging me about it. I'll review soon, thanks.
I think when p is 1/4 the space is about 1.33n, while when p is 1/e the space goes up to about 1.58n. When adjusting that, you should also keep memory in mind.
Yep, that was my wonder indeed: p is a tradeoff between space and time, and since sorted sets are already extremely heavy I wonder if changing the value is a good tradeoff for us. Another thing that worries me is the graph above: why is the red line so far off the theoretical trajectory? I wonder if the test was affected by enough error that it's hard to tell exactly how much faster the new implementation is. However it looks quite a bit faster, and we should keep in mind that the sorted set is a skiplist plus a hash table, so the memory footprint of the skiplist itself is not so dominant. TLDR: this could be a good change but I need memory numbers to evaluate it.
I ran another round of tests. Here are the results. Let me know if I can clarify anything or look deeper into the tradeoffs.

Configuration

Hardware is a 2016 MacBook Pro 15" with Touch Bar. CPU: 2.7 GHz Intel Core i7. Memory: 16 GB 2133 MHz LPDDR3. OS: macOS Sierra 10.12.6. I compiled 4 versions from …

Benchmark

I wrote another bash script that keeps adding to a sorted set via ZADD.

Results

First, some averages to see how much faster and more memory-hungry the new P value is:

The final amount of memory allocated after inserting all 10 million items in all 4 tests:

Here are some charts for … Next up are the charts for …

Conclusions
I think the tradeoff is worth it. It doesn't introduce any additional unpredictability in speed, and memory use is very steady with a fixed additional overhead. I also think it's worth exposing this type of tuning as a user-controlled setting.
@filipecosta90 I think this one is worth your review. It might be good to verify the results again. And then finally make a call regarding the performance/memory tradeoff this presents. The bottom line here is that given the relatively modest numbers (for the jemalloc case), I don't think it's critical either way. But we should definitely merge or close this already.
@yoav-steinberg I'm working on seeing if this change can be measured/detected by the current use-case on https://github.com/redis/redis-benchmarks-specification. |
@filipecosta90 Any chance you have an update?
Planning to focus on this tomorrow, @madolson. Sorry for the late reply.
@madolson @sean-public From the simple ZADD test (described here: https://github.com/redis/redis-benchmarks-specification/blob/main/redis_benchmarks_specification/test-suites/redis-benchmark-full-suite-1Mkeys-100B.yml), which only focuses on achievable throughput and overall client p50 latency, there is no visible change on an oss-standalone setup. However, this is a very simple test and may be too simple to catch the changes/improvements @sean-public worked on. I'll extend the tests to include at least the charts shared in this PR description, so that a clean improvement %change is calculated automatically, and get back with further data.
That seems correct, as the earlier benchmarks using the scripts I provided show a visually appreciable difference in latency well after 1M keys are inserted (the magnitude varying based on memory allocator used). I think the speed/memory tradeoff is solid and only improves performance as measured in requests per second, especially with pipelining.
Not signing the CLA. |
The probability of adding new nodes at each level of a skip list (the node's height) is determined by successively "rolling the dice": the node is promoted to the next level while each roll succeeds with probability P, up to a maximum level.
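In C, this dice-rolling loop might look like the following sketch (a generic illustration rather than the exact Redis source; the constant names mirror Redis's ZSKIPLIST_P and ZSKIPLIST_MAXLEVEL):

```c
#include <stdlib.h>

#define SKIPLIST_P 0.25      /* promotion probability, the "dice" */
#define SKIPLIST_MAXLEVEL 32 /* hard cap on a node's height */

/* Roll the dice once per level: keep promoting the new node while a
 * uniform draw lands below P, stopping at the maximum level. */
static int random_level(void) {
    int level = 1;
    while ((rand() / (double)(RAND_MAX + 1.0)) < SKIPLIST_P &&
           level < SKIPLIST_MAXLEVEL)
        level++;
    return level;
}
```

With P = 0.25 the average height comes out at 1/(1-P) ≈ 1.33 levels per node.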
I have observed default P values for skip lists "in the wild" ranging from 0.25 to 0.5. In Redis, ZSKIPLIST_P is 0.25 with a ZSKIPLIST_MAXLEVEL of 32.

The optimal value for P in a general-purpose skip list is 1/e, where e is Euler's number. For a detailed proof, see "Analysis of an optimized search algorithm for skip lists" by Kirschenhofer et al. (1995).
To investigate, I benchmarked before and after changing the P value by running a server with redis-server --save "" --appendonly no, then running a bash script to benchmark ZSET operations. Here is a chart showing the resulting improvement in ZADD operations per second (higher is better):

This change should improve the speed of all operations that require searching a skip list of any length. Typical benchmark results show a 4-8% improvement, with a tradeoff of slightly more memory used to store the data structure.