bpo-35094: Improved algorithms for random.sample #10192

ciphergoth · 2018-10-28T19:05:22Z

Current algorithms for random.sample allocate considerable auxiliary memory to track what's been used so far; with this pull request we sample in a maximally memory efficient way. Peformance is similar or in some cases faster. See also https://github.com/ciphergoth/sansreplace

https://bugs.python.org/issue35094

the-knights-who-say-ni · 2018-10-28T19:05:24Z

Hello, and thanks for your contribution!

I'm a bot set up to make sure that the project can legally accept your contribution by verifying you have signed the PSF contributor agreement (CLA).

Unfortunately our records indicate you have not signed the CLA. For legal reasons we need you to sign this before we can look at your contribution. Please follow the steps outlined in the CPython devguide to rectify this issue.

You can check yourself to see if the CLA has been received.

Thanks again for your contribution, we look forward to reviewing it!

pablogsal · 2018-10-28T19:10:51Z

Lib/random.py

+        if is_set or k*2 >= n:
+            for i, item in enumerate(population):
+                r = randbelow(i + 1)
+                if r < k:


You can reduce one indentation level with an early continue:

if r >= k: continue if i < k: result[i] = result[r] result[r] = item

Fixed - thanks!

ciphergoth · 2018-10-28T19:39:16Z

BTW CLA signed at 10:52 Pacific time this morning, just waiting for the systems to catch up!

rhettinger · 2019-08-24T21:58:03Z

For future reference, here are my notes from evaluating the PR:

Call count summary

k	Baseline	Patched
10	10	20
20	20	40
100	100	200
200	200	400
1,000	1,000	2,000
2,000	2,000	4,000
5,001	5,001	10,000
7,500	5,001	10,000
8,500	8,500	10,000
9,500	9,500	10,000

Test Code

from unittest.mock import Mock
from random import Random
import sys

prng = Random()
prng._randbelow = Mock(wraps=prng._randbelow)

n, k = map(int, sys.argv[1:3])
prng.sample(range(n), k)
rbc = prng._randbelow.call_count
print(f'randbelow_calls={rbc} <-- n={n} k={k}')

Results with the PR Applied

$ python3.8 randombelow_counter.py 10000 10
randbelow_calls=20 <-- n=10000 k=10
$ python3.8 randombelow_counter.py 10000 20
randbelow_calls=40 <-- n=10000 k=20
$ python3.8 randombelow_counter.py 10000 100
randbelow_calls=200 <-- n=10000 k=100
$ python3.8 randombelow_counter.py 10000 200
randbelow_calls=400 <-- n=10000 k=200
$ python3.8 randombelow_counter.py 10000 1000
randbelow_calls=2000 <-- n=10000 k=1000
$ python3.8 randombelow_counter.py 10000 2000
randbelow_calls=4000 <-- n=10000 k=2000
$ python3.8 randombelow_counter.py 10000 5001
randbelow_calls=10000 <-- n=10000 k=5001
$ python3.8 randombelow_counter.py 10000 7500
randbelow_calls=10000 <-- n=10000 k=7500
$ python3.8 randombelow_counter.py 10000 8500
randbelow_calls=10000 <-- n=10000 k=8500
$ python3.8 randombelow_counter.py 10000 9500
randbelow_calls=10000 <-- n=10000 k=9500

Results for the Baseline Version

$ python3.8 randombelow_counter.py 10000 10
randbelow_calls=10 <-- n=10000 k=10
$ python3.8 randombelow_counter.py 10000 20
randbelow_calls=20 <-- n=10000 k=20
$ python3.8 randombelow_counter.py 10000 100
randbelow_calls=100 <-- n=10000 k=100
$ python3.8 randombelow_counter.py 10000 200
randbelow_calls=201 <-- n=10000 k=200
$ python3.8 randombelow_counter.py 10000 1000
randbelow_calls=1055 <-- n=10000 k=1000
$ python3.8 randombelow_counter.py 10000 2000
randbelow_calls=2000 <-- n=10000 k=2000
$ python3.8 randombelow_counter.py 10000 5001
randbelow_calls=5001 <-- n=10000 k=5001
$ python3.8 randombelow_counter.py 10000 7500
randbelow_calls=7500 <-- n=10000 k=7500
$ python3.8 randombelow_counter.py 10000 8500
randbelow_calls=8500 <-- n=10000 k=8500
$ python3.8 randombelow_counter.py 10000 9500
randbelow_calls=9500 <-- n=10000 k=9500

rhettinger · 2019-08-24T22:01:38Z

Also, here at the comparative timings with and without the patch:

With the PR Applied

$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=10)'
20000 loops, best of 11: 14.9 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=20)'
10000 loops, best of 11: 26.7 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=100)'
2000 loops, best of 11: 122 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=200)'
1000 loops, best of 11: 243 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=1_000)'
200 loops, best of 11: 1.27 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=2_000)'
100 loops, best of 11: 2.58 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=5_001)'
50 loops, best of 11: 5.56 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=7500)'
50 loops, best of 11: 5.7 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=8500)'
50 loops, best of 11: 5.78 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=9500)'
50 loops, best of 11: 5.83 msec per loop

Baseline

$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=10)'
20000 loops, best of 11: 10.6 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=20)'
20000 loops, best of 11: 17.9 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=100)'
5000 loops, best of 11: 74.9 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=200)'
2000 loops, best of 11: 144 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=1_000)'
500 loops, best of 11: 738 usec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=2_000)'
200 loops, best of 11: 1.63 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=5_001)'
100 loops, best of 11: 3.38 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=7500)'
50 loops, best of 11: 4.9 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=8500)'
50 loops, best of 11: 5.52 msec per loop
$ ./python.exe -m timeit -r11 -s 'from random import sample' 'sample(range(10_000), k=9500)'

ciphergoth · 2019-08-24T22:42:00Z

Thanks for following up on this! Surprised by the numbers you're seeing - I'll try reproducing your tests and investigate further. Thanks again!

bpo-35094: Improved algorithms for random.sample

7e4f85e

ciphergoth requested a review from rhettinger as a code owner October 28, 2018 19:05

the-knights-who-say-ni added the CLA not signed label Oct 28, 2018

bedevere-bot added the awaiting review label Oct 28, 2018

pablogsal requested review from markshannon and mdickinson and removed request for markshannon October 28, 2018 19:08

pablogsal reviewed Oct 28, 2018

View reviewed changes

ciphergoth added 2 commits October 28, 2018 12:13

Add news file.

b4d887e

Use continue to remove a level of indentation.

d1338fa

rhettinger closed this Oct 28, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bpo-35094: Improved algorithms for random.sample #10192

bpo-35094: Improved algorithms for random.sample #10192

ciphergoth commented Oct 28, 2018 •

edited by bedevere-bot

the-knights-who-say-ni commented Oct 28, 2018

pablogsal Oct 28, 2018

ciphergoth Oct 28, 2018

ciphergoth commented Oct 28, 2018

rhettinger commented Aug 24, 2019 •

edited

rhettinger commented Aug 24, 2019

ciphergoth commented Aug 24, 2019

bpo-35094: Improved algorithms for random.sample #10192

bpo-35094: Improved algorithms for random.sample #10192

Conversation

ciphergoth commented Oct 28, 2018 • edited by bedevere-bot

the-knights-who-say-ni commented Oct 28, 2018

pablogsal Oct 28, 2018

Choose a reason for hiding this comment

ciphergoth Oct 28, 2018

Choose a reason for hiding this comment

ciphergoth commented Oct 28, 2018

rhettinger commented Aug 24, 2019 • edited

Call count summary

Test Code

Results with the PR Applied

Results for the Baseline Version

rhettinger commented Aug 24, 2019

With the PR Applied

Baseline

ciphergoth commented Aug 24, 2019

ciphergoth commented Oct 28, 2018 •

edited by bedevere-bot

rhettinger commented Aug 24, 2019 •

edited