gh-119396: Optimize unicode_repr() #119617

vstinner · 2024-05-27T16:40:03Z

Use stringlib to specialize unicode_repr() for each string kind (UCS1, UCS2, UCS4).

Benchmark:

+-------------------------------------+---------+----------------------+
| Benchmark                           | ref     | change2              |
+=====================================+=========+======================+
| repr('abc')                         | 100 ns  | 103 ns: 1.02x slower |
+-------------------------------------+---------+----------------------+
| repr('a' * 100)                     | 369 ns  | 369 ns: 1.00x slower |
+-------------------------------------+---------+----------------------+
| repr(('a' + squote) * 100)          | 1.21 us | 946 ns: 1.27x faster |
+-------------------------------------+---------+----------------------+
| repr(('a' + nl) * 100)              | 1.23 us | 907 ns: 1.36x faster |
+-------------------------------------+---------+----------------------+
| repr(dquote + ('a' + squote) * 100) | 1.08 us | 858 ns: 1.25x faster |
+-------------------------------------+---------+----------------------+
| Geometric mean                      | (ref)   | 1.16x faster         |
+-------------------------------------+---------+----------------------+

Issue: Optimize _PyUnicodeWriter implementation #119396

Use stringlib to specialize unicode_repr() for each string kind (UCS1, UCS2, UCS4). Benchmark: +-------------------------------------+---------+----------------------+ | Benchmark | ref | change2 | +=====================================+=========+======================+ | repr('abc') | 100 ns | 103 ns: 1.02x slower | +-------------------------------------+---------+----------------------+ | repr('a' * 100) | 369 ns | 369 ns: 1.00x slower | +-------------------------------------+---------+----------------------+ | repr(('a' + squote) * 100) | 1.21 us | 946 ns: 1.27x faster | +-------------------------------------+---------+----------------------+ | repr(('a' + nl) * 100) | 1.23 us | 907 ns: 1.36x faster | +-------------------------------------+---------+----------------------+ | repr(dquote + ('a' + squote) * 100) | 1.08 us | 858 ns: 1.25x faster | +-------------------------------------+---------+----------------------+ | Geometric mean | (ref) | 1.16x faster | +-------------------------------------+---------+----------------------+

vstinner · 2024-05-27T16:40:24Z

Benchmark:

import pyperf
runner = pyperf.Runner()
squote = "'"
dquote = '"'
nl = '\n'
runner.bench_func("repr('abc')", repr, 'abc')
runner.bench_func("repr('a' * 100)", repr, 'a' * 100)
runner.bench_func("repr(('a' + squote) * 100)", repr, ('a' + squote) * 100)
runner.bench_func("repr(('a' + nl) * 100)", repr, ('a' + nl) * 100)
runner.bench_func("repr(dquote + ('a' + squote) * 100)", repr, dquote + ('a' + squote) * 100)

vstinner · 2024-05-27T16:40:41Z

cc @serhiy-storchaka

vstinner · 2024-05-27T16:42:19Z

This is a first step. The second step will be to avoid a temporary string in PyUnicode_FromFormat("%R", str_obj).

vstinner · 2024-05-28T11:36:58Z

This is a first step. The second step will be to avoid a temporary string in PyUnicode_FromFormat("%R", str_obj).

I implemented the second step locally. Sadly, it's slower! Not faster. IMO the first step (making the code faster) is still worth it :-)

Use stringlib to specialize unicode_repr() for each string kind (UCS1, UCS2, UCS4). Benchmark: +-------------------------------------+---------+----------------------+ | Benchmark | ref | change2 | +=====================================+=========+======================+ | repr('abc') | 100 ns | 103 ns: 1.02x slower | +-------------------------------------+---------+----------------------+ | repr('a' * 100) | 369 ns | 369 ns: 1.00x slower | +-------------------------------------+---------+----------------------+ | repr(('a' + squote) * 100) | 1.21 us | 946 ns: 1.27x faster | +-------------------------------------+---------+----------------------+ | repr(('a' + nl) * 100) | 1.23 us | 907 ns: 1.36x faster | +-------------------------------------+---------+----------------------+ | repr(dquote + ('a' + squote) * 100) | 1.08 us | 858 ns: 1.25x faster | +-------------------------------------+---------+----------------------+ | Geometric mean | (ref) | 1.16x faster | +-------------------------------------+---------+----------------------+

vstinner added the skip news label May 27, 2024

bedevere-app bot added the awaiting core review label May 27, 2024

bedevere-app bot mentioned this pull request May 27, 2024

Optimize _PyUnicodeWriter implementation #119396

Closed

Fix make check-c-globals

d634471

vstinner requested a review from ericsnowcurrently as a code owner May 27, 2024 16:56

vstinner merged commit 0518edc into python:main May 28, 2024
34 checks passed

vstinner deleted the unicode_repr branch May 28, 2024 16:05

bedevere-app bot removed the awaiting core review label May 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-119396: Optimize unicode_repr() #119617

gh-119396: Optimize unicode_repr() #119617

vstinner commented May 27, 2024 •

edited by bedevere-app bot

Loading

vstinner commented May 27, 2024

vstinner commented May 27, 2024

vstinner commented May 27, 2024

vstinner commented May 28, 2024

gh-119396: Optimize unicode_repr() #119617

gh-119396: Optimize unicode_repr() #119617

Conversation

vstinner commented May 27, 2024 • edited by bedevere-app bot Loading

vstinner commented May 27, 2024

vstinner commented May 27, 2024

vstinner commented May 27, 2024

vstinner commented May 28, 2024

vstinner commented May 27, 2024 •

edited by bedevere-app bot

Loading