gh-139871: Optimize small takes in bytearray.take_bytes #141741

cmaloney · 2025-11-19T09:56:12Z

When less than half the buffer is taken just copy that small part out rather than doing a big alloc + memcpy + big shrink.

cc: @vstinner, @encukou

Issue: Add .take_bytes([n]) a zero-copy path from bytearray to bytes #139871

When less than half the buffer is taken just copy that small part out rather than doing a big alloc + memmove + big shrink.

Lib/test/test_bytes.py

vstinner

LGTM

Objects/bytearrayobject.c

encukou · 2025-11-19T17:06:28Z

Hmm, would it make sense to take this path on (self->ob_start != self->ob_bytes), too? In that case the code currently goes on to memmove data.
OTOH, doing this would cause the bytearray into using this path in future takes, until a reallocation happens...

Co-authored-by: Victor Stinner <vstinner@python.org>

Co-authored-by: Petr Viktorin <encukou@gmail.com>

cmaloney · 2025-11-20T01:49:38Z

Hmm, would it make sense to take this path on (self->ob_start != self->ob_bytes), too?

I'd need to measure more / not sure. The realloc code does memcpy + malloc if 25% of space can be saved:

cpython/Objects/obmalloc.c

Line 2676 in ca1e86f

if (4 * nbytes > 3 * size) {

Doing a malloc + memcpy + free instead of the memmove + malloc + memcpy + free would be good but not sure how much to tune to the particular allocator constants.

My theory for usage pattern has been reading a block of bytes off a stream (network or terminal), appending (or readinto) the bytearray, then taking little chunks out of that as bytes. With that bytearray realloc code should recompact periodically effectively meaning this turns into a reasonably efficient ringbuffer.... (for that use case a "please repack but don't reduce capacity flag may be useful"). Not sure how common that usage is going to be though

encukou · 2025-11-20T07:48:38Z

That sounds like readlines().

Anyway, no need to block this PR on that. Thank you!

bedevere-bot · 2025-11-20T08:08:32Z

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

Hi! The buildbot AMD64 Debian root 3.x (tier-1) has failed when building commit e265ce8.

What do you need to do:

Don't panic.
Check the buildbot page in the devguide if you don't know what the buildbots are or how they work.
Go to the page of the buildbot that failed (https://buildbot.python.org/#/builders/345/builds/12729) and take a look at the build logs.
Check if the failure is related to this commit (e265ce8) or if it is a false positive.
If the failure is related to this commit, please, reflect that on the issue and make a new Pull Request with a fix.

You can take a look at the buildbot page here:

https://buildbot.python.org/#/builders/345/builds/12729

Failed tests:

test.test_os.test_os

Failed subtests:

test_timerfd_interval - test.test_os.test_os.TimerfdTests.test_timerfd_interval

Summary of the results of the build (if available):

==

Click to see traceback logs

Traceback (most recent call last):
  File "/root/buildarea/3.x.angelico-debian-amd64/build/Lib/test/test_os/test_os.py", line 4009, in test_timerfd_interval
    self.assertEqual(self.read_count_signaled(fd), 1)
    ~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AssertionError: 2 != 1


Traceback (most recent call last):
  File "/root/buildarea/3.x.angelico-debian-amd64/build/Lib/test/test_os/test_os.py", line 4017, in test_timerfd_interval
    self.assertEqual(self.read_count_signaled(fd), count)
    ~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AssertionError: 4 != 3

cmaloney · 2025-11-20T08:15:43Z

Test failure looks independent of this change, os_read_impl doesn't use bytearray and this doesn't touch the other part of it int.from_bytes (in general things don't use .take_bytes very much yet)

pythongh-139871: Optimize small takes in bytearray.take_bytes

783717c

When less than half the buffer is taken just copy that small part out rather than doing a big alloc + memmove + big shrink.

bedevere-app bot added the awaiting review label Nov 19, 2025

bedevere-app bot mentioned this pull request Nov 19, 2025

Add .take_bytes([n]) a zero-copy path from bytearray to bytes #139871

Closed

cmaloney added the skip news label Nov 19, 2025

encukou reviewed Nov 19, 2025

View reviewed changes

Lib/test/test_bytes.py Show resolved Hide resolved

vstinner approved these changes Nov 19, 2025

View reviewed changes

Objects/bytearrayobject.c Outdated Show resolved Hide resolved

bedevere-app bot added awaiting merge and removed awaiting review labels Nov 19, 2025

cmaloney and others added 2 commits November 19, 2025 12:02

Update Objects/bytearrayobject.c

16a7378

Co-authored-by: Victor Stinner <vstinner@python.org>

Update Lib/test/test_bytes.py

25a03d9

Co-authored-by: Petr Viktorin <encukou@gmail.com>

encukou merged commit e265ce8 into python:main Nov 20, 2025
44 checks passed

bedevere-app bot removed the awaiting merge label Nov 20, 2025

cmaloney deleted the ba_tb_keep_majority branch November 20, 2025 08:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-139871: Optimize small takes in bytearray.take_bytes #141741

gh-139871: Optimize small takes in bytearray.take_bytes #141741

Uh oh!

cmaloney commented Nov 19, 2025 •

edited

Loading

Uh oh!

Uh oh!

vstinner left a comment

Uh oh!

Uh oh!

encukou commented Nov 19, 2025

Uh oh!

cmaloney commented Nov 20, 2025 •

edited

Loading

Uh oh!

encukou commented Nov 20, 2025

Uh oh!

Uh oh!

bedevere-bot commented Nov 20, 2025

Uh oh!

cmaloney commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

gh-139871: Optimize small takes in bytearray.take_bytes #141741

gh-139871: Optimize small takes in bytearray.take_bytes #141741

Uh oh!

Conversation

cmaloney commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

vstinner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

encukou commented Nov 19, 2025

Uh oh!

cmaloney commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

encukou commented Nov 20, 2025

Uh oh!

Uh oh!

bedevere-bot commented Nov 20, 2025

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

Uh oh!

cmaloney commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

cmaloney commented Nov 19, 2025 •

edited

Loading

cmaloney commented Nov 20, 2025 •

edited

Loading