gh-139424: microoptimize _collections_abc._check_methods #139401
Conversation
We can reduce the number of attribute lookups required here. I also think we can speed it up a little more if we make the `methods` arg a packed tuple instead of an unpacked one, since that would bypass the extra tuple creation at each call. Seeking input on that before I implement it; I'm not 100% certain this internal function is safe to change.
Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool. If this change has little impact on Python users, wait for a maintainer to apply the `skip news` label.
Also, please create an issue where you can report the benchmarks, along with the script you used to produce them. I would suggest creating test cases with deeply nested classes.
```diff
 def _check_methods(C, *methods):
-    mro = C.__mro__
+    mro_dicts = [B.__dict__ for B in C.__mro__]
```
Maybe we should use a tuple instead of a list to reduce memory usage, since this collection isn't modified:

```diff
-    mro_dicts = [B.__dict__ for B in C.__mro__]
+    mro_dicts = tuple(B.__dict__ for B in C.__mro__)
```
Would it be reasonable to cache this tuple for reuse?
But the caching here would need to be more sophisticated: when the class is updated, the result must change too. Standard caching only keys on the input, so in this case the input would be cached while the output should actually differ depending on the class state.
Can a class's MRO change after the fact? I know the contents of the dicts could change, but if the cache just holds references to each class's dict, then any modifications to those dicts would already be reflected in the cache. If the MRO can change, this becomes invalid.
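For what it's worth, the MRO of an ordinary class can change at runtime, since assigning to `__bases__` recomputes `__mro__`; a quick demonstration:

```python
class Base:
    pass

class Other:
    pass

class C(Base):
    pass

assert Base in C.__mro__

# Rebasing the class is legal for ordinary classes with a compatible
# layout, and it recomputes __mro__, so a tuple of MRO dicts cached
# earlier would now be stale.
C.__bases__ = (Other,)

assert Base not in C.__mro__
assert Other in C.__mro__
```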
To be clear, I'm not suggesting caching the result of this function, only the tuple of MRO dicts.
Ah, got it. I think caching the MRO dict tuple should be safe.
Here you also need to decide what exactly we want to optimize: a tuple gives roughly 10% better memory efficiency, while a list is roughly 10% faster to iterate in loops.
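A rough way to measure the list-vs-tuple iteration difference yourself; absolute numbers vary by build and machine, so treat this as a measurement sketch rather than a verdict:

```python
import timeit

# Iterate the __dict__ of every class in a small MRO, as _check_methods does.
mro_dicts_list = [B.__dict__ for B in bool.__mro__]
mro_dicts_tuple = tuple(mro_dicts_list)

def iter_list():
    for d in mro_dicts_list:
        pass

def iter_tuple():
    for d in mro_dicts_tuple:
        pass

n = 1_000_000
print(f"list:  {timeit.timeit(iter_list, number=n):.3f}s")
print(f"tuple: {timeit.timeit(iter_tuple, number=n):.3f}s")
```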
Interesting, I didn't know lists are faster to iterate through, even with the length check at each iteration. TIL, thanks.
In that case I'd opt for the list for top speed in the subclass check, but I'm not sure how that fits the best practices here or what people prefer.
It's not worth debating tiny differences like this until we've verified that the change itself is worth pursuing.
```python
for base_dict in mro_dicts:
    if method in base_dict:
        if base_dict[method] is None:
            return NotImplemented
        break
```
```diff
-for base_dict in mro_dicts:
-    if method in base_dict:
-        if base_dict[method] is None:
-            return NotImplemented
-        break
+for base_dict in mro_dicts:
+    method_impl = base_dict.get(method, sentinel)
+    if method_impl is None:
+        return NotImplemented
+    if method_impl is not sentinel:
+        break
```
This avoids computing the hash and looking up the key in the dict twice. The sentinel object can be created at the top of the function.
That looks like a clear win, good eye! It might make sense to do the check optimistically, i.e. treat the sentinel as more likely than `None`:

```python
if method_impl is sentinel:
    continue
if method_impl is None:
    return NotImplemented
break
```
If you want to apply any of the given suggestions, please first open an issue and provide micro-benchmarks (use a PGO+LTO, non-debug build for that, and pyperf as well).
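A typical pyperf workflow for this kind of change might look like the following; the configure flags, benchmark statement, and filenames are illustrative, not prescribed by this review:

```shell
# Build an optimized, non-debug CPython first (PGO + LTO):
./configure --enable-optimizations --with-lto
make -j

# Time the helper directly so the ABC subclass-check cache doesn't hide it:
./python -m pyperf timeit \
    -s "from _collections_abc import _check_methods" \
    "_check_methods(list, '__len__', '__iter__', '__contains__')" \
    -o patched.json

# Compare against the same benchmark recorded on an unpatched build:
python3 -m pyperf compare_to baseline.json patched.json
```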
A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers, that would be appreciated. Once you have made the requested changes, please leave a comment on this pull request containing the phrase `I have made the requested changes; please review again`.
Oh wow, who knew the most active PR conversation in my GitHub history would be on a PR microoptimizing a helper function within the collections module! Thank you for the feedback, everybody! Clearly this will be more involved than I originally planned; it's going to take me some time to put together a benchmark, so bear with me. But I'll get that done soonish and test out these suggestions.
This will be my first time using this tool; do you have any suggestions or best practices for its use within this repo?
Yes, here are the steps you can do:
I don't have time now to shepherd this, but you can take your time. However, I'd appreciate it if we created an issue first as well to give more visibility (this change could be rejected if the benchmarks are not satisfactory; we usually strive for >= 10% improvements).
I'm going to close this PR for now; let's reopen it if the benchmarks in the linked issue demonstrate it's worth it. Ben's advice above is the best to follow in terms of demonstrating the worth of the change.