Skip to content

[mypyc] feat: cache len for container creation from expressions with length known at compile time #19503

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 42 commits into
base: master
Choose a base branch
from

Conversation

BobTheBuidler
Copy link
Contributor

@BobTheBuidler BobTheBuidler commented Jul 25, 2025

Currently, if a user uses an immutable type as the sequence input for a for loop, the length is checked once at each iteration which, while necessary for some container types such as list and dictionaries, is not necessary for iterating over immutable types tuple, str, and bytes.

This PR modifies the codebase such that the length is only checked at the first iteration, and reused from there.

Also, in cases where a simple genexp is the input argument for a tuple, the length is currently checked one additional time before entering the iteration (this is done to determine how to size the new tuple). In those cases, we don't even need a length check at the first iteration step, and can reuse the result of that first len call (or compile-time determined constant) instead.

Lastly, in cases where a tuple is created from a genexp and the length of the genexp is knowable at compile time, this PR replaces PyList_AsTuple with the tuple constructor fast-path.

@BobTheBuidler

This comment was marked as outdated.

@BobTheBuidler BobTheBuidler changed the title feat: cache len for iterating over immutable types [mypyc] feat: cache len for iterating over immutable types Jul 25, 2025
@BobTheBuidler BobTheBuidler marked this pull request as ready for review July 26, 2025 23:03
@BobTheBuidler
Copy link
Contributor Author

@ilevkivskyi this one is also ready for review if and when you get a chance, though its definitely a bit more involved than #19497

@@ -1147,3 +1187,33 @@ def gen_step(self) -> None:
def gen_cleanup(self) -> None:
for gen in self.gens:
gen.gen_cleanup()


def get_expr_length(expr: Expression) -> int | None:
Copy link
Contributor Author

@BobTheBuidler BobTheBuidler Aug 3, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These 2 helper functions can be extended to cover more cases and used for other length-based optimizations I have in mind

@BobTheBuidler BobTheBuidler changed the title [mypyc] feat: cache len for iterating over immutable types [mypyc] feat: cache len for iterating over immutable types and expressions with length known at compile time Aug 3, 2025
@BobTheBuidler
Copy link
Contributor Author

BobTheBuidler commented Aug 6, 2025

This PR shows a very meager improvement (~0.3%) on self-check, but I'm not sure if self-check is a reliable benchmark for this particular set of changes. It's a pretty niche-case optimization

@BobTheBuidler
Copy link
Contributor Author

Would it make this one easier to review if I split it up? I thought keeping all the length stuff together made sense but I could plausibly do a separate PR just for the immutable type iteration stuff.

@JukkaL
Copy link
Collaborator

JukkaL commented Aug 13, 2025

Would it make this one easier to review if I split it up? I thought keeping all the length stuff together made sense but I could plausibly do a separate PR just for the immutable type iteration stuff.

Yes, this would benefit from being split into at least 2 PRs. Right now it's a bit difficult to see which changes are related to each other.

Copy link
Collaborator

@JukkaL JukkaL left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(Not a full review.)

abc: Final = "abc"

def test() -> None:
a = [str(x) for x in [*abc, *"def", *b"ghi", ("j", "k"), *("l", "m", "n")]]
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this would be better as a run test. It's pretty hard to review the IR for correctness.

Copy link
Contributor Author

@BobTheBuidler BobTheBuidler Aug 19, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The idea behind this IR test is the line r19 = PyList_New(13) which demonstrates mypyc successfully parsed the length of all star inputs and was able to predetermine the length of the resulting list object, which allows mypyc to use the list creation fastpath

abc: Final = "abc"

def test() -> None:
a = tuple(str(x) for x in [*abc, *"def", *b"ghi", ("j", "k"), *("l", "m", "n")])
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similarly this seems to work better as a run test.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Like in the other case, the idea behind this IR test is the line r19 = PyTuple_New(13) which demonstrates mypyc successfully parsed the length of all star inputs and was able to predetermine the length of the resulting tuple object

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll need to look into why the list is still being created with variable length, but that can be a separate PR

@BobTheBuidler BobTheBuidler changed the title [mypyc] feat: cache len for iterating over immutable types and expressions with length known at compile time [mypyc] feat: cache len for container creation from expressions with length known at compile time Aug 16, 2025
@BobTheBuidler BobTheBuidler requested a review from JukkaL August 19, 2025 17:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants