-
-
Notifications
You must be signed in to change notification settings - Fork 31.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Move const folding to the peephole optimizer #126835
Comments
We had a discussion with @tomasr8 and decided that splitting this into parts would be good. Initially, I tried to do it in one PR, but realized that it would be impossible to review, as it would do a lot of unrelated things. To begin, we decided to start by moving tuple folding from AST optimizer to CFG. Problem: Currently compiler emits a warning if Possible solution: Perform this in a CFG after folding. This will be slower (I have no information on how slower it will be, but I assume it will) than performing this in codegen phase. Footnotes |
Is there a plan what parts/in what order should be moved from ast optimizer to cfg? Does it make sense to create sub issues to make it more granular (instead of just PRs linked to this issue)? This issue |
…en to CFG (#129426) Codegen phase has an optimization that transforms ``` LOAD_CONST x LOAD_CONST y LOAD_CONXT z BUILD_LIST/BUILD_SET (3) ``` -> ``` BUILD_LIST/BUILD_SET (0) LOAD_CONST (x, y, z) LIST_EXTEND/SET_UPDATE 1 ``` This optimization has now been moved to CFG phase to make #128802 work. Co-authored-by: Irit Katriel <1055913+iritkatriel@users.noreply.github.com> Co-authored-by: Yan Yanchii <yyanchiy@gmail.com>
I'm currently looking into moving unaryop/binop folding (though I'm at Fosdem atm so It might take a bit longer) Anything else is up for grabs I'd say, unless Kirill is working on something currently? |
I am looking into binop folding as well (almost done). Do you want to pass it or should I take something else? |
Problem with binop I discovered is that after moving it to cfg, we, for instance, run into problem with case matching which @Eclips4 mentioned in issue description. For example: match x:
case 0 + 0j:
pass raises File "/Users/yyanchii/Desktop/cpython/internal/t.py", line 2
case 0 + 0j:
^^^^^^
SyntaxError: patterns may only match literals and attribute lookups It is raised from: Lines 6022 to 6025 in 89fe067
So at codegen stage, it already expects (in this case) binops to be folded. @tomasr8 did you run into this already? We would need to add more logic in here which doesn't look good to me. On the other hand, how would we move binop folding to CFG then? |
I think the current priority is to move unaryop folding from AST optimizer to codegen phase. Then we should have to think about binaryop folding :) |
Placing this here, not to forget in future: #129568 (comment). Maybe in future we get rid of emitting |
cc @iritkatriel |
|
|
Sorry, ignore previous comments. I just realized it is controlled by |
I was also confused after reading "best effort", but I strongly feel that it does not have anything to do with constant folding. |
Yes, or just deprecate now and not worry about scheduling removal. We'll just document that they don't do anything anymore. |
Move folding of constant subscription from AST optimizer to CFG. Co-authored-by: Irit Katriel <1055913+iritkatriel@users.noreply.github.com>
I'm just now reading your messages - @WolframAlph I checked your binop PR and it's pretty much identical to what I had 👍 |
@Eclips4 @tomasr8 I have noticed that CFG already does tuple folding: Lines 1357 to 1395 in 0664c1a
So folding it in ast optimizer: Lines 558 to 568 in 0664c1a
can be just removed and CFG is already prepared for that. Indeed, when I remove tuple folding from ast optimizer, I still get folded tuple when I dis some constant tuples. So CFG already handles that. |
@tomasr8 there is also string formatting left. |
@WolframAlph ok if I tackle string formatting then? |
Go for it, I haven't started it yet. |
I've been going through expression folding code in Lines 765 to 770 in e65e9f9
If we don't migrate this to CFG, we would need to keep a lot of expression related recursive boilerplate code just for this case. I got it migrated locally successfully, but it requires passing optimization level + names dict from compiler unit metadata to _PyCfg_OptimizeCodeUnit . Problem is that would affect test internal c api code as this function is called from there as well (indirectly). @iritkatriel what do you think would be the best course here? Maybe we just create default values implicitly for optimization level & names for internal c api test? But then there will be no way to test this. Seems like a lot of changes required just for this special case.
|
@WolframAlph Do we really want to move it? Maybe we should just leave it as it is. I'm sure it's not worth the trouble of moving it to another stage |
We don't have to, but then we have to keep boilerplate code in |
Yes, let's leave this in ast_opt.c. This file is not going away, and it needs to continue to perform a full AST traversal (so keep all the recursion). This pass will be used for things which are not const-folding optimisations, like detecting syntax errors. We will rename it at some point. So leave the recursion, and leave the |
One other thing that we should also probably leave in Lines 619 to 626 in 798f8d3
As for string formatting: I had a look at the code and I think it's safer if we keep that in the optimizer as well. As Yan pointed out, this optimization can end up with more instructions than it started with (e.g. it turns |
Or creating new block with enough space to optimize and then re-link old and new. Also ugly IMO. |
Yeah either way I think it adds a lot of complexity for little benefit.. but I guess that's fine, not everything has to necessarily go |
I would like to address how we are searching & constructing constant sequences for constant foldings. Currently we collect constant sequence with: Lines 1354 to 1386 in 990ad27
Benefit is (arguably) that we need single pass and the code is simpler, but the drawback is that regardless of whether it will or it will not be a constant sequence, we always pre-allocate tuple of requested size. Considering that most of the cases it won't be a constant sequence, this seems to be quite wasteful. This becomes even more concerning when it comes to (for example) binary operations. Now, this would pre-allocate tuple for every single binary operation existing, regardless of whether it is constant or not. Splitting this logic into 2 parts for 1: counting and 2: constructing resulting tuple would improve the situation a lot I believe. But there is another thing to consider. Some of the foldings directly use resulting tuple as final constant, but some (binop, unaryop) use it only to get their operands and then deallocate it anyway, which also seems to be quite unnecessary (even if the event of folding constant is much less likely than otherwise). Therefore I would like to propose to change the way how we search for constant sequence. Instead, we could do single pass to collect indices of instructions that load constants, something like this: static bool
get_const_instruction_sequence(basicblock *bb, int start, int size, int *indices)
{
assert(start < bb->b_iused);
assert(size <= STACK_USE_GUIDLINE);
for (; start >= 0 && size > 0; start--) {
cfg_instr *instr = &bb->b_instr[start];
if (instr->i_opcode == NOP) {
continue;
}
if (!loads_const(instr->i_opcode)) {
return false;
}
indices[--size] = start;
}
return size == 0;
} And calling function would need to do something like this (binop for example): int indices[2];
if (!get_const_instruction_sequence(bb, i-1, 2, indices)) {
return SUCCESS;
} Calling function would be responsible to make sure it calls As a result, we would construct constant tuples only when necessary. This would also make noping out instructions faster as we already know indices and don't have to make unnecessary iterations as we do now. WDYT @iritkatriel @Eclips4 @tomasr8 ? |
Makes sense to me. |
… sequence during constant folding (#131054)
…nstant sequence during constant folding (python#131054)
Little update. All expressions foldings (except for string formatting and |
@iritkatriel what is left to be done on this? One thing coming to my mind is cleaning up test cases, as there are cases IIRC when different tests test the same thing/some outdated tests etc... But not sure if it is related to this task. |
Cleaning up the tests is a good idea, we can do it on this issue. Another thing is renaming ast_opt.c and |
Tests cleanup PR: #131826 |
@iritkatriel I suppose all structs/functions using |
Which ones? There is some stuff that is about optimization, so case by case basis. |
For instance: Lines 18 to 26 in 3796884
cpython/Include/internal/pycore_compile.h Lines 37 to 52 in 3796884
|
Those can be renamed. |
Feature or enhancement
Proposal:
For additional context see #126830 (comment).
Flow graph optimizer has more information and can do better job.
The problem is that we need to convert from
UNARY_OP(-, CONST(1))
toCONST(-1)
, still before the code generation phase, because this leads to a few problems, one of which is shown below.cc @markshannon
Has this already been discussed elsewhere?
No response given
Links to previous discussion of this feature:
No response
Linked PRs
optimize_if_const_subscr
refleaks #129634test_peepholer.py::TestTranforms
#131826The text was updated successfully, but these errors were encountered: