[CIR][CIRGen] Add const attribute to alloca operations #892

Lancern · 2024-09-27T13:43:18Z

This PR tries to give a simple initial implementation for eliminating redundant loads of constant objects, an idea originally posted by OfekShilon.

Specifically, this PR adds a new unit attribute const to the cir.alloca operation. Presence of this attribute indicates that the alloca-ed object is declared const in the input source program. CIRGen is updated accordingly to start emitting this new attribute.

ChuanqiXu9

How does this patch deal with constant reference and constexpr?

ChuanqiXu9 · 2024-09-29T08:02:55Z

clang/lib/CIR/Dialect/Transforms/ConstLoadElimination.cpp

+  // - If there is a load operation that properly dominates it, replace the
+  //   load with that dominator load. This process is "recursive": if load A
+  //   dominates load B and load B dominates load C, we should eventually
+  //   replace load C with load A.


Why don't we record directly that A dominates C?
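One way to read the question above is as a scoped walk over the dominator tree that remembers the topmost load of each const slot, so every dominated load maps to it in a single step. The following is only a sketch on a toy model, not the code in this PR: `Load`, `FirstLoadFinder`, and the `domChildren` layout are hypothetical names, block 0 is assumed to be the entry block, and stores (including the initializing store) are ignored.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Hypothetical toy model, not CIR: a load reads one const slot in one block.
struct Load {
  int slot;   // which const alloca the load reads
  int block;  // index of the block that contains the load
};

class FirstLoadFinder {
public:
  // domChildren[b] lists the blocks immediately dominated by b. Block 0 is
  // assumed to be the entry block; loads are assumed to be in program order.
  FirstLoadFinder(const std::vector<std::vector<int>> &domChildren,
                  const std::vector<Load> &loads)
      : domChildren_(domChildren), loads_(loads),
        loadsInBlock_(domChildren.size()), replacement_(loads.size()) {
    for (std::size_t id = 0; id < loads.size(); ++id) {
      assert(loads[id].block >= 0 &&
             static_cast<std::size_t>(loads[id].block) < domChildren.size());
      loadsInBlock_[loads[id].block].push_back(static_cast<int>(id));
      replacement_[id] = static_cast<int>(id);  // default: keep the load
    }
    if (!domChildren.empty()) visit(0);
  }

  // replacement()[i] is the load that load i should be replaced with.
  const std::vector<int> &replacement() const { return replacement_; }

private:
  void visit(int block) {
    std::vector<int> added;  // slots whose first load is in this block
    for (int id : loadsInBlock_[block]) {
      auto it = firstLoad_.find(loads_[id].slot);
      if (it != firstLoad_.end()) {
        replacement_[id] = it->second;  // map straight to the topmost load
      } else {
        firstLoad_.emplace(loads_[id].slot, id);
        added.push_back(loads_[id].slot);
      }
    }
    for (int child : domChildren_[block]) visit(child);
    for (int slot : added) firstLoad_.erase(slot);  // leave this subtree
  }

  const std::vector<std::vector<int>> &domChildren_;
  const std::vector<Load> &loads_;
  std::vector<std::vector<int>> loadsInBlock_;
  std::vector<int> replacement_;
  std::unordered_map<int, int> firstLoad_;
};
```

With loads A, B, and C of the same slot placed in blocks that form a dominance chain, B and C both map to A without C ever being rewritten to B first. Loads in sibling subtrees of the dominator tree are left alone, since neither dominates the other.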

Lancern · 2024-09-30T08:26:50Z

How does this patch deal with constant reference [...]?

Constant references are not yet handled in this patch; I'll add them later!

[...] and constexpr?

Well, since constexpr variables are implicitly const, I believe they are covered by this patch. Note that during CodeGen (and CIRGen) quite a lot of constexpr variable references have already been evaluated to their values, so I believe there's not much we have to care about here.
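The language rule behind this claim can be checked at compile time: a `constexpr` specifier on an object declaration makes the declared type const-qualified. A minimal sketch follows; the function name is made up for illustration, and it only demonstrates the C++ rule, not what CIRGen emits.

```cpp
#include <type_traits>

// Returns true when both local declarations have a const-qualified type.
bool constexpr_locals_are_const() {
  constexpr int a = 42;  // constexpr on an object declaration implies const
  const int b = 42;      // explicitly const
  static_assert(std::is_const<decltype(a)>::value, "constexpr implies const");
  static_assert(std::is_same<decltype(a), decltype(b)>::value,
                "both are declared as const int");
  return std::is_const<decltype(a)>::value && std::is_const<decltype(b)>::value;
}
```

Since both declarations end up with the same const-qualified type, a check based on the declared type's qualifiers should treat them alike, which a dedicated test case can confirm.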

ChuanqiXu9 · 2024-09-30T08:40:52Z

Well, since constexpr variables are implicitly const, I believe they are covered by this patch.

In Decl, we have isInlineSpecified and isInline for the different cases, so I hesitated when I saw the use of isConstQualified here. I didn't actually check it, though. It may be helpful to add a test here.

Lancern · 2024-09-30T08:55:15Z

It may be helpful to add a test here.

Sounds good to me, I'll add a test along with the update for references.

Lancern · 2024-09-30T14:03:37Z

Two updates:

Added a test case for a local constexpr variable;
Added the const attribute to allocas for local reference variables.

bcardosolopes

Thanks for working on cool ideas @Lancern!

I think we can split this patch into two: (1) introduce the const + CIRGen + tests and (2) optimization on top of new alloca attribute.

For (2), I wonder if you tried to explore the path of teaching more traditional optimizations about these newly introduced properties (const in the example here). For example, a combo of:

I believe that in principle we should implement these hooks and try to get these optimizations for free from MLIR before we try to develop custom CIR ones. I wonder if you found any limitations while exploring those.

https://mlir.llvm.org/docs/Passes/#-sccp

Lancern · 2024-10-01T14:47:37Z

For (2), I wonder if you tried to explore the path of teaching more traditional optimizations about these newly introduced properties (const in the example here).

Not quite, I have only tried the combination with mem2reg. The optimization in this PR is quite orthogonal to mem2reg although they both optimize some simple cases like:

int produce_int();
int test() {
  const int x = produce_int();
  int a = x;
  int b = x;
  return a + b;
}

In this simple case, since x is only written once and its address does not escape, mem2reg could effectively eliminate all memory allocations and transform it into code similar to:

int produce_int();
int test() {
  const int x = produce_int();
  return x + x;
}

However, once the address of x escapes, for example in this case:

int produce_int();
void consume(const int &);
int test() {
  const int x = produce_int();
  int a = x;
  consume(x);
  int b = x;
  return a + b;
}

Since the allocation for x must be retained, mem2reg now becomes helpless. The load for a and the load for b cannot be eliminated by mem2reg. sccp is not helpful either since it does not reason about values in memory. Only the optimization in this PR could eliminate the loads for a and b. Moreover, the constness information is critical here to safely eliminate the load for b: without it, an optimizer cannot assume that consume leaves x unchanged, so it cannot eliminate the load for b.
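To make the role of constness concrete, here is a small runnable sketch. The callee `bump` and the global `observed` are hypothetical and not part of the examples above; `bump` writes through its const reference, which is well defined only because the object it receives is not itself const.

```cpp
// Hypothetical callee: takes a const reference but writes through it.
// This is well defined only when the referenced object is not const.
void bump(const int &r) { const_cast<int &>(r) += 1; }

// Well-behaved callee that only reads its argument.
int observed = 0;
void consume(const int &r) { observed = r; }

int test_mutable() {
  int x = 1;     // not const: a callee may legally modify it
  int a = x;
  bump(x);
  int b = x;     // must be reloaded, x is now 2
  return a + b;  // 1 + 2
}

int test_const() {
  const int x = 1;  // const object: writing to it would be undefined behavior
  int a = x;
  consume(x);
  int b = x;        // may reuse the value already loaded for a
  return a + b;     // 1 + 1
}
```

In `test_mutable` the second load cannot be folded into the first because the call changes x. In `test_const` no conforming callee can change x, so both loads must observe the same value regardless of what `consume` does.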

I think we can split this patch into two: (1) introduce the const + CIRGen + tests and (2) optimization on top of new alloca attribute.

OK, I'll split it later. I may draft a more detailed RFC along with PR (2) so we can all get a feel for the range and impact of this cool optimization.

bcardosolopes · 2024-10-01T19:20:36Z

Not quite, I have only tried the combination with mem2reg. The optimization in this PR is quite orthogonal to mem2reg...

It's orthogonal, but my point is that compiler optimizations usually work as a combination of multiple passes, not as ad hoc passes that do all the work and analysis at once.

In this simple case, since x is only written once and its address does not escape, mem2reg could effectively eliminate all memory allocations and transform it into code...

Are you saying mem2reg can generate transformations that allow this to be optimized without any of the changes from this PR?

Since the allocation for x must be retained, mem2reg now becomes helpless.

I understand where you are coming from and what you want to achieve, but I'm a bit worried about making assumptions about memory in an ad hoc fashion without, for example, the help of a proper alias analysis to feed in this information.

In general, what I'm trying to convey is that we should first start implementing the hooks for the existing passes MLIR provides and slowly enable them in our pipeline. Putting this in the context of this PR, I'd like to see how const can help in general, with small and sound pieces introduced in bites. I like the overall direction, but C++ is quite tricky, and I don't see any report of this optimization being applied to a significantly bigger piece of a code base, nor of its build time footprint or correctness guarantees, so it feels a bit too-optimistic-too-early to me.

Lancern · 2024-10-02T08:05:08Z

Are you saying mem2reg can generate transformations that allow this to be optimized without any of the changes from this PR?

For the simple case I showed in the previous comment, yes. But for more complex examples, we have to come up with a way to teach mem2reg (or any other existing optimizations) about the constness added in this PR.

In general, what I'm trying to convey is that we should first start implementing the hooks for the existing passes MLIR provides and slowly enable them in our pipeline.

I get your idea. You're conveying that after landing the constness attribute, a more practical way to make it useful is to first try teaching existing MLIR optimizations about the constness and see what they could already do. Do I understand it correctly?

Lancern · 2024-10-02T16:05:43Z

Updated, removed the transformation pass from this PR.

bcardosolopes · 2024-10-02T18:13:16Z

For the simple case I showed in the previous comment, yes. But for more complex examples, we have to come up with a way to teach mem2reg (or any other existing optimizations) about the constness added in this PR.

Neat, might be worth adding that testcase to current mem2reg tests.

I get your idea. You're conveying that after landing the constness attribute, a more practical way to make it useful is to first try teaching existing MLIR optimizations about the constness and see what they could already do. Do I understand it correctly?

This would be one interesting path to go along, yes. There are many possible paths though:

Mentioned above: add more existing passes to our pipeline and see what type of goodness you can get out of it (const would be a good example, but anything in general).
Some folks mention that LLVM cannot take advantage of source-level constness. One line of work here is to find out what information already exists but is not being propagated to LLVM to make that happen. Is it because some information didn't get propagated from the frontend, given that without CIR there's no way to propagate high-level info early in the pipeline? It's possible all we need is a simple analysis pass on top of CIR that propagates const info such that LLVM lowering can emit even more metadata and help LLVM optimizations kick in better. If I were working on this, it's probably where I'd start: give more info to LLVM so that existing LLVM optimizations can just do more work.
If you are really passionate about the pointer escaping aspect, you could find a way to integrate an escape analysis / alias analysis into the CIR pipeline. A good start would be to check with the MLIR community what's out there or if there's anything we could reuse / collaborate on.

One concern I have with the existing PR approach is that dominance checks can get expensive; you might need more caching or more conservative assumptions. Looking into how LLVM eliminates redundant loads may give you a few more insights into how these opts usually operate efficiently. Another caveat here is that ClangIR's support for building bigger codebases / benchmarks is currently WIP; it's probably going to get easier to get measurements / evaluate optimizations once we have a baseline for correctness and compile time.
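As an illustration of the caching idea, a common way to make repeated dominance queries cheap is to number the dominator tree once with depth-first entry and exit counters, after which each query is two integer comparisons. This is a sketch on a toy representation, not CIR or MLIR API: `DomTreeNumbering` and `domChildren` are hypothetical names, block 0 is assumed to be the entry block, and every block is assumed to be reachable from it.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper: precomputes DFS entry/exit numbers over a dominator
// tree so that each dominance query costs two integer comparisons.
class DomTreeNumbering {
public:
  // domChildren[b] lists the blocks immediately dominated by b. Block 0 is
  // assumed to be the entry block and the root of the tree.
  explicit DomTreeNumbering(const std::vector<std::vector<int>> &domChildren)
      : in_(domChildren.size(), 0), out_(domChildren.size(), 0) {
    int clock = 0;
    if (!domChildren.empty()) number(0, domChildren, clock);
  }

  // True when block a dominates block b (every block dominates itself).
  bool dominates(int a, int b) const {
    return in_[a] <= in_[b] && out_[b] <= out_[a];
  }

private:
  void number(int block, const std::vector<std::vector<int>> &domChildren,
              int &clock) {
    in_[block] = clock++;  // entry number: assigned before any descendant
    for (int child : domChildren[block]) number(child, domChildren, clock);
    out_[block] = clock++;  // exit number: assigned after all descendants
  }

  std::vector<int> in_, out_;
};
```

The numbering has to be recomputed whenever the control flow graph changes, so this trades query cost for invalidation cost. Whether that trade pays off depends on how often the pass mutates the CFG between queries.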

Lancern · 2024-10-10T13:42:12Z

Rebased onto the latest main.

Lancern requested review from lanza and bcardosolopes as code owners September 27, 2024 13:43

Lancern force-pushed the alloca-const branch 3 times, most recently from 0b45aea to e58e514 Compare September 27, 2024 17:41

ChuanqiXu9 reviewed Sep 29, 2024

Lancern force-pushed the alloca-const branch from e58e514 to ad9f9ec Compare September 30, 2024 14:00

Lancern force-pushed the alloca-const branch from ad9f9ec to a5f0feb Compare September 30, 2024 14:46

bcardosolopes requested changes Sep 30, 2024

lanza force-pushed the main branch from 605343a to 4c44bf1 Compare October 2, 2024 05:01

Lancern force-pushed the alloca-const branch from a5f0feb to 5204776 Compare October 2, 2024 16:05

Lancern changed the title ~~[CIR][Transform] Add constant load elimination pass~~ [CIR][CIRGen] Add const attribute to alloca operations Oct 2, 2024

smeenai force-pushed the main branch from 93f58cc to 73f84e8 Compare October 9, 2024 21:57

[CIR][CIRGen] Add const attribute to alloca operations

13b1c8b

This patch adds a new attribute `const` to the alloca operation to indicate that the corresponding local variable declaration is `const`-qualified. Future optimizations may find this new attribute useful.

Lancern force-pushed the alloca-const branch from 5204776 to 13b1c8b Compare October 10, 2024 13:41

bcardosolopes approved these changes Oct 11, 2024

bcardosolopes merged commit 959f03e into llvm:main Oct 11, 2024
6 checks passed

Lancern deleted the alloca-const branch October 11, 2024 00:50
