Apply folders before canonicalize in heir-simd-vectorizer #601

j2kun · 2024-04-06T00:19:18Z

This new pass gives an empty set of patterns to the greedy pattern rewrite engine, which ends up applying each op's folding routine. That alone is enough to handle all examples in heir_simd_vectorizer.

Avoids the slowdown mentioned in #586
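
To make the "empty pattern set still folds" behavior concrete, here is a minimal toy sketch in plain C++. It is not MLIR and not the code in this PR: `Op`, `Program`, `Pattern`, `tryFold`, `applyGreedily`, and `makeExample` are all hypothetical names invented for illustration, and the driver is a simplified stand-in that only assumes what the description above says, namely that the greedy engine tries each op's folder in addition to whatever patterns it is given.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <optional>
#include <vector>

// Toy stand-in for an IR: a flat list of ops whose operands are indices of
// earlier ops. This is an illustration only, NOT MLIR's data model.
struct Op {
  enum Kind { Const, Add, Mul } kind;
  int64_t value = 0;       // Only meaningful when kind == Const.
  int lhs = -1, rhs = -1;  // Operand indices for Add/Mul.
};

using Program = std::vector<Op>;
// A rewrite pattern returns true if it changed the op at the given index.
using Pattern = std::function<bool(Program&, int)>;

// Per-op folder: if both operands are constants, compute the result.
std::optional<int64_t> tryFold(const Program& p, const Op& op) {
  if (op.kind == Op::Const) return std::nullopt;
  const Op& a = p[op.lhs];
  const Op& b = p[op.rhs];
  if (a.kind != Op::Const || b.kind != Op::Const) return std::nullopt;
  return op.kind == Op::Add ? a.value + b.value : a.value * b.value;
}

// Simplified greedy driver: sweep until nothing changes, trying the folder
// first and then each supplied pattern. Returns the number of rewrites.
int applyGreedily(Program& p, const std::vector<Pattern>& patterns) {
  int rewrites = 0;
  bool changed = true;
  while (changed) {
    changed = false;
    for (int i = 0; i < static_cast<int>(p.size()); ++i) {
      if (auto v = tryFold(p, p[i])) {
        p[i] = Op{Op::Const, *v};  // Replace the op with its folded constant.
        ++rewrites;
        changed = true;
        continue;
      }
      for (const auto& pat : patterns) {
        if (pat(p, i)) {
          ++rewrites;
          changed = true;
          break;
        }
      }
    }
  }
  return rewrites;
}

// Builds (2 + 3) * 4, the kind of constant arithmetic that loop unrolling
// tends to leave behind.
Program makeExample() {
  Program p;
  p.push_back(Op{Op::Const, 2});      // %0 = 2
  p.push_back(Op{Op::Const, 3});      // %1 = 3
  p.push_back(Op{Op::Add, 0, 0, 1});  // %2 = %0 + %1
  p.push_back(Op{Op::Const, 4});      // %3 = 4
  p.push_back(Op{Op::Mul, 0, 2, 3});  // %4 = %2 * %3
  return p;
}
```

Calling `applyGreedily(p, {})` on the example program folds both the add and the multiply even though no patterns were supplied, which is the effect the new pass relies on. The real driver also handles worklists, dead-op erasure, and constant materialization, none of which this sketch models.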

j2kun · 2024-04-06T00:49:48Z

Looks like the change added two extra rotations to the box_blur_64x64 IR, so I will investigate that on Monday.

j2kun · 2024-04-08T17:59:12Z

I didn't find the source of the additional rotations. Instead, I added some extra tensor_ext canonicalization patterns that restore the original behavior. I also included some minor cleanup discovered along the way.

asraa

Nice fix! Is there any downside in just running (void)applyPatternsAndFoldGreedily(getOperation(), std::move(patterns)); with empty patterns in InsertRotate before adding the other patterns and running the actual pass? (This would avoid the pass boilerplate, and other passes that need this as a pre-pass can do the same thing)

j2kun · 2024-04-09T23:15:54Z

> Nice fix! Is there any downside in just running (void)applyPatternsAndFoldGreedily(getOperation(), std::move(patterns)); with empty patterns in InsertRotate before adding the other patterns and running the actual pass? (This would avoid the pass boilerplate, and other passes that need this as a pre-pass can do the same thing)

I didn't do this because I wanted to try putting canonicalize between this new pass and insert-rotate. I suppose I could put it at the end of the loop unroll pass? I just think having it more visible will make it less surprising.

This new pass gives an empty set of patterns to the greedy pattern rewrite engine, which ends up applying each op's folding routine. That simplifies the IR enough to make a normal canonicalize pass fast. However, it makes the final IR less optimal for some tests by inserting unnecessary rotations, so I added a few canonicalization patterns to tensor_ext that restore the original behavior.

asraa · 2024-04-11T17:25:00Z

> I wanted to try putting canonicalize in between this new pass and insert-rotate. I suppose I could put it at the end of the loop unroll pass?

Ohhh, I see. Nah, putting it at the end of loop unroll seems like it'd be a surprise. I'd expect it to be applied as a precondition in the pass where it's needed.

j2kun requested review from asraa and AlexanderViand-Intel April 6, 2024 00:19

j2kun force-pushed the fold-only branch from 44d4058 to bd8533c Compare April 6, 2024 00:20

j2kun force-pushed the fold-only branch 4 times, most recently from 53fd239 to a3c90f4 Compare April 8, 2024 17:57

j2kun changed the title ~~Replace canonicalize with post-unroll-simplify~~ Apply folders before canonicalize in heir-simd-vectorizer Apr 8, 2024

This was referenced Apr 8, 2024

Avoid fully unrolling loops for insert-rotate #589

Open

Upgrade roberts cross and gx_kernel to 64x64 #604

Merged

asraa approved these changes Apr 9, 2024

j2kun force-pushed the fold-only branch from ec476a6 to 6be85b0 Compare April 10, 2024 03:42

j2kun added the pull_ready label Apr 10, 2024

asraa approved these changes Apr 11, 2024

copybara-service bot merged commit c9af3be into google:main Apr 11, 2024
9 checks passed
