Fix several bugs on Hexagon and some cleanup #5570

dsharletg · 2020-12-17T07:50:01Z

This PR fixes bugs:

vmpa is not correctly interleaved after refactoring in VectorReduce peephole matching for Hexagon #5424. This PR reverts the changes associated with this. This does have test coverage in simd_op_check, but we aren't running apps/simd_op_check on the build bots currently.
LetStmt visitor in EliminateInterleaves is incorrect. This has been a bug for a long time, it only appears if there are dead lets in the IR, which doesn't happen in the default lowering configuration (but does happen when attempting to debug by commenting off some of the passes).

It also removes VtmpyGenerator. This has been disabled for a long time, and now we have VectorReduce. Additionally, I have a WIP pass in another branch that finds vector reductions in a target independent lowering pass. If we try to bring back automatic finding of stencil vector reductions, we should build it into that pass instead of fixing VtmpyGenerator.

dsharletg · 2020-12-17T07:50:31Z

@aankit-ca @pranavb-ca please take a look.

steven-johnson · 2020-12-17T17:41:42Z

Windows failure is unrelated

aankit-ca · 2020-12-21T20:59:54Z

@dsharletg For vmpa, I think we can simply change a01 to
Shuffle::make_interleave({mpys[0].first, mpys[1].first});
to make it work. This we way we can avoid adding the patterns in hvx_128.ll file.

aankit-ca · 2020-12-21T21:00:14Z

Rest LGTM

dsharletg · 2020-12-21T21:02:59Z

I think we need to do the interleaving in hvx_128.ll, because the interleaving is different. Shuffle::make_interleave interleaves the vectors before splitting to native vector widths, while this implementation interleaves them after splitting to native vector widths. What do you think?

aankit-ca · 2020-12-21T21:27:39Z

I'm sorry. I meant changing a01 to Shuffle::make_concat({mpys[0].first, mpys[1].first}); for vmpa
We will be calling make_interleave only to generate the scalar.

dsharletg · 2020-12-21T21:31:34Z

But I think the same problem applies. By doing Shuffle::make_concat, we concatenate the vectors before slicing to native vector widths. But we need to concatenate the two operands after slicing to native vector widths, which is what the runtime hvx_128.ll implementation does.

aankit-ca · 2020-12-21T21:48:55Z

Aah. Right. Yeah we will need to the concatenation in .ll

dsharletg · 2020-12-21T21:50:02Z

Cool, thanks for confirming :) This is a tricky issue, I've had this bug before myself.

Fix several bugs on Hexagon.

3177cc6

clang-format actually found a bug

ed7f14d

steven-johnson approved these changes Dec 21, 2020

View reviewed changes

dsharletg merged commit 5ac8808 into master Dec 21, 2020

dsharletg deleted the dsharletg/fix-hexagon branch December 21, 2020 21:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix several bugs on Hexagon and some cleanup #5570

Fix several bugs on Hexagon and some cleanup #5570

dsharletg commented Dec 17, 2020

dsharletg commented Dec 17, 2020

steven-johnson commented Dec 17, 2020

aankit-ca commented Dec 21, 2020

aankit-ca commented Dec 21, 2020

dsharletg commented Dec 21, 2020 •

edited

aankit-ca commented Dec 21, 2020 •

edited

dsharletg commented Dec 21, 2020

aankit-ca commented Dec 21, 2020

dsharletg commented Dec 21, 2020

Fix several bugs on Hexagon and some cleanup #5570

Fix several bugs on Hexagon and some cleanup #5570

Conversation

dsharletg commented Dec 17, 2020

dsharletg commented Dec 17, 2020

steven-johnson commented Dec 17, 2020

aankit-ca commented Dec 21, 2020

aankit-ca commented Dec 21, 2020

dsharletg commented Dec 21, 2020 • edited

aankit-ca commented Dec 21, 2020 • edited

dsharletg commented Dec 21, 2020

aankit-ca commented Dec 21, 2020

dsharletg commented Dec 21, 2020

dsharletg commented Dec 21, 2020 •

edited

aankit-ca commented Dec 21, 2020 •

edited