[RISCV] Combine trunc (sra sext (x), zext (y)) to sra (x, smin (y, scalarsizeinbits(y) - 1)) #65728

LWenH · 2023-09-08T09:06:08Z

For RVV, If we want to perform an i8 or i16 element-wise vector arithmetic right shift in the upper C/C++ program, the value to be shifted would be first sign extended to i32, and the shift amount would also be zero_extended to i32 to perform the vsra.vv instruction, and followed by a truncate to get the final calculation result, such pattern will later expanded to a series of "vsetvli" and "vnsrl" instructions later, this is because the RVV spec only support 2 * SEW -> SEW truncate. But for vector, the shift amount can also be determined by smin (Y, ScalarSizeInBits(Y) - 1)). Also, for the vsra instruction, we only care about the low lg2(SEW) bits as the shift amount.

Alive2: https://alive2.llvm.org/ce/z/u3-Zdr
C++ Test cases : https://gcc.godbolt.org/z/q1qE7fbha

…NFC. Add a series of pre-commit tests for later patch to perform trunc (sra sext(X), zext(Y)) -> sra (X, smin (Y, scalarsize(Y) - 1)) combine.

…alarsize(Y) - 1)) For i8/i16 element-wise vector arithmetic right shift, the src value would be first sign_extended to i32 and the shift amount would be zero_extended to i32 to perform the vsra instruction, and followed by a trunc to get the final calcualtion result. For RVV, the truncate would be lowered into n-levels TRUNCATE_VECTOR_VL to satisfy RVV's SEW*2->SEW truncate restriction, such pattern would be expanded into a series of "vsetvli" and "vnsrl" instructions later. For RVV, we can use smin(Y, ScalarSizeInBits(Y)-1) to determine the actual shift amount for the vsra instruction, because we only care about the low lg2(SEW) bits as the shift amount. For more transformation validation, please see alive2 links: https://alive2.llvm.org/ce/z/wXLrLT

LWenH · 2023-09-08T09:56:12Z

How to add reviewers like in phabricator to request for review in github? I can't find that button in github interfaces(I tried that in the right side "Reviewers" button but that button is not clickable, may be that need some permission in github?).

@topperc @lukel97
Request for review, thank you.

lukel97 · 2023-09-08T10:49:46Z

How to add reviewers like in phabricator to request for review in github? I can't find that button in github interfaces(I tried that in the right side "Reviewers" button but that button is not clickable, may be that need some permission in github?).

That's strange, clicking on the "Reviewers" label brings up this menu for me. I don't think you should need to be a member of the llvm-org since it's your own PR?

LWenH · 2023-09-08T11:22:30Z

Yeah, I tried that one, but it still unclickable. I even tried to restart the pull request to invoke that lable, but it still doesn't work. BTW, thank you for helping me to add reviewers.

lukel97 · 2023-09-08T12:01:13Z

No problem, I think you're right btw, it looks like only those with write access can request a review. I've flagged it in the discourse thread https://discourse.llvm.org/t/update-on-github-pull-requests/71540/105?u=lukel

LWenH · 2023-09-15T09:08:57Z

Kindly Ping.

topperc · 2023-09-15T15:27:53Z

llvm/lib/Target/RISCV/RISCVISelLowering.cpp

+        SDValue N10 = N1.getOperand(0);
+
+        if (N00.getValueType().isVector() &&
+            N00.getValueType() == N10.getValueType() && N->hasOneUse() &&


Why does N need to have a single use?

Yeah, I agree with you. There is no need to judge hasOneUse for N here. Address comment.

topperc

LGTM

LWenH · 2023-09-17T03:25:07Z

Hi, folks. I might still not have the write access to the github repository. Could reviewers merge this pull request for me please? BTW, I want to know how to get the write access for the github repository now？I didn't see the similar description in the new version of the github submission instructions. May be also need that write access to get the permission to add reviewers for pull requests in github right now?

Thank you for your time to read this question.

dtcxzyw · 2023-09-17T09:05:11Z

I will merge this pull request after this patch passes all regression tests on my machine (I cannot rerun the failed workflow on buildkite).
To obtain commit access, please refer to the developer policy:
https://llvm.org/docs/DeveloperPolicy.html#obtaining-commit-access.

LWenH · 2023-09-17T09:08:52Z

I will merge this pull request after this patch passes all regression tests on my machine (I cannot rerun the failed workflow on buildkite). To obtain commit access, please refer to the developer policy: https://llvm.org/docs/DeveloperPolicy.html#obtaining-commit-access.

Thank you folks, that's a big help for me.

…alarsizeinbits(y) - 1)) (llvm#65728) For RVV, If we want to perform an i8 or i16 element-wise vector arithmetic right shift in the upper C/C++ program, the value to be shifted would be first sign extended to i32, and the shift amount would also be zero_extended to i32 to perform the vsra.vv instruction, and followed by a truncate to get the final calculation result, such pattern will later expanded to a series of "vsetvli" and "vnsrl" instructions later, this is because the RVV spec only support 2 * SEW -> SEW truncate. But for vector, the shift amount can also be determined by smin (Y, ScalarSizeInBits(Y) - 1)). Also, for the vsra instruction, we only care about the low lg2(SEW) bits as the shift amount. - Alive2: https://alive2.llvm.org/ce/z/u3-Zdr - C++ Test cases : https://gcc.godbolt.org/z/q1qE7fbha

…alarsize(Y) - 1) Like llvm#65728, for i8/i16 element-wise vector logical right shift, the src value would be first zext to i32 and the shift amount would be zext to i32 to perform the vsrl instruction, and followed by a trunc to get the final calculation result. This would be expanded into a series of "vsetvli" and "vnsrl" instructions later. For RVV, the vsrl instruction only treats the lg2(sew) bits as the shift amount, so we can calculate the shift amount by using umin(Y, scalarsize(Y) - 1).

…alarsizeinbits(y) - 1)) (llvm#65728) For RVV, If we want to perform an i8 or i16 element-wise vector arithmetic right shift in the upper C/C++ program, the value to be shifted would be first sign extended to i32, and the shift amount would also be zero_extended to i32 to perform the vsra.vv instruction, and followed by a truncate to get the final calculation result, such pattern will later expanded to a series of "vsetvli" and "vnsrl" instructions later, this is because the RVV spec only support 2 * SEW -> SEW truncate. But for vector, the shift amount can also be determined by smin (Y, ScalarSizeInBits(Y) - 1)). Also, for the vsra instruction, we only care about the low lg2(SEW) bits as the shift amount. - Alive2: https://alive2.llvm.org/ce/z/u3-Zdr - C++ Test cases : https://gcc.godbolt.org/z/q1qE7fbha

LWenH added 2 commits September 8, 2023 16:49

[RISCV] Add pre-commit test for later trunc(sra(sext,zext)) combine, …

2f38484

…NFC. Add a series of pre-commit tests for later patch to perform trunc (sra sext(X), zext(Y)) -> sra (X, smin (Y, scalarsize(Y) - 1)) combine.

LWenH requested a review from a team as a code owner September 8, 2023 09:06

github-actions bot added the backend:RISC-V label Sep 8, 2023

LWenH marked this pull request as draft September 8, 2023 09:11

LWenH marked this pull request as ready for review September 8, 2023 09:17

LWenH closed this Sep 8, 2023

LWenH reopened this Sep 8, 2023

LWenH changed the title ~~[RISCV] Combine trunc (sra sext (X), zext (Y)) to sra (X, smin (Y, ScalarSizeInBits(Y) - 1))~~ [RISCV] Combine trunc (sra sext (x), zext (y)) to sra (x, smin (y, scalarsizeinbits(y) - 1)) Sep 8, 2023

lukel97 requested review from lukel97 and topperc September 8, 2023 10:47

topperc reviewed Sep 15, 2023

View reviewed changes

Address comment and reformat the code

53a4018

LWenH requested a review from topperc September 17, 2023 02:28

topperc approved these changes Sep 17, 2023

View reviewed changes

dtcxzyw merged commit ddae50d into llvm:main Sep 17, 2023

LWenH mentioned this pull request Oct 15, 2023

[RISCV] Combine trunc (srl zext (x), zext (y)) to srl (x, umin (y, scalarsizeinbits(y) - 1)) #69092

Closed

LWenH deleted the fixvsra branch October 31, 2023 09:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RISCV] Combine trunc (sra sext (x), zext (y)) to sra (x, smin (y, scalarsizeinbits(y) - 1)) #65728

[RISCV] Combine trunc (sra sext (x), zext (y)) to sra (x, smin (y, scalarsizeinbits(y) - 1)) #65728

Uh oh!

LWenH commented Sep 8, 2023 •

edited

Loading

Uh oh!

LWenH commented Sep 8, 2023

Uh oh!

lukel97 commented Sep 8, 2023

Uh oh!

LWenH commented Sep 8, 2023

Uh oh!

lukel97 commented Sep 8, 2023

Uh oh!

LWenH commented Sep 15, 2023 •

edited

Loading

Uh oh!

topperc Sep 15, 2023 •

edited

Loading

Uh oh!

LWenH Sep 15, 2023

Uh oh!

topperc left a comment

Uh oh!

LWenH commented Sep 17, 2023

Uh oh!

dtcxzyw commented Sep 17, 2023

Uh oh!

LWenH commented Sep 17, 2023

Uh oh!

Uh oh!

[RISCV] Combine trunc (sra sext (x), zext (y)) to sra (x, smin (y, scalarsizeinbits(y) - 1)) #65728

[RISCV] Combine trunc (sra sext (x), zext (y)) to sra (x, smin (y, scalarsizeinbits(y) - 1)) #65728

Uh oh!

Conversation

LWenH commented Sep 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LWenH commented Sep 8, 2023

Uh oh!

lukel97 commented Sep 8, 2023

Uh oh!

LWenH commented Sep 8, 2023

Uh oh!

lukel97 commented Sep 8, 2023

Uh oh!

LWenH commented Sep 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

topperc Sep 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LWenH Sep 15, 2023

Choose a reason for hiding this comment

Uh oh!

topperc left a comment

Choose a reason for hiding this comment

Uh oh!

LWenH commented Sep 17, 2023

Uh oh!

dtcxzyw commented Sep 17, 2023

Uh oh!

LWenH commented Sep 17, 2023

Uh oh!

Uh oh!

LWenH commented Sep 8, 2023 •

edited

Loading

LWenH commented Sep 15, 2023 •

edited

Loading

topperc Sep 15, 2023 •

edited

Loading