[Flow] Change the definition of "dequantization" recognizer. #17711
Abbreviated Benchmark Summary
@ commit 88f32fab4c2503c4b8cf1d43f1480ae1467e5788 (vs. base 90f29a66d5bbd58167d84b2011d27c7ffb9a1ee1)
Regressed Latencies 🚩 [Top 3 out of 6 results showed]
Improved Latencies 🎉 [Top 3 out of 12 results showed]
Regressed Total Dispatch Sizes 🚩 [Top 3 out of 8 results showed]
Regressed Total Artifact Sizes 🚩
Improved Stream IR Dispatch Count (# of cmd.dispatch ops) 🎉 [Top 3 out of 11 results showed]
LGTM. @pashu123 you can check if the PR addresses your fusion issue, FYI.
compiler/src/iree/compiler/Dialect/Flow/Transforms/RegionOpUtils.cpp
Outdated
1e4b977 to abd591b
The dequantization recognizer currently requires the input indexing map to be an identity. This is overly conservative for newer quantization schemes. This change relaxes the logic to look only at operand ranks when checking whether an operation is a dequantization operation.
Signed-off-by: MaheshRavishankar <mahesh.ravishankar@gmail.com>
abd591b to a1888f9
883f0a1 to 27533a3
Signed-off-by: MaheshRavishankar <mahesh.ravishankar@gmail.com>
27533a3 to d8a3fe5
…g#17711)
The dequantization recognizer currently requires the input indexing map to be an identity. This is overly conservative for newer quantization schemes. This change relaxes the logic to look only at operand ranks when checking whether an operation is a dequantization operation.
Signed-off-by: MaheshRavishankar <mahesh.ravishankar@gmail.com>
Signed-off-by: Lubo Litchev <lubol@google.com>
The dequantization recognizer currently requires the input indexing map to be an identity. This is overly conservative for newer quantization schemes. This change relaxes the logic to look only at operand ranks when checking whether an operation is a dequantization operation.
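To make the difference between the two rules concrete, here is a minimal, self-contained sketch. It does not use MLIR and is not the code in RegionOpUtils.cpp: `OperandInfo`, `matchesWithIdentityMapRule`, and `matchesWithRankRule` are hypothetical names, and the exact rank condition (at least one input with the output's rank, no input with a higher rank) is an assumption about what "look at operand ranks" could mean, not something the description confirms.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for one input operand of an elementwise op.
// The real recognizer inspects MLIR operands and affine indexing maps.
struct OperandInfo {
  std::size_t rank;     // number of tensor dimensions of the operand
  bool hasIdentityMap;  // true if its indexing map is the identity
};

// Assumed model of the stricter rule: some input must have the output's
// rank AND be accessed through an identity indexing map.
bool matchesWithIdentityMapRule(const std::vector<OperandInfo> &inputs,
                                std::size_t outputRank) {
  for (const OperandInfo &in : inputs) {
    if (in.rank == outputRank && in.hasIdentityMap)
      return true;
  }
  return false;
}

// Assumed model of the relaxed rule: only ranks are compared. At least one
// input must have the output's rank, and no input may exceed it.
bool matchesWithRankRule(const std::vector<OperandInfo> &inputs,
                         std::size_t outputRank) {
  bool sawFullRankInput = false;
  for (const OperandInfo &in : inputs) {
    if (in.rank > outputRank)
      return false;
    if (in.rank == outputRank)
      sawFullRankInput = true;
  }
  return sawFullRankInput;
}
```

Under these assumptions, an op whose rank-3 quantized input is read through a non-identity map (for example a transposed access) alongside a rank-2 scale is rejected by the identity-map rule but accepted by the rank rule, which is the kind of case the description calls overly conservative.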