Add support for s8s8 dynamic quant #313

jondea · 2024-05-20T15:21:04Z

This will be used in PyTorch for accelerating dynamic quantization by using s8s8 matmul accelerated by ACL (see uxlfoundation/oneDNN#1885). This can go in without oneDNN 3.5, but will only offer performance benefits after it does.

Also, do you have a policy on copyright headers? I noticed that on this branch at least, most of the copyright headers were removed a few years ago. However, Arm's usual policy when contributing to open source projects is to include a copyright header on any file which is modified. Would this be acceptable? If not, is there somewhere else suitable to note copyright?

jgong5

The change LGTM. I'm not sure about the copyright though. cc @jingxu10 and @Guobing-Chen

jondea · 2024-06-04T07:05:58Z

Thanks @jgong5. Given that this file doesn't have an existing copyright notice, would it be acceptable to add an Arm copyright line in the top-level header file? This seems to be one of the only references to copyright on the ideep_pytorch branch:

ideep/include/ideep.hpp

Line 2 in a9e5602

jondea · 2024-08-07T12:09:30Z

I'm happy to get this in without the copyright notices, and work it out later. What's the process for getting this merged?

yanbing-j · 2024-08-13T05:25:21Z

Do you include this change in PyTorch CI? If so, I can merge directly.
cc @Xia-Weiwen to review more.

Xia-Weiwen · 2024-08-13T05:33:36Z

> Do you include this change in PyTorch CI? If so, I can merge directly. cc @Xia-Weiwen to review more.

LGTM

oneDNN+ACL has optimized kernels for s8s8 matmul, so input is signed. This change leaves behaviour on all other platforms the same. This change requires intel/ideep#313 to go in, and oneDNN 3.5 for the optimized kernels. This change speeds up dynamic quantized linear by ~10x.

Signed-off-by: Jonathan Deakin <jonathan.deakin@arm.com>

oneDNN+ACL has optimized kernels for s8s8 matmul, so input is signed. This change leaves behaviour on all other platforms the same. This change requires intel/ideep#313 to go in, and oneDNN 3.5 for the optimized kernels. This change speeds up dynamic quantized linear by ~10x.

Also, do you have a policy on copyright headers? Arm's usual policy when contributing to open source projects is to include a copyright header on any file which is modified. Would this be acceptable? If not, is there somewhere else suitable to note copyright?

Pull Request resolved: #126687
Approved by: https://github.com/jgong5, https://github.com/malfet, https://github.com/snadampal
Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>
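For readers unfamiliar with the term, "s8s8" means both matmul operands are signed 8-bit integers, and "dynamic" means the activation scale is computed from the input at run time. The sketch below is a pure-Python illustration of that idea only. It is not the ideep, oneDNN, or PyTorch implementation: the function names `quantize_s8`, `linear_s8s8`, and `linear_fp32` are hypothetical, and the symmetric per-tensor scheme with no zero point is an assumption chosen for brevity (the real code path may use a zero point and different rounding).

```python
# Illustrative model of a dynamically quantized linear layer with signed
# int8 activations and signed int8 weights (s8s8). All names are made up
# for this sketch; none of them come from ideep, oneDNN, or PyTorch.

def quantize_s8(values):
    """Symmetric per-tensor quantization to signed int8 in [-127, 127]."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127.0 if max_abs > 0.0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def linear_s8s8(x, weight):
    """Compute y = weight @ x using an int8 x int8 -> int32 style matmul."""
    n = len(x)
    # Weights would normally be quantized once, ahead of time.
    flat_q, w_scale = quantize_s8([w for row in weight for w in row])
    w_q = [flat_q[r * n:(r + 1) * n] for r in range(len(weight))]
    # "Dynamic": the activation scale is derived from this particular input.
    x_q, x_scale = quantize_s8(x)
    out = []
    for row in w_q:
        acc = sum(a * b for a, b in zip(x_q, row))  # integer accumulation
        out.append(acc * x_scale * w_scale)         # dequantize the result
    return out


def linear_fp32(x, weight):
    """Floating-point reference for comparison."""
    return [sum(a * b for a, b in zip(x, row)) for row in weight]
```

Comparing `linear_s8s8(x, weight)` with `linear_fp32(x, weight)` on small inputs shows the quantization error introduced by the 8-bit representation. The speedup quoted in the commit message comes from the optimized ACL kernels, which this sketch does not model.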

Add support for s8s8 dynamic quant

3147d01

jondea mentioned this pull request May 20, 2024

Enable optimized dynamic quantization on aarch64 pytorch/pytorch#126687

Closed

jgong5 approved these changes May 30, 2024

yanbing-j merged commit 8e21a72 into intel:ideep_pytorch Aug 13, 2024
