
weight_quantize/weight_only_linear support Volta Arch #58082

Merged
merged 12 commits into from Nov 22, 2023

Conversation

@MARD1NO (Contributor) commented Oct 13, 2023

PR types

New features

PR changes

OPs

Description

  1. On the Volta arch, the weight-only kernels consume a row-major weight, so we do not need to transpose it.
  2. We have not written a row-major GEMV, so when M=1 and arch=70 it still uses the weight-only GEMM path.
  3. Since the weight layout changed on sm70, the weight_only_linear grad kernel may not suit sm70; here we simply require arch==80 when using the weight_only_linear grad operator.

Pcard-72603
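The three points in the description can be sketched as arch-dependent dispatch logic. This is an illustrative sketch only; the function names below are hypothetical and do not correspond to Paddle's internal code.

```python
# Hypothetical sketch of the arch-dependent dispatch described in the PR.
# All names here are illustrative, not Paddle's actual internals.

def needs_weight_transpose(arch: int) -> bool:
    # On Volta (sm70) the weight-only kernels take a row-major weight,
    # so no transpose is needed; sm80 uses the transposed layout.
    return arch >= 80

def pick_kernel(arch: int, m: int) -> str:
    # There is no row-major GEMV, so sm70 always falls back to the
    # weight-only GEMM path, even when M == 1.
    if m == 1 and arch >= 80:
        return "weight_only_gemv"
    return "weight_only_gemm"

def check_grad_supported(arch: int) -> None:
    # The grad kernel assumes the sm80 weight layout, so the PR
    # restricts the grad operator to arch == 80.
    if arch != 80:
        raise ValueError("weight_only_linear_grad requires arch == 80")
```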

@paddle-bot bot commented Oct 13, 2023

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@MARD1NO MARD1NO marked this pull request as ready for review October 13, 2023 08:55
@MARD1NO MARD1NO marked this pull request as draft October 13, 2023 09:15
@@ -3846,7 +3846,13 @@ void WeightOnlyLinearInferMeta(const MetaTensor& x,
const MetaTensor& bias,
const MetaTensor& weight_scale,
const std::string& weight_dtype,
const int32_t arch,
Contributor review comment:

An int passed by value does not need the const qualifier.

python/paddle/nn/quant/quantized_linear.py — comment resolved
@MARD1NO MARD1NO marked this pull request as ready for review October 13, 2023 09:59
@@ -2807,7 +2807,7 @@
backward: weight_only_linear_grad

- op : weight_quantize
args : (Tensor x, str algo="weight_only_int8")
args : (Tensor x, str algo = "weight_only_int8", int arch = -1)
Contributor review comment:

Change the default to 80.

@@ -146,9 +159,24 @@ def weight_only_linear(
... print(out.shape)
[1, 2, 32]
"""
if arch is None:
# Get SMVersion from device.
cuda_version = version.cuda()
Contributor review comment:

Wrap this code in a function that both APIs can call.

@MARD1NO MARD1NO changed the title Fix volta arch weight quantize error weight_quantize/weight_only_linear support Volta Arch Oct 20, 2023
@MARD1NO MARD1NO marked this pull request as draft November 1, 2023 05:33
@MARD1NO MARD1NO marked this pull request as ready for review November 1, 2023 05:33

paddle-ci-bot bot commented Nov 9, 2023

Sorry to inform you that f744bcb's CIs passed more than 7 days ago. To prevent PR conflicts, you need to re-run all CIs manually.

@CLAassistant commented Nov 21, 2023

CLA assistant check
All committers have signed the CLA.

@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ MARD1NO
❌ paddle


"paddle" does not appear to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

@yuanlehome (Contributor) left a comment

LGTM

@sunzhongkai588 (Contributor) left a comment

LGTM for docs~

@jeff41404 (Contributor) left a comment

weight_quantize and weight_only_linear added the default-value parameter arch=None, which is a backward-compatible upgrade, so approved.

@yuanlehome yuanlehome merged commit b391825 into PaddlePaddle:develop Nov 22, 2023
28 checks passed
SecretXV pushed a commit to SecretXV/Paddle that referenced this pull request Nov 28, 2023
…8082)

* fix volta arch weight quantize error

* set default arch as 0 and use getSMVersion to get device's arch automatically

* move getSmVersion to python api

7 participants