
[Arm] Fix fuse_attention support old_quant_format #10027

Conversation

Collaborator

@sprouteer sprouteer commented Feb 24, 2023

PR devices

Arm

PR types

Bug fixes

PR changes

OP

Description

  1. Fix fuse_attention to support old_quant_format: under the old quantization format, after the attention structure is fused, the pass that inserts a calib op in front of it must look up the scale in the fused_attention op's attributes (see the sketch after this list).
  2. Fix an infer-shape bug.
  3. Support reshape and transpose without the XShape output, dropout without the mask output, and either matmul or matmul_v2.
  4. Fix a compilation error when the SVE switch is enabled.
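
A minimal, self-contained sketch of the scale lookup from item 1, for readers unfamiliar with the old quantization format. The `OpDesc` type and the attribute name `Input0_scale` are illustrative stand-ins, not the actual Paddle-Lite pass API:

```cpp
// Sketch only: OpDesc and the attribute name are hypothetical stand-ins.
#include <iostream>
#include <map>
#include <optional>
#include <string>

// Hypothetical stand-in for an op descriptor with float attributes.
struct OpDesc {
  std::string type;
  std::map<std::string, float> float_attrs;

  std::optional<float> GetFloatAttr(const std::string& name) const {
    auto it = float_attrs.find(name);
    return it == float_attrs.end() ? std::nullopt
                                   : std::optional<float>(it->second);
  }
};

// Under the old quantization format, per-tensor scales live on the fused op
// itself rather than on separate quant/dequant ops, so the pass inserting a
// calib op in front of fused_attention reads the scale from its attributes.
std::optional<float> FindCalibScale(const OpDesc& fused_op,
                                    const std::string& input_scale_attr) {
  if (fused_op.type != "fused_attention") return std::nullopt;
  return fused_op.GetFloatAttr(input_scale_attr);
}

int main() {
  OpDesc fused{"fused_attention", {{"Input0_scale", 0.017f}}};
  if (auto scale = FindCalibScale(fused, "Input0_scale")) {
    // A real pass would create the calib op with this scale and rewire the
    // graph; here we just print it.
    std::cout << "calib scale = " << *scale << "\n";
  }
  return 0;
}
```

The point of the fix is only where the scale comes from: in the old format there is no standalone quant op upstream to copy it from, so the fused op's own attributes are the source of truth.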

@sprouteer sprouteer force-pushed the fix_fuse_attention_support_old_quant_format branch from 2cd5f5a to 7bc2da2 on March 7, 2023 06:37
Collaborator

@zhupengyang zhupengyang left a comment


LGTM

@sprouteer sprouteer merged commit f3e367f into PaddlePaddle:develop Mar 10, 2023