[Paddle-Inference]support GQA in variable_length_memory_efficient_attention #58836

zhoutianzi666 · 2023-11-09T00:57:18Z

PR types: Others

PR changes: Others

Pcard-71500

sunzhongkai588

no docs changes. LGTM

yuanlehome

LGTM

[Paddle-Inference]support GQA in variable_length_memory_efficient_attention (PaddlePaddle#58836)

support GQA in

6970620

zhoutianzi666 changed the title ~~support GQA in~~ [Paddle-Inference]support GQA in variable_length_memory_efficient_attention Nov 9, 2023

commit

dab49cd

vivienfanghuagood approved these changes Nov 9, 2023

add UT

64df80c

sunzhongkai588 approved these changes Nov 9, 2023

yuanlehome approved these changes Nov 9, 2023

zhoutianzi666 merged commit f8137fb into PaddlePaddle:develop Nov 9, 2023
27 of 28 checks passed

zhoutianzi666 mentioned this pull request Nov 9, 2023

[llm] support GQA in chatglmv2 PaddlePaddle/PaddleNLP#7412

Merged
