Skip to content

Conversation

@YuanRisheng
Copy link
Collaborator

@YuanRisheng YuanRisheng commented Sep 4, 2025

Ernie纯文与多模kv cache量化适配v1 loader,h20 4卡加载ernie45 21b模型,v1耗时40s,原版耗时43s,基本持平

@paddle-bot
Copy link

paddle-bot bot commented Sep 4, 2025

Thanks for your contribution!

@yuanlehome yuanlehome merged commit b3fac5b into PaddlePaddle:develop Sep 9, 2025
15 of 17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants