Fuse qkv and fix bart decoding #5111

Merged 2 commits on Mar 6, 2023

gongel (Member) commented Mar 6, 2023

PR types

New features & Bug fixes

PR changes

APIs

Description

  • Fuse the q, k, and v projections into a single projection for faster decoding.
  • Fix: during parallel decoding the cache size is zero, which caused wrong decoding results.
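
To illustrate the first point, here is a minimal sketch of what fusing qkv generally means: the three projection weights are concatenated column-wise so that one matrix multiply replaces three, and the result is split back into q, k, and v. This is not the code from this PR. The helper names (`matmul`, `fuse_qkv`, `split_qkv`, `random_matrix`) are hypothetical, biases are omitted, and plain Python lists are used only so the example runs without dependencies.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def fuse_qkv(w_q, w_k, w_v):
    """Concatenate three [hidden, hidden] weights column-wise into [hidden, 3 * hidden]."""
    return [q_row + k_row + v_row for q_row, k_row, v_row in zip(w_q, w_k, w_v)]


def split_qkv(qkv, hidden):
    """Split a fused [tokens, 3 * hidden] projection back into q, k, v."""
    q = [row[:hidden] for row in qkv]
    k = [row[hidden:2 * hidden] for row in qkv]
    v = [row[2 * hidden:] for row in qkv]
    return q, k, v


def random_matrix(rows, cols, rng):
    """Build a rows x cols matrix of values in [-1, 1] (illustrative data only)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
hidden, tokens = 4, 3
x = random_matrix(tokens, hidden, rng)
w_q, w_k, w_v = (random_matrix(hidden, hidden, rng) for _ in range(3))

# Unfused: three separate projections of the same input.
separate = (matmul(x, w_q), matmul(x, w_k), matmul(x, w_v))

# Fused: one projection against the concatenated weight, then a split.
fused = split_qkv(matmul(x, fuse_qkv(w_q, w_k, w_v)), hidden)
```

The two paths compute the same q, k, and v. The benefit on an accelerator comes from launching one larger matrix multiply instead of three smaller ones.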

TODO: Fix other similar models when upgrading to FT 5.x

paddle-bot bot commented Mar 6, 2023

Thanks for your contribution!

codecov bot commented Mar 6, 2023

Codecov Report

Merging #5111 (4b74d1d) into develop (9e23230) will increase coverage by 0.06%.
The diff coverage is 0.00%.

@@             Coverage Diff             @@
##           develop    #5111      +/-   ##
===========================================
+ Coverage    49.82%   49.89%   +0.06%     
===========================================
  Files          456      456              
  Lines        65500    65415      -85     
===========================================
+ Hits         32636    32637       +1     
+ Misses       32864    32778      -86     

| Impacted Files | Coverage Δ |
|---|---|
| ...lenlp/ops/fast_transformer/transformer/decoding.py | 8.25% <0.00%> (+0.53%) ⬆️ |
| ...s/fast_transformer/transformer/fast_transformer.py | 13.38% <ø> (ø) |
| paddlenlp/utils/downloader.py | 65.92% <0.00%> (+0.44%) ⬆️ |


@gongel gongel requested a review from guoshengCS March 6, 2023 05:38
@gongel gongel merged commit cfc5a47 into PaddlePaddle:develop Mar 6, 2023