support fused weights for export_model #8554

ronny1996 · 2024-06-05T10:19:35Z

PR types

Others

PR changes

Others

Description

support fused weights for export_model

paddle-bot · 2024-06-05T10:19:40Z

Thanks for your contribution!

codecov · 2024-06-05T10:52:59Z

Codecov Report

Attention: Patch coverage is 0% with 21 lines in your changes missing coverage. Please review.

Project coverage is 53.96%. Comparing base (f36ed75) to head (ff61d4a).
Report is 4 commits behind head on develop.

Files	Patch %	Lines
...dlenlp/experimental/transformers/llama/modeling.py	0.00%	21 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8554      +/-   ##
===========================================
- Coverage    53.97%   53.96%   -0.01%     
===========================================
  Files          618      618              
  Lines        96827    96833       +6     
===========================================
+ Hits         52258    52260       +2     
- Misses       44569    44573       +4

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ronny1996 · 2024-06-05T13:56:52Z

paddlenlp/experimental/transformers/llama/modeling.py

-        self.embed_tokens.weight.set_value(paddle.to_tensor(state_dict["llama.embed_tokens.weight"]))
-        self.norm.weight.set_value(paddle.to_tensor(state_dict["llama.norm.weight"], dtype=self.norm.weight.dtype))
+        self.embed_tokens.weight.set_value(
+            paddle.to_tensor(state_dict["llama.embed_tokens.weight"]).cast(self.embed_tokens.weight.dtype)


.cast 支持原始权重为bfloat16

wawltor

LGTM

ronny1996 force-pushed the llama2_dev branch from 8dc2cf7 to 67a23e6 Compare June 5, 2024 10:48

ronny1996 force-pushed the llama2_dev branch 3 times, most recently from feeefad to 4896348 Compare June 5, 2024 13:12

support fused weights for export_model

ff61d4a

ronny1996 force-pushed the llama2_dev branch from 4896348 to ff61d4a Compare June 5, 2024 13:27

ronny1996 commented Jun 5, 2024

View reviewed changes

wawltor approved these changes Jun 5, 2024

View reviewed changes

wawltor merged commit 87edf28 into PaddlePaddle:develop Jun 6, 2024
7 of 12 checks passed

ronny1996 deleted the llama2_dev branch June 6, 2024 08:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support fused weights for export_model #8554

support fused weights for export_model #8554

ronny1996 commented Jun 5, 2024

paddle-bot bot commented Jun 5, 2024

codecov bot commented Jun 5, 2024 •

edited

Loading

ronny1996 Jun 5, 2024

wawltor left a comment

support fused weights for export_model #8554

support fused weights for export_model #8554

Conversation

ronny1996 commented Jun 5, 2024

PR types

PR changes

Description

paddle-bot bot commented Jun 5, 2024

codecov bot commented Jun 5, 2024 • edited Loading

Codecov Report

ronny1996 Jun 5, 2024

Choose a reason for hiding this comment

wawltor left a comment

Choose a reason for hiding this comment

codecov bot commented Jun 5, 2024 •

edited

Loading