
【Infer】MLA matrix absorption separation #10249

Merged — 2 commits merged into develop on Mar 26, 2025

Conversation

@ckl117 (Contributor) commented on Mar 21, 2025

Before submitting

  • Lint code. If there are lint issues, please format the code first.
# Install and register `pre-commit` in the project folder
pip install pre-commit && pre-commit install

# Process previous code files separately
pre-commit run --file XXXX.py
  • Add test cases into the tests folder. If there are codecov issues, please add test cases first.

PR types

Performance optimization

PR changes

Others

Description

Separate DeepSeek's MLA matrix absorption to reduce GPU memory usage and improve peak throughput.
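For context, the general idea behind MLA matrix absorption is to keep the KV cache in DeepSeek's compressed latent space and fold the per-head K up-projection into the query (e.g. via a batched GEMM) instead of decompressing every cached token. The sketch below illustrates only that general idea with NumPy and hypothetical shapes (num_heads, d_nope, d_latent, seq_len are made up); it is not the PR's actual Paddle implementation.

# Illustrative sketch only (NumPy, hypothetical shapes) -- not the PR's Paddle code.
import numpy as np

num_heads, d_nope, d_latent, seq_len = 16, 128, 512, 64

q_nope    = np.random.randn(num_heads, 1, d_nope)         # per-head query at one decode step
w_uk      = np.random.randn(num_heads, d_nope, d_latent)  # per-head K up-projection
kv_latent = np.random.randn(seq_len, d_latent)            # compressed (latent) KV cache

# Naive path: decompress the whole cache to full per-head keys, then score.
k_full = np.einsum('sl,hdl->hsd', kv_latent, w_uk)        # (heads, seq, d_nope) -- large intermediate
scores_naive = np.einsum('hqd,hsd->hqs', q_nope, k_full)

# Absorbed path: fold w_uk into the query with a batched matmul,
# then score directly against the small latent cache.
q_absorbed = np.matmul(q_nope, w_uk)                      # (heads, 1, d_latent)
scores_absorbed = np.einsum('hql,sl->hqs', q_absorbed, kv_latent)

assert np.allclose(scores_naive, scores_absorbed)

Both paths produce the same attention scores, but the absorbed path never materializes the full per-head keys, which is where the memory saving comes from.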


paddle-bot bot commented Mar 21, 2025

Thanks for your contribution!

@ckl117 changed the title from 【Infer】bf16 batch gemm to 【Infer】MLA matrix absorption separation on Mar 21, 2025

codecov bot commented Mar 21, 2025

Codecov Report

Attention: Patch coverage is 0% with 70 lines in your changes missing coverage. Please review.

Project coverage is 49.96%. Comparing base (d1e156a) to head (63f3e2f).
Report is 11 commits behind head on develop.

Files with missing lines                                  | Patch % | Lines
...erimental/transformers/fused_transformer_layers.py     | 0.00%   | 56 Missing ⚠️
.../experimental/transformers/deepseek_v2/modeling.py     | 0.00%   | 14 Missing ⚠️

❌ Your patch status has failed because the patch coverage (0.00%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.
❌ Your project status has failed because the head coverage (49.96%) is below the target coverage (58.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop   #10249      +/-   ##
===========================================
+ Coverage    49.70%   49.96%   +0.25%     
===========================================
  Files          761      761              
  Lines       124218   124105     -113     
===========================================
+ Hits         61744    62009     +265     
+ Misses       62474    62096     -378     

☔ View full report in Codecov by Sentry.

@yuanlehome (Collaborator) commented:

Please also update the bf16/wint8 model graph construction accordingly.

@yuanlehome yuanlehome self-requested a review March 25, 2025 06:07
@ckl117 force-pushed the develop_absorption_batch_gemm_bf16 branch from 80fea35 to 63f3e2f on March 25, 2025 12:00
@yuanlehome (Collaborator) left a comment:
LGTM

@ZHUI ZHUI merged commit a0c08ba into PaddlePaddle:develop Mar 26, 2025
10 of 12 checks passed
ckl117 added a commit to ckl117/PaddleNLP that referenced this pull request Mar 27, 2025
* bf16 batch gemm

* bf16 and wint8 matrix_absorption
yuanlehome added a commit that referenced this pull request Mar 27, 2025
* 【Infer】MLA matrix absorption separation (#10249)

* bf16 batch gemm

* bf16 and wint8 matrix_absorption

* [MLA] move compute_out_linear out and fix bug when q_lora_rank is None (#10275)

---------

Co-authored-by: Yuanle Liu <yuanlehome@163.com>