Skip to content

Conversation

@cqulilujia
Copy link
Contributor

小算子版本xpu_moe支持0-size输入,以适配VL模型中图像、文本中某一类型token可能为0的场景

@paddle-bot
Copy link

paddle-bot bot commented Oct 14, 2025

Thanks for your contribution!

@paddle-bot paddle-bot bot added the XPU label Oct 14, 2025
Copy link
Collaborator

@EmmonsCurse EmmonsCurse left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

ffn_out.shape[1],
permute_indices_per_token.shape[1],
)
if layer.reduce_results and layer.tp_size > 1:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个layer.reduce_results的意义是什么?在哪里定义的?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

GPU在适配VL模型时增加的一个参数,默认为true,加上这句不会影响文本模型只有。VL模型reduce_results取值false,在moe之后不需要进行all_reduce。

@EmmonsCurse EmmonsCurse merged commit a5063b9 into PaddlePaddle:develop Oct 16, 2025
43 of 50 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants