[Fix] permute fusion number of forward output and backward input is not match by RuibinCheung · Pull Request #161 · ROCm/TransformerEngine

RuibinCheung · 2025-03-31T08:06:01Z

Description

Fix a mismatch in permute fusion between the number of forward outputs and the number of backward inputs when inp.numel() is zero.

This is a bug in NVIDIA's upstream code. It was fixed in PR NVIDIA/TransformerEngine#1468, but that PR also involves a PTX-related feature, so it can't be cherry-picked directly for now.

When no tokens are dispatched to an MoE expert, the forward function returns two tensors, but the backward function accepts only one. This raises a runtime error because the number of arguments passed does not match the number of parameters.
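The mismatch described above can be illustrated with a toy model in plain Python. This is only a sketch and not the TransformerEngine code: `ToyPermute`, `FixedPermute`, and `run` are hypothetical names, `run` stands in for an autograd engine that passes one gradient per forward output to backward, and the list reversal stands in for the real permutation. The "fixed" variant shows one possible way to make the two sides agree; it is not necessarily how this PR does it.

```python
class ToyPermute:
    """Toy stand-in for an autograd function (hypothetical, not TE code)."""

    @staticmethod
    def forward(tokens):
        if len(tokens) == 0:
            # Early-exit path for empty input returns TWO values.
            return tokens, []
        # Normal path returns ONE value.
        return list(reversed(tokens))

    @staticmethod
    def backward(grad):
        # Accepts exactly ONE incoming gradient.
        return list(reversed(grad))


class FixedPermute(ToyPermute):
    """Same toy, but the empty-input path returns a single value as well."""

    @staticmethod
    def forward(tokens):
        if len(tokens) == 0:
            return tokens
        return list(reversed(tokens))


def run(fn, tokens):
    """Hypothetical mini-engine: one gradient per forward output."""
    out = fn.forward(tokens)
    outputs = out if isinstance(out, tuple) else (out,)
    grads = tuple([1.0] * len(o) for o in outputs)
    return fn.backward(*grads)


if __name__ == "__main__":
    print(run(ToyPermute, [1, 2, 3]))  # normal path works
    try:
        run(ToyPermute, [])  # two gradients reach a one-argument backward
    except TypeError as exc:
        print("mismatch:", exc)
    print(run(FixedPermute, []))  # empty path now matches backward
```

In this toy the failure surfaces as a Python `TypeError` on the empty-input path only, which is why the problem stays hidden until an expert receives zero tokens.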

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refactor

Changes

Please list the changes introduced in this PR:

Change A
Change B

Checklist:

I have read and followed the contributing guidelines
The functionality is complete
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes


RuibinCheung · 2025-04-01T03:10:23Z

@wangye805 PTAL

wangye805

LGTM

RuibinCheung changed the title ~~[Fix permute fusion number of forward output and backward input is not match~~ [Fix[ permute fusion number of forward output and backward input is not match Mar 31, 2025

[Fix] permute fusion number of forward output and backward input is n…

52d03cf

…ot match

RuibinCheung force-pushed the fix_permute branch from 217f3bf to 52d03cf Compare March 31, 2025 08:06

RuibinCheung changed the title ~~[Fix[ permute fusion number of forward output and backward input is not match~~ [Fix] permute fusion number of forward output and backward input is not match Apr 1, 2025

wangye805 self-requested a review April 1, 2025 04:23

wangye805 approved these changes Apr 1, 2025


wangye805 merged commit 3d2a780 into ROCm:dev Apr 1, 2025

RuibinCheung deleted the fix_permute branch April 1, 2025 08:04

wangye805 merged 1 commit into ROCm:dev from RuibinCheung:fix_permute
