Fix MOE tests by ksivaman · Pull Request #1476 · NVIDIA/TransformerEngine

ksivaman · 2025-02-11T14:38:15Z

Description

The code drop for 2.0 results in breaking certain tests in the CI. This PR fixes the MOE/permutation tests.

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refactoring

Checklist:

I have read and followed the contributing guidelines
The functionality is complete
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

timmoon10 · 2025-02-11T19:07:21Z

        return _ReshapeFunc.apply(self, shape)

+    @classmethod
+    def to_float8(


I prefer naming this quantize so that we can generalize to the other quantized tensor classes.

This is meant as a standalone quantization for a high precision tensor and I think it would be better for this to not conflict with existing quantize function in QuantizedTensor.

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

ptrendx

Wait wait wait. Why are we adding the to_float8 function again? We should just fix the tests to create and use the quantizer instead.

yaox12 · 2025-02-13T01:40:40Z

I think the to_float8 issues has been fixed in #1468 using dequantize. Can you review that PR?

ksivaman · 2025-02-18T03:51:08Z

@ptrendx The to_float8 function is meant purely as a utility/convenience function for one-time/throwaway quantization use cases that just creates a defaults quantizer and returns quantized tensor. Either way, closing as this change has been included in #1468

ksivaman added 2 commits February 11, 2025 13:01

Fix MOE

76b56ce

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

Cleanup Float8Tensor test

c6a6786

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

ksivaman requested a review from timmoon10 February 11, 2025 14:38

timmoon10 reviewed Feb 11, 2025

View reviewed changes

ksivaman added 2 commits February 12, 2025 10:46

Reviews

87bdf3d

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

Bug fixes

586e467

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

ptrendx requested changes Feb 12, 2025

View reviewed changes

ksivaman closed this Feb 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix MOE tests#1476

Fix MOE tests#1476
ksivaman wants to merge 4 commits into
NVIDIA:mainfrom
ksivaman:fix_fp8_tests

ksivaman commented Feb 11, 2025

Uh oh!

Uh oh!

timmoon10 Feb 11, 2025

Uh oh!

ksivaman Feb 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

ptrendx left a comment

Uh oh!

yaox12 commented Feb 13, 2025 •

edited

Loading

Uh oh!

ksivaman commented Feb 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ksivaman commented Feb 11, 2025

Description

Type of change

Checklist:

Uh oh!

Uh oh!

timmoon10 Feb 11, 2025

Choose a reason for hiding this comment

Uh oh!

ksivaman Feb 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ptrendx left a comment

Choose a reason for hiding this comment

Uh oh!

yaox12 commented Feb 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ksivaman commented Feb 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ksivaman Feb 12, 2025 •

edited

Loading

yaox12 commented Feb 13, 2025 •

edited

Loading