Fix shape of new quantized tensor in `make_like` by ksivaman · Pull Request #1515 · NVIDIA/TransformerEngine

ksivaman · 2025-02-27T08:52:35Z

Description

In functions such as split, chunk, etc., where the output shape differs from the input shape, the returned tensor holds the correct data but reports an incorrect shape. This leads to bugs, e.g. in FSDP2 or checkpoint loading. A small repro:

import torch
from transformer_engine.pytorch.tensor.float8_tensor import Float8Quantizer
import transformer_engine_torch as tex

# Quantize a 4x4 tensor to FP8 (E4M3) with a unit scale.
t = torch.randn(4, 4)
quantizer = Float8Quantizer(
    scale=torch.full([1], 1.0, dtype=torch.float32, device="cuda"),
    amax=torch.empty([1], dtype=torch.float32, device="cuda"),
    fp8_dtype=tex.DType.kFloat8E4M3,
)
x = quantizer(t.cuda())

# Chunking along dim 0 should yield two tensors of shape (2, 4).
a, b = x.chunk(2, dim=0)
print(x.shape, a.shape, b.shape)
print(x, a, b)
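
For reference, the shapes the repro should report once the shape handling is correct can be worked out without a GPU. The helper below is a hypothetical pure-Python sketch (expected_chunk_shapes is not part of Transformer Engine or PyTorch); it mirrors the splitting rule of torch.chunk, where each piece holds ceil(n / chunks) entries along dim and the last piece may be smaller.

```python
import math


def expected_chunk_shapes(shape, chunks, dim=0):
    """Shapes torch.chunk is expected to produce for a tensor of `shape`.

    Hypothetical helper for illustration only; it never touches a tensor.
    `shape` must be a tuple of ints.
    """
    n = shape[dim]
    size = math.ceil(n / chunks)  # size of every piece except possibly the last
    shapes = []
    start = 0
    while start < n:
        piece = min(size, n - start)
        shapes.append(shape[:dim] + (piece,) + shape[dim + 1:])
        start += piece
    return shapes


# The repro chunks a (4, 4) tensor into two pieces along dim 0.
print(expected_chunk_shapes((4, 4), 2, dim=0))  # [(2, 4), (2, 4)]
```

A sanity test could compare a.shape and b.shape from the repro against these values.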

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refactoring

Changes

If data

Checklist:

I have read and followed the contributing guidelines
The functionality is complete
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

ksivaman · 2025-02-27T10:40:19Z

/te-ci pytorch

timmoon10

This is fine as a hack.

Rambling digression: I don't like the data kwarg. It's not generic, and there's no reason to expect this logic to stay valid in the future (FP6 will probably require a randomly sized blob of bytes). Also, if you're providing data you should probably also provide scale_inv (and maybe column-wise data as well). I think it's fine if it's an optional kwarg in Float8Tensor and MXFP8Tensor, but we should stop exposing it in QuantizedTensor.

ptrendx · 2025-02-27T23:03:11Z

I agree with Tim. Why not just pass the shape argument where make_like is called in float8_tensor.py:

diff --git a/transformer_engine/pytorch/tensor/float8_tensor.py b/transformer_engine/pytorch/tensor/float8_tensor.py
index 49bf4facf..0063b286a 100644
--- a/transformer_engine/pytorch/tensor/float8_tensor.py
+++ b/transformer_engine/pytorch/tensor/float8_tensor.py
@@ -402,7 +402,7 @@ class Float8Tensor(Float8TensorBase, QuantizedTensor):
                 [data] + list(args[1:]),
                 kwargs,
             )
-            return [Float8Tensor.make_like(tensor, data=split_tensor) for split_tensor in func_out]
+            return [Float8Tensor.make_like(tensor, data=split_tensor, shape=split_tensor.shape) for split_tensor in func_out]
         if func == aten.new_zeros.default:
             tensor = args[0]
             data = tensor._data
@@ -412,7 +412,7 @@ class Float8Tensor(Float8TensorBase, QuantizedTensor):
                 [data] + list(args[1:]),
                 kwargs,
             )
-            return Float8Tensor.make_like(tensor, data=func_out)
+            return Float8Tensor.make_like(tensor, data=func_out, shape=data.func_out.shape)
         if func == torch.ops.aten.as_strided.default:
             tensor = args[0]
             data = tensor._data
@@ -422,7 +422,7 @@ class Float8Tensor(Float8TensorBase, QuantizedTensor):
                 [data] + list(args[1:]),
                 kwargs,
             )
-            return Float8Tensor.make_like(tensor, data=func_out)
+            return Float8Tensor.make_like(tensor, data=func_out, shape=data.func_out.shape)
         if func == torch.ops.aten.detach.default:
             return cls.detach(args[0])
         if func == torch.ops.aten.clone.default:

I confirmed that it also resolves the given repro.
Also, could you add that repro to the sanity tests?

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

ksivaman · 2025-02-28T07:25:39Z

/te-ci pytorch

ksivaman · 2025-02-28T07:30:50Z

Added the shapes to the make_like calls in float8_tensor, but I'm also keeping the original change to quantized_tensor, since that is logically the correct shape to use when the tensor's and the data's shapes differ. We could remove the data kwarg from quantized_tensor completely, but that would require implementing make_like in Float8Tensor and MXFP8Tensor, so I'm leaving that for a different PR.
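
The reasoning above can be illustrated with a toy model. The sketch below is not Transformer Engine code: ToyQuantized, make_like_buggy and this make_like are hypothetical stand-ins. It assumes that the wrapper stores its logical shape separately from its data, and that the fix makes make_like fall back to the shape of the supplied data when no shape is passed. That second assumption is a reading of the thread, not something it states outright. Nested lists stand in for the raw data buffer.

```python
class ToyQuantized:
    """Hypothetical stand-in for a quantized tensor wrapper (not TE's API)."""

    def __init__(self, data, shape, scale):
        self.data = data      # raw payload (a list of rows in this toy)
        self.shape = shape    # logical shape reported to callers
        self.scale = scale    # metadata shared with tensors derived from this one

    @staticmethod
    def _shape_of(data):
        # Shape of a 2-D nested list.
        return (len(data), len(data[0]) if data else 0)

    @classmethod
    def make_like_buggy(cls, tensor, data=None):
        # Copies the source tensor's shape even when new data
        # with a different shape is supplied.
        new_data = tensor.data if data is None else data
        return cls(new_data, tensor.shape, tensor.scale)

    @classmethod
    def make_like(cls, tensor, data=None, shape=None):
        # An explicit shape wins; otherwise new data determines the shape,
        # and only without new data is the source tensor's shape reused.
        if data is None:
            data = tensor.data
            if shape is None:
                shape = tensor.shape
        elif shape is None:
            shape = cls._shape_of(data)
        return cls(data, shape, tensor.scale)


rows = [[0] * 4 for _ in range(4)]
x = ToyQuantized(rows, (4, 4), scale=1.0)
top = rows[:2]  # first half of a chunk(2, dim=0)

buggy = ToyQuantized.make_like_buggy(x, data=top)
fixed = ToyQuantized.make_like(x, data=top)
explicit = ToyQuantized.make_like(x, data=top, shape=(2, 4))

print(buggy.shape)     # (4, 4): stale shape copied from x
print(fixed.shape)     # (2, 4): follows the supplied data
print(explicit.shape)  # (2, 4): shape passed by the caller
```

Passing shape explicitly at the call site and defaulting to the data's shape inside make_like give the same result here, which is why keeping both changes is harmless in this toy model.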

* Fix quantized tensor shape
  Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* add shape to make_like; add test for chunk
  Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* Fix typo from suggestion
  Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
---------
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

Fix quantized tensor shape

5fd2bc7

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

ksivaman added the bug Something isn't working label Feb 27, 2025

ksivaman requested a review from timmoon10 February 27, 2025 08:52

timmoon10 approved these changes Feb 27, 2025


ksivaman added 3 commits February 28, 2025 12:01

Merge branch 'main' into fix_quantized_tensor_shape

f2a9927

add shape to make_like; add test for chunk

7034ede

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

Fix typo from suggestion

605d33e

Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>

ksivaman merged commit 9588109 into NVIDIA:main Feb 28, 2025

hungryGeek16 mentioned this pull request May 31, 2026

fix unfused padding causal sdpa #3063

Open

ksivaman merged 4 commits into NVIDIA:main from ksivaman:fix_quantized_tensor_shape
