
torch.rand can sample the upper bound for lower precision floating point dtypes on CUDA #96947

Closed
pmeier opened this issue Mar 16, 2023 · 0 comments
Assignees
peterbell10
Labels
high priority, module: bfloat16, module: correctness (silent), module: half, module: random, triaged

Comments


pmeier commented Mar 16, 2023

Documentation:

Returns a tensor filled with random numbers from a uniform distribution on the [half-open] interval [0,1)

import itertools

import torch

for device, dtype in itertools.product(
    ["cpu", "cuda"],
    [
        torch.float16,
        torch.bfloat16,
        torch.float32,
        torch.float64,
        torch.complex32,
        torch.complex64,
        torch.complex128,
    ],
):
    torch.manual_seed(0)
    # I only used this high number of samples to make sure the other dtypes are not affected
    # On my machine 1_000 was sufficient for the check to fail for bfloat16, 
    # and 10_000 for float16 and complex32
    t = torch.rand(10_000_000, dtype=dtype, device=device)
    if dtype.is_complex:
        t = torch.view_as_real(t)

    print(f"{dtype}, {device}: {'PASS' if (t != 1).all() else 'FAIL'}")
torch.float16, cpu: PASS
torch.bfloat16, cpu: PASS
torch.float32, cpu: PASS
torch.float64, cpu: PASS
torch.complex32, cpu: PASS
torch.complex64, cpu: PASS
torch.complex128, cpu: PASS
torch.float16, cuda: FAIL
torch.bfloat16, cuda: FAIL
torch.float32, cuda: PASS
torch.float64, cuda: PASS
torch.complex32, cuda: FAIL
torch.complex64, cuda: PASS
torch.complex128, cuda: PASS

Failures happen for float16, bfloat16, and complex32, and only on CUDA.

This was detected in #96331, which uses Tensor.uniform_ under the hood, but presumably it is the same kernel internally.
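
For reference, a minimal sketch of the same check written directly against Tensor.uniform_ (assuming a CUDA device is available; it should exercise the same code path as torch.rand):

import torch

torch.manual_seed(0)
# Tensor.uniform_ defaults to the range [0, 1), so sampling exactly 1.0
# on CUDA would exhibit the same rounding problem as torch.rand.
t = torch.empty(10_000_000, dtype=torch.bfloat16, device="cuda").uniform_()
print("PASS" if (t != 1).all() else "FAIL")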

cc @ezyang @gchanan @zou3519 @pbelevich

peterbell10 added a commit to peterbell10/pytorch that referenced this issue Mar 16, 2023
Fixes pytorch#96947

If we generate 1.0 - float_eps, the BFloat16 and Half constructors will round this to 1.0, which is outside of the half-open range. This changes the rounding of the last bit in the BFloat16 representation to never round up. As a result, we never go past the upper end point, and the from end point is now equally likely, where before it was only half as likely.
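
To see the rounding behavior the commit describes, here is a small illustration (a sketch only): torch.nextafter gives the largest float32 strictly below 1.0, and round-to-nearest conversion to the lower-precision dtypes pushes it back up to exactly 1.0.

import torch

# Largest float32 strictly below 1.0, i.e. 1.0 - 2**-24.
below_one = torch.nextafter(torch.tensor(1.0), torch.tensor(0.0))
print(below_one.item())              # 0.9999999403953552

# Converting with round-to-nearest lands exactly on 1.0,
# which is outside the half-open interval [0, 1).
print(below_one.to(torch.float16))   # tensor(1., dtype=torch.float16)
print(below_one.to(torch.bfloat16))  # tensor(1., dtype=torch.bfloat16)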
ezyang added the triaged, module: half, high priority, module: random, and module: correctness (silent) labels on Mar 16, 2023
yuantailing added a commit to yuantailing/pytorch that referenced this issue Mar 19, 2023
yuantailing added a commit to yuantailing/pytorch that referenced this issue Mar 19, 2023
@peterbell10 peterbell10 self-assigned this Mar 20, 2023
peterbell10 pushed a commit to peterbell10/pytorch that referenced this issue Mar 20, 2023