Skip to content

pad on broadcast dimensions hitting assert during transform replay #3660

@jjsjann123

Description

@jjsjann123
RuntimeError:  INTERNAL ASSERT FAILED at "/opt/pytorch/nvfuser/csrc/contiguity.cpp":51, please report a bug with repro script to NVFuser at https://github.com/NVIDIA/Fuser/issues. Error replaying transforms in contiguous ID checker, expected iS10{9} to be in the active ID set.
Exception raised from checkExclusivelyConsumesAllocs at /opt/pytorch/nvfuser/csrc/contiguity.cpp:51 (most recent call first):
frame #0: nvfuser::nvfCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0x103 (0x7f423ae23815 in /opt/pytorch/nvfuser/nvfuser/_C.cpython-312-x86_64-linux-gnu.so)
frame #1: nvfuser::nvfErrorFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0x62 (0x7f423b241462 in /opt/pytorch/nvfuser/nvfuser/_C.cpython-312-x86_64-linux-gnu.so)

Repro:

import torch
from nvfuser import FusionDefinition, DataType

def nvfuser_fusion_id11745(fd : FusionDefinition) -> None :
    T0 = fd.define_tensor(shape=[1, 1], contiguity=[None, None], dtype=DataType.Float, is_cpu=False, stride_order=[1, 0])
    S1 = fd.define_scalar(-3.09978, dtype=DataType.Double)
    T7 = fd.ops.pad(T0, [0, 2, 0, 2], S1)
    fd.add_output(T7)
    
with FusionDefinition() as fd:
    nvfuser_fusion_id11745(fd)
    
inputs = [
    torch.testing.make_tensor((1, 1), dtype=torch.float32, device='cuda:0'),
]
fd.execute(inputs)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions