Error in test suite: an illegal memory access was encountered #41340
Labels
high priority
module: cublas
Problem related to cublas support
module: cuda
Related to torch.cuda, and CUDA support in general
triaged
This issue has been looked at by a team member, and triaged and prioritized into an appropriate module
🐛 Bug
Running the test suite fails on our system. The issue seems to be with TestTorchDeviceTypeCUDA, where, starting with test_blas_alpha_beta_empty_cuda_float16, all tests fail with RuntimeError: CUDA error: an illegal memory access was encountered
To Reproduce
Steps to reproduce the behavior:
One of the tracebacks is:
Maybe related to #21819 or #36722
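Since CUDA kernel launches are asynchronous and an illegal memory access leaves the CUDA context in an error state, every test after the first faulting one can fail with the same message. A minimal sketch for running only the first failing test with synchronous launches follows. It assumes the test lives in test/test_torch.py of a PyTorch source checkout and that the file accepts a unittest-style test ID on the command line; the helper names build_isolated_run and run_isolated are hypothetical and not part of PyTorch.

```python
import os
import subprocess
import sys

# Assumption: path of the test file relative to the root of a PyTorch checkout.
TEST_FILE = "test/test_torch.py"
# Test name taken from the report above.
TEST_ID = "TestTorchDeviceTypeCUDA.test_blas_alpha_beta_empty_cuda_float16"


def build_isolated_run(test_file=TEST_FILE, test_id=TEST_ID, base_env=None):
    """Return (command, environment) for running a single test in isolation."""
    # Copy the environment so the caller's os.environ is not modified.
    env = dict(os.environ if base_env is None else base_env)
    # CUDA_LAUNCH_BLOCKING=1 makes kernel launches synchronous, so the error
    # is reported at the launch that caused it rather than at a later call.
    env["CUDA_LAUNCH_BLOCKING"] = "1"
    cmd = [sys.executable, test_file, test_id, "-v"]
    return cmd, env


def run_isolated():
    """Run the single test in a fresh process and return its exit code."""
    cmd, env = build_isolated_run()
    return subprocess.run(cmd, env=env).returncode
```

Calling run_isolated() from the root of the checkout starts a fresh process, so a broken CUDA context from earlier tests cannot affect the result. If the test still fails this way, the traceback should point at the call that actually faults.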
Environment
PyTorch version: 1.6.0-rc2
Is debug build: N/A
CUDA used to build PyTorch: N/A
OS: Red Hat Enterprise Linux Server release 7.8 (Maipo)
GCC version: (GCC) 8.3.0
CMake version: version 3.15.3
Python version: 3.7
Is CUDA available: N/A
CUDA runtime version: 10.1.243
GPU models and configuration:
GPU 0: Tesla K80
GPU 1: Tesla K80
GPU 2: Tesla K80
GPU 3: Tesla K80
Nvidia driver version: 450.36.06
cuDNN version: Could not collect
Versions of relevant libraries:
[pip3] numpy==1.17.3
cc @ezyang @gchanan @zou3519 @csarofeen @ptrblck @ngimel