
🐛 [Bug] Compilation failure for HuggingFace T5-base Model #1583

Closed
gs-olive opened this issue Jan 10, 2023 · 2 comments · Fixed by #1584
Assignees: gs-olive
Labels: bug (Something isn't working)

Comments

@gs-olive (Collaborator)

Bug Description

When compiling the T5-base network (https://huggingface.co/t5-base), the following error is encountered:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper__index_select)

To Reproduce

Steps to reproduce the behavior (a sketch of these steps follows the list):

  1. Run torch_tensorrt.compile with the t5-base model as input, using fp32 precision.
  2. Choose two fixed-size inputs, each of shape [1, 128], and enable truncate_long_and_double with a 12 GB workspace.
  3. Pass in model keyword args to disable attention and hidden-state outputs.
  4. Run inference using the compiled model on two sample inputs.
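
The issue does not include the exact script, so the following is a minimal sketch of the four steps above. It assumes the TorchScript frontend, transformers' T5Model, and a hypothetical T5Wrapper module to route the two inputs by keyword (T5Model.forward's second positional argument is attention_mask):

```python
import torch
import torch_tensorrt
from transformers import T5Model

# Hypothetical wrapper so the traced module takes exactly the two [1, 128]
# inputs from step 2, routed to the right keyword arguments.
class T5Wrapper(torch.nn.Module):
    def __init__(self, model):
        super().__init__()
        self.model = model

    def forward(self, input_ids, decoder_input_ids):
        return self.model(input_ids=input_ids,
                          decoder_input_ids=decoder_input_ids)

# Step 3: disable attention and hidden-state outputs via model kwargs;
# torchscript=True makes the model return tuples, which tracing requires.
model = T5Model.from_pretrained(
    "t5-base",
    torchscript=True,
    output_attentions=False,
    output_hidden_states=False,
    use_cache=False,
).eval()

# Step 2: two fixed-size [1, 128] inputs (encoder and decoder token IDs).
input_ids = torch.randint(0, model.config.vocab_size, (1, 128))
decoder_input_ids = torch.randint(0, model.config.vocab_size, (1, 128))

traced = torch.jit.trace(T5Wrapper(model), (input_ids, decoder_input_ids))

# Step 1: compile in fp32 with truncate_long_and_double and a 12 GB workspace.
trt_model = torch_tensorrt.compile(
    traced,
    inputs=[
        torch_tensorrt.Input(shape=[1, 128], dtype=torch.int32),
        torch_tensorrt.Input(shape=[1, 128], dtype=torch.int32),
    ],
    enabled_precisions={torch.float32},
    truncate_long_and_double=True,
    workspace_size=12 << 30,  # 12 GB, in bytes
)

# Step 4: run inference on the compiled model with two sample inputs.
# With the model left on CPU (the default here), compilation fails with
# the device-mismatch RuntimeError quoted above.
out = trt_model(input_ids.cuda(), decoder_input_ids.cuda())
```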

Expected behavior

The model should compile successfully with Torch-TRT. Specifically, internal device-mismatch issues should either be surfaced with a warning at compile time or should otherwise not cause errors.

Environment

  • Torch-TensorRT Version: 1.4.0.dev0+f43be5b6
  • PyTorch Version: 1.14.0.dev20221114+cu116
  • CPU Architecture: Intel Xeon CPU
  • OS: Ubuntu 20.04
  • How you installed PyTorch: pip
  • Build command you used: python setup.py develop
  • Are you using local sources or building from archives: local
  • Python version: 3.8.13
  • CUDA version: 11.6

Additional context

The problem seems related to #1416, which was intended to address device-mismatch issues of this sort. Since this case is not caught by that PR, it likely arises in a different area, for example as a result of an internal computation in a Torch block.

@gs-olive gs-olive added the bug Something isn't working label Jan 10, 2023
@gs-olive gs-olive self-assigned this Jan 10, 2023
@gs-olive (Collaborator, Author)

The root cause is that various model-internal auxiliary tensors are initialized on the CPU. Running model.cuda() and placing both input tensors on the GPU resolves the compilation issue (see the sketch below).
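
A minimal sketch of that workaround, reusing the hypothetical names from the reproduction sketch above:

```python
# Initialize the model's internal auxiliary tensors on GPU by moving the
# whole module, and keep both sample inputs on the same device before
# tracing and compiling.
model = model.cuda().eval()
input_ids = input_ids.cuda()
decoder_input_ids = decoder_input_ids.cuda()
```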

This model is one operator away from full TensorRT support (requiring only aten::full_like). However, full compilation is not currently functional, since the model outputs are in Tuple form, which Torch-TensorRT does not yet support; this could warrant a new feature, as in #629.

@Christina-Young-NVIDIA (Collaborator)

Dheeraj assigned changes to George.
