
Sample Programs to Convert Torchvision Classification Pre-Trained Model to NVIDIA TensorRT


torchvision2trt-samples

Read this in Japanese 日本語

What does this application do?

  • This repository provides a collection of Jupyter notebooks that demonstrate how to convert Torchvision pre-trained models to NVIDIA TensorRT.
  • You can also learn how to develop a TensorRT custom layer with NVIDIA CUDA and NVIDIA cuDNN from the sample TensorRT plugin contained in this repository.

Jupyter notebooks

  1. PyTorch inference (torchvision_normal.ipynb)
    This notebook shows how to run GPU inference with PyTorch.

  2. TensorRT inference with ONNX model (torchvision_onnx.ipynb)
    This notebook shows how to convert a pre-trained PyTorch model to an ONNX model first, and then run inference on the ONNX model with TensorRT (see the first sketch after this list).

  3. TensorRT inference with Torch-TensorRT (torchvision_torch_tensorrt.ipynb)
    This notebook shows how to import a pre-trained PyTorch model into TensorRT with Torch-TensorRT.
    You need to install Torch-TensorRT in the Docker container separately. Please refer to "Install Torch-TensorRT" for the details.
    Jetson Nano/TX1/TX2 are not supported by this notebook.

  4. TensorRT Inference with TensorRT API (torchvision_trtapi.ipynb)
    This notebook shows how to import pre-trained PyTorch model data (weights and biases) into a user-defined network built with the TensorRT API. It also shows how to use custom layers with the TensorRT API (see the second sketch after this list).
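
A minimal sketch of the workflow covered by the first two notebooks is shown below: run GPU inference with a Torchvision pre-trained model, export it to ONNX, and parse the ONNX file into a TensorRT network. The model choice (resnet18), file name, input shape, and opset are illustrative assumptions; the notebooks may use different ones.

    import torch
    import torchvision
    import tensorrt as trt

    # Load a Torchvision pre-trained classification model and run GPU inference
    model = torchvision.models.resnet18(pretrained=True).eval().cuda()
    x = torch.randn(1, 3, 224, 224, device="cuda")  # dummy input batch
    with torch.no_grad():
        scores = model(x)

    # Export the same model to ONNX so that TensorRT can import it
    torch.onnx.export(model, x, "resnet18.onnx",
                      input_names=["input"], output_names=["output"],
                      opset_version=11)

    # Parse the ONNX file into a TensorRT network definition
    logger = trt.Logger(trt.Logger.WARNING)
    builder = trt.Builder(logger)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, logger)
    with open("resnet18.onnx", "rb") as f:
        if not parser.parse(f.read()):
            for i in range(parser.num_errors):
                print(parser.get_error(i))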
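
The fourth notebook instead builds the network layer by layer. A rough sketch of the idea, assuming a resnet18 state_dict and illustrative layer parameters, is shown below; the notebook defines the complete network and also registers the custom pooling plugin.

    import tensorrt as trt
    import torchvision

    model = torchvision.models.resnet18(pretrained=True).eval()
    state_dict = model.state_dict()

    logger = trt.Logger(trt.Logger.WARNING)
    builder = trt.Builder(logger)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))

    # Define the input tensor of the user-defined network
    data = network.add_input("input", trt.float32, (1, 3, 224, 224))

    # Re-use the PyTorch weights (as NumPy arrays) for the first convolution
    conv1_w = state_dict["conv1.weight"].numpy()
    conv1 = network.add_convolution_nd(data, num_output_maps=64,
                                       kernel_shape=(7, 7),
                                       kernel=trt.Weights(conv1_w))
    conv1.stride_nd = (2, 2)
    conv1.padding_nd = (3, 3)
    # ... the remaining layers (batch norm, ReLU, pooling, FC) are added the same way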

Prerequisites

Jetson

dGPU

  • An x86 64-bit computer with an NVIDIA GPU card
  • NVIDIA NGC Account

Installation (For Jetson)

  • This application can be installed with a Dockerfile, so you don't need to set up the environment manually.
  • This application is built on the Machine Learning for Jetson/L4T container image distributed through NVIDIA NGC.

Change docker configuration

  1. Set the default docker runtime to nvidia as described at this link

  2. Reboot your Jetson

Increase swap memory (Only for Jetson Nano)

The default 2GB of swap memory is insufficient. Increase it to 4GB as described at JetsonHacks - Jetson Nano – Even More Swap.
You need to restart the Jetson after expanding the swap memory.

Build a docker image locally

  1. Clone this repository.
    git clone https://github.com/MACNICA-CLAVIS-NV/torchvision2trt-samples
    
  2. Build a docker image
    cd torchvision2trt-samples
    
    ./scripts/docker_build.sh
    

Install Torch-TensorRT

Please note that only the Torch-TensorRT sample (torchvision_torch_tensorrt.ipynb) requires this installation.

After the container build, please install Torch-TensorRT with the install_torch_tensorrt notebook.

[ Only for JetPack 5.0 / L4T(Jetson Linux) 34.1 or later ]

  1. Launch a named (persistent) container with the docker_run_named.sh script.

    ./scripts/docker_run_named.sh
    
  2. Open localhost:8888 from a Web browser, and input the password "nvidia".

  3. You can find the "install_torch_tensorrt" notebook in the /torchvision2trt-samples directory. Please follow the instructions in the notebook. The build process takes about one hour. After the build completes, exit from Jupyter, then exit from the Docker container.

  4. Commit the container to the image.

    ./scripts/docker_commit.sh
    
  5. Now you can remove the container.

    sudo docker rm my-torchvision2trt-samples
    

Installation (For dGPU)

Build a docker image locally

  1. Clone this repository.
    git clone https://github.com/MACNICA-CLAVIS-NV/torchvision2trt-samples
    
  2. Build a docker image
    cd torchvision2trt-samples
    
    ./scripts/docker_build_x86.sh
    

Torch-TensorRT is preinstalled in the PyTorch container image that serves as the base image of this application.
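
Once Torch-TensorRT is available (preinstalled for dGPU, built via the install notebook on Jetson), the Torch-TensorRT notebook compiles a Torchvision model along the lines of the following sketch. The model choice, input shape, and precision setting are illustrative assumptions.

    import torch
    import torchvision
    import torch_tensorrt

    model = torchvision.models.resnet18(pretrained=True).eval().cuda()

    # Compile the model into a TensorRT-optimized module
    trt_model = torch_tensorrt.compile(
        model,
        inputs=[torch_tensorrt.Input((1, 3, 224, 224), dtype=torch.float32)],
        enabled_precisions={torch.float32},
    )

    x = torch.randn(1, 3, 224, 224, device="cuda")
    with torch.no_grad():
        out = trt_model(x)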

Usage

On Jetson Nano, you may sometimes see a low-memory warning on the L4T desktop while running these notebooks. In that case, log out of the desktop, log in to the Jetson Nano from your PC over the network, and open the notebooks remotely in a Web browser on your PC. This method reduces the Jetson Nano's memory usage.

  1. Run a docker container generated from the image built above.

    For Jetson

    ./scripts/docker_run.sh
    

    For dGPU

    ./scripts/docker_run_x86.sh
    
  2. Open localhost:8888 from a Web browser, and input the password "nvidia".

  3. You can find these samples in the /torchvision2trt-samples directory, as shown in the following screenshot. Screenshot1

How to rebuild the pooling plugin library

  1. Open a terminal (click the terminal button as shown in the following figure.)
    Screenshot1

  2. Follow the instructions below.

    cd /torchvision2trt-samples/plugin
    
    protoc --cpp_out=./ --python_out=./ trt_plugin.proto
    
    mv trt_plugin.pb.cc trt_plugin.pb.cpp
    
    rm -rf build
    
    mkdir build
    
    cd build
    
    cmake ..
    
    -- The CXX compiler identification is GNU 7.5.0
    -- The CUDA compiler identification is NVIDIA 10.2.89
    -- Check for working CXX compiler: /usr/bin/c++
    -- Check for working CXX compiler: /usr/bin/c++ -- works
    -- Detecting CXX compiler ABI info
    -- Detecting CXX compiler ABI info - done
    -- Detecting CXX compile features
    -- Detecting CXX compile features - done
    -- Check for working CUDA compiler: /usr/local/cuda/bin/nvcc
    -- Check for working CUDA compiler: /usr/local/cuda/bin/nvcc -- works
    -- Detecting CUDA compiler ABI info
    -- Detecting CUDA compiler ABI info - done
    -- Looking for C++ include pthread.h
    -- Looking for C++ include pthread.h - found
    -- Looking for pthread_create
    -- Looking for pthread_create - not found
    -- Looking for pthread_create in pthreads
    -- Looking for pthread_create in pthreads - not found
    -- Looking for pthread_create in pthread
    -- Looking for pthread_create in pthread - found
    -- Found Threads: TRUE  
    -- Found Protobuf: /usr/lib/aarch64-linux-gnu/libprotobuf.so;-lpthread (found version "3.0.0") 
    -- Configurable variable Protobuf_VERSION set to 3.0.0
    -- Configurable variable Protobuf_INCLUDE_DIRS set to /usr/include
    -- Configurable variable Protobuf_LIBRARIES set to /usr/lib/aarch64-linux-gnu/libprotobuf.so;-lpthread
    -- Found CUDA: /usr/local/cuda (found version "10.2") 
    -- Configurable variable CUDA_VERSION set to 10.2
    -- Configurable variable CUDA_INCLUDE_DIRS set to /usr/local/cuda/include
    -- Found CUDNN: /usr/include  
    -- Found cuDNN: v?  (include: /usr/include, library: /usr/lib/aarch64-linux-gnu/libcudnn.so)
    -- Configurable variable CUDNN_VERSION set to ?
    -- Configurable variable CUDNN_INCLUDE_DIRS set to /usr/include
    -- Configurable variable CUDNN_LIBRARIES set to /usr/lib/aarch64-linux-gnu/libcudnn.so
    -- Configurable variable CUDNN_LIBRARY_DIRS set to 
    -- Found TensorRT: /usr/lib/aarch64-linux-gnu/libnvinfer.so (found version "..") 
    -- Configurable variable TensorRT_VERSION_STRING set to ..
    -- Configurable variable TensorRT_INCLUDE_DIRS set to /usr/include/aarch64-linux-gnu
    -- Configurable variable TensorRT_LIBRARIES set to /usr/lib/aarch64-linux-gnu/libnvinfer.so
    -- Configuring done
    -- Generating done
    -- Build files have been written to: /torchvision2trt-samples/plugin/build
    
    make
    
    Scanning dependencies of target PoolingPlugin
    [ 12%] Building CUDA object CMakeFiles/PoolingPlugin.dir/PoolingAlgo.cu.o
    [ 25%] Building CXX object CMakeFiles/PoolingPlugin.dir/CudaPooling.cpp.o
    [ 37%] Building CXX object CMakeFiles/PoolingPlugin.dir/trt_plugin.pb.cpp.o
    [ 50%] Building CXX object CMakeFiles/PoolingPlugin.dir/PoolingPlugin.cpp.o
    [ 62%] Building CXX object CMakeFiles/PoolingPlugin.dir/CuDnnPooling.cpp.o
    [ 75%] Building CXX object CMakeFiles/PoolingPlugin.dir/CopyPlugin.cpp.o
    [ 87%] Linking CUDA device code CMakeFiles/PoolingPlugin.dir/cmake_device_link.o
    [100%] Linking CXX shared module libPoolingPlugin.so
    [100%] Built target PoolingPlugin
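
After the rebuild, the library can be loaded from Python so that its plugin creators register themselves with TensorRT before an engine that uses the custom pooling layer is built or deserialized. This is a minimal sketch assuming the build path shown above; the actual plugin creator names come from the plugin sources.

    import ctypes
    import tensorrt as trt

    # Load the rebuilt shared library; its plugin creators register on load
    ctypes.CDLL("/torchvision2trt-samples/plugin/build/libPoolingPlugin.so")

    logger = trt.Logger(trt.Logger.WARNING)
    trt.init_libnvinfer_plugins(logger, "")

    # List the registered plugin creators; the pooling plugin should appear here
    registry = trt.get_plugin_registry()
    print([creator.name for creator in registry.plugin_creator_list])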
    

References