Check the v2_api branch for code that uses the newer TRITON API.
This repository hosts the code for the Healthcare on Tap series webinar titled "Deeper Dive into TensorRT and TRITON", recorded on 08/06/2020.
All tests were performed using:
- Docker version 19.03
- NVIDIA GPUs (RTX 8000 and V100) with driver 450.57
Use the startDocker.sh script as follows to mount a data directory and select GPU 2 for your tests. The current setup uses nvcr.io/nvidian/pytorch:20.06-py3 as the base image.
./startDocker.sh 2 <PATH_TO_DATA>
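The repository ships the actual startDocker.sh; the sketch below is only an assumption about the typical shape of such a launcher (the `--shm-size`, mount point, and port mapping are illustrative, not taken from the repo). It prints the docker command rather than executing it, so it can be inspected without Docker installed.

```shell
#!/usr/bin/env bash
# Hypothetical reconstruction of startDocker.sh.
# Arguments: $1 = GPU index (e.g. 2), $2 = host data directory to mount.
GPU_ID="${1:-2}"
DATA_DIR="${2:-$PWD/data}"
IMAGE="nvcr.io/nvidian/pytorch:20.06-py3"

# Build the docker command and print it (dry run) instead of running it.
CMD="docker run --rm -it --gpus device=${GPU_ID} --shm-size=8g \
-v ${DATA_DIR}:/workspace/data -p 8888:8888 ${IMAGE}"
echo "${CMD}"
```

The `--gpus device=N` flag pins the container to the GPU index passed as the first argument, matching the `./startDocker.sh 2 ...` invocation above.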
Once inside the container, use the following script to enable the GPU dashboards and start JupyterLab:
./start_jupyter_lab.sh
A separate container for the Triton server needs to be launched using the following script:
./start_triton_server.sh 2 <PATH_TO_MODEL_REPO>
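As with the client container, start_triton_server.sh is provided by the repo; this sketch only illustrates what such a launcher typically runs. The image tag, the `/models` mount point, the `--model-repository` flag, and the 8000/8001/8002 port mapping (HTTP, gRPC, metrics) are assumptions, not taken from the repository.

```shell
#!/usr/bin/env bash
# Hypothetical reconstruction of start_triton_server.sh.
# Arguments: $1 = GPU index, $2 = host path to the model repository.
GPU_ID="${1:-2}"
MODEL_REPO="${2:-$PWD/model_repo}"
IMAGE="nvcr.io/nvidia/tritonserver:20.06-py3"

# Dry run: print the command instead of executing it.
CMD="docker run --rm --gpus device=${GPU_ID} \
-p 8000:8000 -p 8001:8001 -p 8002:8002 \
-v ${MODEL_REPO}:/models ${IMAGE} tritonserver --model-repository=/models"
echo "${CMD}"
```

Port 8002 is the one the monitoring stack below would scrape for Prometheus-format metrics.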
To launch the Grafana dashboards for monitoring metrics, run docker-compose up
from the monitoring folder and navigate to localhost:3000/. Additional steps here.
The three notebooks in this repository walk through the example steps:
- NB1_PyTorch_TRT_ONNX_Inference: using TensorRT
- NB2_TRITON_ClientInference: using TRITON
- NB3_lung_segmentation_3d: a simple 3D example with a GraphDef backend
- To replicate the experiments, additional clients can be launched to test inference with multiple models. For example:
python sim_inference_req_triton.py --model model_cxr_onnx
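To drive the server with several models at once, the client invocation above can be fanned out in a loop. A minimal sketch, assuming the clients run inside the client container; only model_cxr_onnx is named in this repo, and any further names added to the list would be your own deployed models.

```shell
#!/usr/bin/env bash
# Launch one simulated-inference client per deployed model.
MODELS="model_cxr_onnx"   # extend with additional deployed model names

LAUNCHED=""
for model in ${MODELS}; do
  echo "launching client for ${model}"
  # Uncomment inside the client container to actually start a client:
  # python sim_inference_req_triton.py --model "${model}" &
  LAUNCHED="${LAUNCHED} ${model}"
done
# wait   # block until all background clients exit
```

Running the clients in the background with `&` and collecting them with `wait` lets the requests overlap, which is what exercises Triton's concurrent scheduling.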
This project is distributed under the MIT License.
The following tools were used as part of this code base and are governed by their respective license agreements. These are in addition to tools distributed within the NGC Docker containers (PyTorch / TRITON).
Any contributions to this repository are subject to the Contributor License Agreement.