Name		Name	Last commit message	Last commit date
parent directory ..
docs		docs
scripts		scripts
src		src
CMakeLists.txt		CMakeLists.txt
Dockerfile		Dockerfile
Dockerfile_CUDA_11_2		Dockerfile_CUDA_11_2
Dockerfile_CUDA_11_2_TRT_8_5_PADDLE_2_4_2		Dockerfile_CUDA_11_2_TRT_8_5_PADDLE_2_4_2
Dockerfile_CUDA_11_4_TRT_8_4		Dockerfile_CUDA_11_4_TRT_8_4
Dockerfile_cpu		Dockerfile_cpu
Dockerfile_ipu		Dockerfile_ipu
Dockerfile_xpu		Dockerfile_xpu
Dockerfile_xpu_encrypt_auth		Dockerfile_xpu_encrypt_auth
README.md		README.md
README_CN.md		README_CN.md

README.md

简体中文 | English

FastDeploy Serving Deployment

Introduction

FastDeploy builds an end-to-end serving deployment based on Triton Inference Server. The underlying backend uses the FastDeploy high-performance Runtime module and integrates the FastDeploy pre- and post-processing modules to achieve end-to-end serving deployment. It can achieve fast deployment with easy-to-use process and excellent performance.

FastDeploy also provides an easy-to-use Python service deployment method, refer PaddleSeg deployment example for its usage.

Prepare the environment

Environment requirements

Linux
If using a GPU image, NVIDIA Driver >= 470 is required (for older Tesla architecture GPUs, such as T4, the NVIDIA Driver can be 418.40+, 440.33+, 450.51+, 460.27+)

Obtain Image

CPU Image

CPU images only support Paddle/ONNX models for serving deployment on CPUs, and supported inference backends include OpenVINO, Paddle Inference, and ONNX Runtime

docker pull registry.baidubce.com/paddlepaddle/fastdeploy:1.0.4-cpu-only-21.10

GPU Image

GPU images support Paddle/ONNX models for serving deployment on GPU and CPU, and supported inference backends including OpenVINO, TensorRT, Paddle Inference, and ONNX Runtime

docker pull registry.baidubce.com/paddlepaddle/fastdeploy:1.0.4-gpu-cuda11.4-trt8.5-21.10

Users can also compile the image by themselves according to their own needs, referring to the following documents:

FastDeploy Serving Deployment Image Compilation

Task	Model
Classification	PaddleClas
Detection	PaddleDetection
Detection	ultralytics/YOLOv5
NLP	PaddleNLP/ERNIE-3.0
NLP	PaddleNLP/UIE
Speech	PaddleSpeech/PP-TTS
OCR	PaddleOCR/PP-OCRv3

Files

serving

Directory actions

More options

Directory actions

More options

Latest commit

History

serving

Folders and files

parent directory