dl_model_infer

Introduce

This project is modified based on the AIInfer, Tanks for this project.

This is a c++ version of the AI reasoning library. Currently, it only supports the reasoning of the tensorrt model. The follow-up plan supports the c++ reasoning of frameworks such as Openvino, NCNN, and MNN. There are two versions for pre- and post-processing, c++ version and cuda version. It is recommended to use the cuda version., This repository provides accelerated deployment cases of deep learning CV popular models, and cuda c supports dynamic-batch image process, infer, decode, NMS.

Update

2023.05.27 update yolov5、yolov7、yolov8、yolox
2023.05.28 update rt_detr
2023.06.01 update yolov8_seg、yolov8_pose
2023.06.09 update yolov7_cutoff
2023.06.14 update yolov7-pose
2023.06.15 Adding Producer-Consumer Inference Model for yolov8-det
2023.06.24 update 3D objection detection algorithm smoke
2023.09.06 update deploy for detr in mmdetection
2024.01.26 update yolov8-obb
2024.02.06 update depth-anything
2024.02.12 update yolop & yolopv2
2024.02.26 update yolov9

Environment

The following environments have been tested：

ubuntu16.04
cuda11.1
cudnn8.6.0
TensorRT-8.5.1.7
gcc5.4.0
cmake-3.24.0
opencv-4.5.5
Eigen3
yaml

You can also use docker, How to use it is as follows:

docker pull longxiaowyh/dl_model_infer:v1.0
nvidia-docker run -itu root:root --name dl_model_infer --gpus all -v /your_path:/target_path -v /tmp/.X11-unix/:/tmp/.X11-unix/ -e DISPLAY=unix$DISPLAY -e GDK_SCALE -e GDK_DPI_SCALE  -e NVIDIA_VISIBLE_DEVICES=all -e NVIDIA_DRIVER_CAPABILITIES=compute,utility --shm-size=64g longxiaowyh/dl_model_infer:v1.0 /bin/bash

Model Export Tutorial

RT-DETR model export tutorial
- https://zhuanlan.zhihu.com/p/623794029
Yolov8 model export tutorial
- https://github.com/yhwang-hub/dl_model_infer/tree/master/application/yolov8_app/README.md
Yolov5 model export tutorial
- https://github.com/ultralytics/yolov5
Yolov7 model export tutorial
- https://github.com/WongKinYiu/yolov7
yolov7_cutoff model export tutorial
- https://github.com/yhwang-hub/dl_model_deploy/tree/master/yolov7_cutoff_TensorRT
yolov7-pose model export tutorial
- https://github.com/yhwang-hub/dl_model_infer/tree/master/application/yolov7_pose_app
smoke model export tutorial
- https://github.com/yhwang-hub/dl_model_infer/blob/dev/application/smoke_det_app/README.md
DETR model export tutorial
- Put the workspaces/detr_pytorch2onnx.py file under the mmdetection path.
- Modify the config_file and checkpoint_file paths in the detr_pytorch2onnx.py file.
- Use the detr_pytorch2onnx.py file to generate onnx file.
- Use trtexec to generate engine files.
DepthAnything model export tutorial
- https://github.com/spacewalk01/depth-anything-tensorrt
- https://github.com/daniel89710/trt-depth-anything
YOLOP model export tutorial
- Refer to the workspace/yolop_model_compile.sh file
Yolov9 model export tutorial
- https://github.com/yhwang-hub/dl_model_infer/tree/master/application/yolov9_app/README.md

Use of CPM (wrapping the inference as producer-consumer)

cpm.hpp Producer-consumer model
- For direct inference tasks, cpm.hpp can be turned into an automatic multi-batch producer-consumer model

cpm::Instance<BoxArray, Image, yolov8_detector> cpmi;
auto result_futures = cpmi.commits(yoloimages);
for (int ib = 0; ib < result_futures.size(); ++ib)
{
    auto objs = result_futures[ib].get();
    auto image = images[ib].clone();
    for (auto& obj : objs)
    {
        process....
    }
}

Quick Start

Take yolov8 target detection as an example，modify CMakeLists.txt and run the command below：

git clone git@github.com:yhwang-hub/dl_model_infer.git
cd dl_model_infer
mkdir build && cd build
cmake .. && make
cd ../workspaces
./infer -f yolov8n.transd.trt -i res/dog.jpg -b 10 -c 10 -o cuda_res -t yolov8_det

You can also use a script to execute，The above instructions are written in the compile_and_run.sh script，for example：

rm -rf build && mkdir -p build && cd build && cmake ..  && make -j9 && cd ..
# mkdir -p build && cd build && cmake ..  && make -j48 && cd ..
cd workspaces

rm -rf cuda_res/*

# ./infer -f yolov8n.transd.trt -i res/dog.jpg -b 10 -c 10 -o cuda_res -t yolov8_det

cd ..

Then execute the following command to run

cd dl_model_infer
bash compile_and_run.sh

Project directory introduction

AiInfer
   |--application # Implementation of model inference application, your own model inference can be implemented in this directory
     |--yolov8_det_app # Example: A yolov8 detection implemented
     |--xxxx
   |--utils # tools directory
     |--backend # here implements the reasoning class of backend
     |--common # There are some commonly used tools in it
       |--arg_parsing.h # Command line parsing class, similar to python's argparse
       |--cuda_utils.h # There are some common tool functions of cuda in it
       |--cv_cpp_utils.h # There are some cv-related utility functions in it
       |--memory.h # Tools related to cpu and gpu memory application and release
       |--model_info.h # Commonly used parameter definitions for pre- and post-processing of the model, such as mean variance, nms threshold, etc.
       |--utils.h # Commonly used tool functions in cpp, timing, mkdir, etc.
       |--cpm.h # Producer-Consumer Inference Model
     |--post_process # Post-processing implementation directory, cuda post-processing acceleration, if you have custom post-processing, you can also write it here
     |--pre_process # pre-processing implementation directory, cuda pre-processing acceleration, if you have custom pre-processing can also be written here
     |--tracker # This is the implementation of the target detection and tracking library, which has been decoupled and can be deleted directly if you don’t want to use it
   |--workspaces # Working directory, where you can put some test pictures/videos, models, and then directly use the relative path in main.cpp
   |--mains # This is the collection of main.cpp, where each app corresponds to a main file, which is easy to understand, and it is too redundant to write together
   |--main.cpp # Project entry

Speed Test

Tested on Jetson Orin, the test includes the entire process (image preprocessing + model inference + post-processing decoding)、

Model	Precision	Resolution	FPS(bs=1)
rtdetr_r50	FP16	640x640	19
yolov8n	FP16	640x640	126
yolov8n-seg	FP16	640x640	92
yolov8s-pose	FP16	640x640	58
yolov8s-obb	FP16	1024x1024	38
yolov5s	FP16	640x640	92
yolov7	FP16	640x640	34
yolov7_cutoff	FP16	640x640	32
yolov7-w6-pose	FP16	960x960	22
yolox_s	FP16	640x640	91
detr	FP16	800x1190	18
depth_anything_vits14	FP16	518x518	19
yolop	FP16	640x640	40
yolopv2	FP16	480x640	26

onnx downloads

model	baiduyun
yolov5	链接: https://pan.baidu.com/s/1Bwwo8--JS8Vkw6METz2dIw 提取码: 47ax
yolov7	链接: https://pan.baidu.com/s/1gb0W177xhnrseJF6CfdMEA 提取码: rvg5
yolov7_cutoff	链接: https://pan.baidu.com/s/16bKgt_DWNmk26q-utLyCfA 提取码: q7kf
yolov8	链接: https://pan.baidu.com/s/18Cm-tN21cus3XyirqLE_eg 提取码: j8br
yolov8-seg	链接: https://pan.baidu.com/s/1s2Gp_Jedhi9-p_Z2utJV-Q 提取码: wr5t
yolov8-pose	链接: https://pan.baidu.com/s/1lP8kiKu2a6h_FAZSSkhUgg 提取码: 7p6a
yolox	链接: https://pan.baidu.com/s/1U0gzW_YMbvNMtKzo4_cluA 提取码: 4xct
rt-detr	链接: https://pan.baidu.com/s/1Ft0-ewuCTK2BxTS1q1Vdtw 提取码: ekms
yolov7-pose	链接: https://pan.baidu.com/s/1uI6u5oKDrnroluQufIWF7w 提取码: ed9r
DETR	链接: https://pan.baidu.com/s/1_PQVPKy0QiFWJaB7HhyTSg 提取码: j7fs
yolov8-obb	链接: https://pan.baidu.com/s/1bMZuZPtTNjo5tl5heOdJRw 提取码: fudj
Depth-Anything	链接: https://pan.baidu.com/s/1ZiE1AVvpB6owND5wEqHwMw 提取码: cp97
YOLOP	链接: https://pan.baidu.com/s/1Q0itb-TMoYpx27x0stwUag 提取码: rc3p
YOLOPV2	链接: https://pan.baidu.com/s/19sNBrwIx2TAD7iOJtZrDuA 提取码: 15vm
YOLOV9	链接: https://pan.baidu.com/s/1Cotnig9BgeJt7gAW-Vy8Pg 提取码: n4hm

Reference

Thanks for the following items

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

application

application

mains

mains

utils

utils

workspaces

workspaces

CMakeLists.txt

CMakeLists.txt

README.md

README.md

compile_and_run.sh

compile_and_run.sh

main.cpp

main.cpp

Repository files navigation

dl_model_infer

Introduce

Update

Environment

Model Export Tutorial

Use of CPM (wrapping the inference as producer-consumer)

Quick Start

Project directory introduction

Speed Test

onnx downloads

Reference

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
application		application
mains		mains
utils		utils
workspaces		workspaces
CMakeLists.txt		CMakeLists.txt
README.md		README.md
compile_and_run.sh		compile_and_run.sh
main.cpp		main.cpp

yhwang-hub/dl_model_infer

Folders and files

Latest commit

History

Repository files navigation

dl_model_infer

Introduce

Update

Environment

Model Export Tutorial

Use of CPM (wrapping the inference as producer-consumer)

Quick Start

Project directory introduction

Speed Test

onnx downloads

Reference

About

Resources

Stars

Watchers

Forks

Languages