
Support dynamic batch inference with onnx/onnxruntime #45

Closed
zhiqwang opened this issue Feb 1, 2021 · 3 comments · Fixed by #193
Labels: deployment (Inference acceleration for production), enhancement (New feature or request), help wanted (Extra attention is needed)

Comments

zhiqwang (Owner) commented Feb 1, 2021

🚀 Feature

Support dynamic batch inference with onnx/onnxruntime.

Motivation

As @makaveli10 pointed out in #39 (comment), the current implementation of the onnx/onnxruntime mechanism only supports dynamic shape inference, not dynamic batch sizes.

I don't know how to implement dynamic batch inference yet; any help is welcome here.
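
For reference, the usual way to get a dynamic batch dimension from torch.onnx.export is the dynamic_axes argument. Below is a minimal sketch; the model, tensor names, and opset version are illustrative rather than what yolort currently exports:

import torch
import torchvision

# Illustrative model only; yolort's real export path wraps YOLOv5, not torchvision.
model = torchvision.models.resnet18().eval()
dummy_input = torch.randn(1, 3, 224, 224)

torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",
    input_names=["images"],
    output_names=["outputs"],
    # Marking dim 0 as a symbolic axis lets onnxruntime accept any batch size.
    dynamic_axes={"images": {0: "batch"}, "outputs": {0: "batch"}},
    opset_version=11,
)

With the symbolic batch axis in place, onnxruntime will accept inputs of shape (N, 3, 224, 224) for any N, provided every operator in the exported graph is itself batch-agnostic.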

zhiqwang added the enhancement (New feature or request) and help wanted (Extra attention is needed) labels on Feb 1, 2021
makaveli10 commented

Thanks. I'll run some experiments and see how it goes.

timmh commented Aug 3, 2022

I followed the ONNX deployment walkthrough and ran export_onnx(..., skip_preprocess=True). However, during inference with PredictorORT(weights).predict(inputs) I get the following error when trying to use a batch size other than 1:

onnxruntime.capi.onnxruntime_pybind11_state.Fail: [ONNXRuntimeError] : 1 : FAIL : Non-zero status code returned while running Split node. Name:'Split_48' Status Message: Cannot split using values in 'split' attribute. Axis=0 Input shape={10,3,450,600} NumOutputs=1 Num entries in 'split' (must equal number of outputs) was 1 Sum of sizes in 'split' (must equal size of selected axis) was 1

This error occurs regardless of the batch_size parameter used during tracing. The model I export is just a vanilla YOLOv5 model.
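
For context, a minimal sketch of the reproduction described above, assuming the export_onnx and PredictorORT entry points quoted in this thread; the import paths, keyword names other than skip_preprocess, and the checkpoint/shape values are assumptions for illustration:

import numpy as np
from yolort.runtime import PredictorORT             # assumed import path
from yolort.runtime.ort_helper import export_onnx   # assumed import path

# Export without the built-in pre-processing, as described above
# (file names and keyword names other than skip_preprocess are illustrative).
export_onnx(onnx_path="yolov5s.onnx", checkpoint_path="yolov5s.pt", skip_preprocess=True)

detector = PredictorORT("yolov5s.onnx")

# A single image works; a batch of 10 triggers the Split error quoted above.
batch = np.random.rand(10, 3, 450, 600).astype(np.float32)
detections = detector.predict(batch)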

zhiqwang (Owner, Author) commented Aug 3, 2022

Hi @timmh, the default example only supports ONNX models exported with preprocessing. Could you please open a new ticket about inference without preprocessing, so that this issue is easier to track?
