
How to deploy models where the shape of output tensor is not known #5

Closed
srihari-humbarwadi opened this issue Nov 29, 2018 · 6 comments

srihari-humbarwadi commented Nov 29, 2018

I have a TensorFlow frozen graph of an object detection model. I am unclear about how to create a config.pbtxt file for this model, since I cannot determine the output shapes beforehand, and I cannot start the inference server without the "dims" specified. I wanted to know how I can create a config file for this:

name: "NF1"
    platform: "tensorflow_graphdef"
    max_batch_size: 16
    
    input [
      {
        name: "image_tensor"
        data_type: TYPE_UINT8
        format: FORMAT_NHWC
        dims: [ 1024, 800, 3 ]
      }
    ]
    
    output [
      {
        name: "num_detections"
        data_type: TYPE_FP32
        dims: [ 300 ]
      },

      {
        name: "detection_boxes"
        data_type: TYPE_FP32
        dims: [ 300, 4  ]
      },

      {
        name: "detection_scores"
        data_type: TYPE_FP32
        dims: [ 300 ]        
      },

      {
        name: "detection_classes"
        data_type: TYPE_FP32
        dims: [ 300 ]        
      }
    ]
    instance_group [    
      {
        gpus: [ 0 ]
      },
      {
        gpus: [ 1 ]
      },
      {
        gpus: [ 2 ]
      },
      {
        gpus: [ 3 ]
      }                  
    ]    
    dynamic_batching {
      preferred_batch_size: [ 16 ]
      max_queue_delay_microseconds: 100
    }

This is my config, which does not work. I tried fixing the shape to the maximum number of proposals, i.e. 300, which I knew wouldn't work.

dcyoung commented Nov 29, 2018

Did you solve this issue? And if so, could you share your solution?

I am also interested in how to serve models with variably sized outputs.

srihari-humbarwadi (Author)

I was wrong; the output shape is not variable, as there is an upper bound on the number of objects detected. So just set the dims to this upper bound. That should work fine.
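
For what it's worth, a minimal client-side sketch of consuming such padded outputs, assuming the usual TF Object Detection API convention where num_detections reports how many of the fixed-size rows are valid (the placeholder arrays here stand in for whatever the inference client actually returns):

    import numpy as np

    # Placeholder outputs for one image, padded to the fixed upper bound (300)
    # and shaped per the config above; in practice these come from the client.
    num_detections = np.array([7.0], dtype=np.float32)
    detection_boxes = np.zeros((300, 4), dtype=np.float32)
    detection_scores = np.zeros((300,), dtype=np.float32)
    detection_classes = np.zeros((300,), dtype=np.float32)

    # Everything past num_detections is padding, so slice it off.
    n = int(num_detections[0])
    boxes = detection_boxes[:n]
    scores = detection_scores[:n]
    classes = detection_classes[:n].astype(np.int64)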

dcyoung commented Nov 30, 2018

Thanks for getting back, @srihari-humbarwadi. Defining an upper bound seems fine because your model type returns a fixed-size tensor, but I'm still curious whether variable-sized outputs are supported in tensorrt-inference-server. Perhaps a dev can point me to the relevant docs?

For context:
I'm assuming your model (possibly from here?) outputs tensors of fixed size, with the intention that boxes be ignored based on the associated score.

However, returning a fixed-size output is not ideal for performance reasons. While it doesn't matter much for simple result types, consider the case where the served model is a Mask R-CNN and the return type includes a pixel mask for each detected object. Without an output signature with variable-sized tensors, the payload size would be worst-case for every response. I like to support variable outputs to reduce the payload for the common case (where fewer than the maximum number of objects are detected). For tf-serving, this involved modifying the output before exporting a saved model, such that the return type only includes results for objects whose score exceeds some threshold, as sketched below.
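
A minimal sketch of that export-time filtering, assuming TF1-style graph-mode tensors with the usual detection output shapes (the threshold value and the function name are illustrative, not part of any exported model):

    import tensorflow as tf

    def filter_by_score(boxes, scores, classes, threshold=0.5):
        # boxes: [N, 4], scores: [N], classes: [N]; N is the padded upper bound.
        # tf.where on a boolean vector yields the indices of the True entries.
        keep = tf.where(scores > threshold)[:, 0]
        # Gather only the surviving rows, producing variable-length outputs,
        # so the exported signature no longer pads to the worst case.
        return (tf.gather(boxes, keep),
                tf.gather(scores, keep),
                tf.gather(classes, keep))

These filtered tensors would then be wired into the SavedModel signature in place of the padded ones before export.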

Is this behavior supported in tensorrt-inference-server?

deadeyegoodwin (Contributor)

TRTIS only supports a variable-sized dimension for batching, but this is a common request, so we are planning on fixing it. Issue #8 is tracking this request; add upvotes there to indicate that you are interested in it.
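
(For reference: a sketch of what the feature tracked in #8 looks like once supported, with -1 as a wildcard dimension in config.pbtxt. This is how later releases of the server express it, not something the server accepted at the time of this thread:

    output [
      {
        name: "detection_boxes"
        data_type: TYPE_FP32
        dims: [ -1, 4 ]
      }
    ]
)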

dcyoung commented Nov 30, 2018

Thanks, @deadeyegoodwin!

tilaba commented Dec 25, 2018

Hello, have you solved it?
