
Make TorchServe multi framework #1208

Closed
msaroufim opened this issue Aug 19, 2021 · 7 comments · Fixed by #1857

Comments

@msaroufim
Member

msaroufim commented Aug 19, 2021

We've been assuming so far that TorchServe can only work with PyTorch eager mode or TorchScripted models, but our current handler is general enough to make it possible to support ONNX models.

The idea is a hack that one of our partners mentioned, which involves:

  1. Adding onnx as a dependency in the Dockerfile or requirements.txt
  2. Loading the ONNX model in the initialize handler
  3. Making an inference in the inference handler

It may not necessarily be the best way to serve ONNX models, but it lets people avoid having to use a different serving infrastructure for each model type. A minimal sketch of such a handler is shown below.
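
As a rough illustration, such a handler might look like the following. This is only a sketch: it assumes onnxruntime is installed and that the .onnx file is the serialized file packaged with torch-model-archiver; the `OnnxHandler` name and the preprocessing are illustrative, not an official TorchServe API.

```python
# Minimal sketch of a custom TorchServe handler backed by onnxruntime.
# Assumes `onnxruntime` is listed in requirements.txt and the .onnx file
# is the serialized file referenced in the model archive's manifest.
import numpy as np
import onnxruntime as ort
from ts.torch_handler.base_handler import BaseHandler


class OnnxHandler(BaseHandler):  # illustrative name, not a built-in handler
    def initialize(self, context):
        # Locate the .onnx file that was packaged with torch-model-archiver
        model_dir = context.system_properties.get("model_dir")
        serialized_file = context.manifest["model"]["serializedFile"]
        self.session = ort.InferenceSession(f"{model_dir}/{serialized_file}")
        self.input_name = self.session.get_inputs()[0].name
        self.initialized = True

    def preprocess(self, data):
        # Illustrative: assume each request row carries a raw float32 payload
        batch = [
            np.frombuffer(row.get("data") or row.get("body"), dtype=np.float32)
            for row in data
        ]
        return np.stack(batch)

    def inference(self, inputs):
        # onnxruntime consumes and returns plain numpy arrays
        return self.session.run(None, {self.input_name: inputs})[0]

    def postprocess(self, outputs):
        return outputs.tolist()
```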

This is a good level 3-4 bootcamp task. The goal would be to:

  1. Get a PyTorch model like ResNet-18
  2. Export it using the ONNX exporter (a minimal export sketch follows this list)
  3. Run inference with it in an ONNX handler and submit it as an example in this repo
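
For step 2, the export itself is essentially one call to torch.onnx.export. A minimal sketch (the output file name and opset version are arbitrary choices, not part of the task):

```python
# Export a torchvision ResNet-18 to ONNX with a fixed 1x3x224x224 input.
import torch
import torchvision

model = torchvision.models.resnet18(pretrained=True).eval()
dummy_input = torch.randn(1, 3, 224, 224)

torch.onnx.export(
    model,
    dummy_input,
    "resnet18.onnx",          # output path, arbitrary choice
    input_names=["input"],
    output_names=["output"],
    opset_version=11,         # any reasonably recent opset works
)
```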
@msaroufim added the good first issue label on Aug 20, 2021
@ozancaglayan
Contributor

ozancaglayan commented Aug 11, 2022

Hi,

Why was this closed as completed, i.e. is there a doc/example for ONNX?

@msaroufim
Member Author

Hi @ozancaglayan, not quite; we're now tracking this item in #1631. @HamidShojanazeri has a promising proposal there to package configurations using the torch-model-archiver, so please feel free to leave any feedback on that issue. Thanks!

@amit-cashify

@msaroufim we are also working on serving yolov7 using either ONNX or TensorRT through TorchServe. Are there any clear best practices for that?

Repo: https://github.com/WongKinYiu/yolov7/tree/main/deploy/triton-inference-server

cc @saurav-cashify @abhinav-cashify

@msaroufim msaroufim reopened this Aug 25, 2022
@amit-cashify

amit-cashify commented Aug 26, 2022

@msaroufim I understand that it is possible to use TorchServe with ONNX and TensorRT. Is it encouraged or discouraged?

Should one expect better support moving forward, or will TorchServe remain focused only on native PyTorch and TorchScript model serving, with a platform like Triton being a better choice for deploying different model flavors?

@msaroufim
Member Author

Hi @amit-cashify, we want to encourage more use of ONNX and TensorRT, and I'm personally working on making this as easy to use as possible. It took a while because we had a couple of proposals floating around in #1631, but I think I have a better one. I will experiment with it, run some benchmarks starting next week, and keep you posted on progress.

@joaquincabezas

Hello @msaroufim

Thanks for your initiative! Would love to see TorchServe serving ONNX "out-of-the-box". Any feedback on those benchmarks?

@msaroufim
Member Author

This was just merged and will be featured in the next release today.

@msaroufim removed the good first issue label on Nov 14, 2022
@msaroufim linked a pull request on Nov 14, 2022 that will close this issue