The ability for users to serve their models with NVIDIA's TensorRT optimization framework could be useful.
At present, TensorRT ships hardwired with its own REST server. We would need to modify this to use our microservice prediction API instead.
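As a rough illustration, a minimal sketch of what such a wrapper might look like, assuming the seldon-core Python wrapper convention (a user-defined class exposing `predict()`) and the TensorRT/pycuda Python APIs of the TensorRT 7/8 era; exact calls vary by TensorRT version, and the names `TensorRTModel`, `engine_path`, and `OUTPUT_SHAPE` are illustrative placeholders, not anything defined in this issue:

```python
import numpy as np
import tensorrt as trt
import pycuda.autoinit  # creates a CUDA context on import
import pycuda.driver as cuda

# Illustrative placeholder: the output shape is model-specific.
OUTPUT_SHAPE = (1, 1000)


class TensorRTModel:
    """Seldon-style model class delegating inference to a TensorRT engine."""

    def __init__(self, engine_path="model.plan"):  # illustrative path
        logger = trt.Logger(trt.Logger.WARNING)
        with open(engine_path, "rb") as f:
            self.engine = trt.Runtime(logger).deserialize_cuda_engine(f.read())
        self.context = self.engine.create_execution_context()

    def predict(self, X, features_names=None):
        # Copy input to device, run the engine, copy the result back.
        X = np.ascontiguousarray(X, dtype=np.float32)
        out = np.empty(OUTPUT_SHAPE, dtype=np.float32)
        d_in = cuda.mem_alloc(X.nbytes)
        d_out = cuda.mem_alloc(out.nbytes)
        cuda.memcpy_htod(d_in, X)
        self.context.execute_v2(bindings=[int(d_in), int(d_out)])
        cuda.memcpy_dtoh(out, d_out)
        return out
```

The point of this shape is that the Seldon microservice wrapper, rather than TensorRT's built-in server, would own the REST/gRPC surface, with TensorRT used purely as the in-process inference backend.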
https://developers.googleblog.com/2018/03/tensorrt-integration-with-tensorflow.html
This shows we need a better understanding of the blockers to using TensorRT before we do any work within seldon-core.
In Kubeflow we would like to create an example for serving with GPUs: kubeflow/examples#145
We have an example notebook for TensorRT now.