The mlserver extension is used to run model in onnx format. mlserver doc
onnx/mlseerver_onnx/...
|
__init__.py
onnx.py
place to
- copy directory
$ cp /onnx mlserver/runtimes
- update Dockerfile
COPY ../onnx ./runtimes/onnx
- insert in pyproject.toml
mlserver-onnx = {path = "./runtimes/onnx", develop = true}