
Nvidia Triton model packaging feature #437

Closed
karbyshevds opened this issue Nov 20, 2020 · 0 comments · Fixed by odahu/odahu-packager#36
Labels
1.4 · feature [Added] for new features. · WPM

Comments

@karbyshevds
Contributor

As a Data Scientist, I want to be able to package (containerize) a trained model for Nvidia Triton or another inference server and deploy it using ODAHU deployment.
A deployed model should be able to accept tensors; currently it can only accept JSON, which has to be de-serialized before the data is passed to the model.
Triton can accept raw data as model input and provides features such as micro-batching and GPU sharing, which can also help reduce costs.
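
For illustration only, here is a minimal sketch of what sending a raw tensor to a Triton-served model could look like from the client side, using the official `tritonclient` Python package. The model name `triton-packed-model`, the tensor names `input__0`/`output__0`, the shape, and the endpoint `localhost:8000` are hypothetical placeholders, not anything defined by ODAHU or this issue:

```python
# Sketch: send a raw FP32 tensor to a Triton-served model instead of JSON.
# Assumes Triton's HTTP endpoint is reachable at localhost:8000 and that a
# (hypothetical) model "triton-packed-model" declares tensors named
# "input__0" / "output__0" in its config.pbtxt.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# Raw tensor data -- no JSON serialization on the caller side.
batch = np.random.rand(1, 4).astype(np.float32)

infer_input = httpclient.InferInput("input__0", list(batch.shape), "FP32")
infer_input.set_data_from_numpy(batch, binary_data=True)

requested_output = httpclient.InferRequestedOutput("output__0", binary_data=True)

result = client.infer(
    model_name="triton-packed-model",
    inputs=[infer_input],
    outputs=[requested_output],
)
print(result.as_numpy("output__0"))
```

Micro-batching (dynamic batching) and GPU sharing are configured on the Triton server side (e.g. in the model's config.pbtxt), not in client code.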

@karbyshevds added the labels 1.4, feature [Added] for new features., and WPM on Nov 20, 2020
@karbyshevds added this to Backlog in odahu-kanban via automation on Nov 20, 2020
@karbyshevds moved this from Backlog to In development in odahu-kanban on Nov 20, 2020
odahu-kanban automation moved this from In development to In QA on Jan 11, 2021
@vlad-tokarev reopened this on Feb 8, 2021
odahu-kanban automation moved this from In QA to To Do on Feb 8, 2021
@vlad-tokarev moved this from To Do to In QA in odahu-kanban on Feb 8, 2021
@BPylypenko moved this from In QA to Done in odahu-kanban on Feb 10, 2021