We need to check whether we can optimize the performance of ML models deployed on ODAHU as a web API,
because our customers report very poor performance on batch predictions (millions of rows) in comparison with local prediction (direct class invocation).
As a result we should have:
an optimized ODAHU stack without the model invoke API,
or
a proposal to change or add a new API for batch processing.
Generally we have the following layers on top of the original model:
original model – a Python class that is usually provided by an ML framework (scikit-learn, TensorFlow, etc.)
MLFlow flavor – our MLFlow toolchain works with original models wrapped in the MLFlow pyfunc flavor (https://github.com/odahu/odahu-trainer/blob/6c1b4d33f4bc755402f42e8b989ca5c6b811cdcc/mlflow/odahuflow/mlflowrunner/templates/entrypoint.py#L95)
GPPI wrapper – our MLFlow toolchain adds a layer on top of the MLFlow pyfunc flavor (https://github.com/odahu/odahu-trainer/blob/6c1b4d33f4bc755402f42e8b989ca5c6b811cdcc/mlflow/odahuflow/mlflowrunner/templates/entrypoint.py#L99)
HTTP API – the ODAHU packager adds an HTTP API + JSON parser in front of model invocation (https://github.com/odahu/odahu-packager/blob/058056694c3a71cdf1961bdd5f0dddc02e341050/packagers/docker/odahuflow/packager/rest/resources/odahuflow_handler.py#L231)
Docker network – the ODAHU packager packs the model into a Docker image, so we add Docker network overhead
Kubernetes network – additional network overhead after the Docker image is deployed into Kubernetes Knative
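To locate where the overhead sits, each layer can be timed in isolation before any network is involved. The sketch below is a minimal, hypothetical micro-benchmark: `DummyModel` stands in for the original model class, and the JSON round-trip approximates the serialization work the HTTP API layer performs per request (it is not the actual ODAHU handler code).

```python
import json
import time

# Hypothetical stand-in for the original model class; in ODAHU this would be
# e.g. a scikit-learn estimator wrapped in the MLFlow pyfunc flavor.
class DummyModel:
    def predict(self, rows):
        # Trivial per-row computation so the model itself is cheap.
        return [sum(row) for row in rows]

def time_direct(model, rows):
    """Layer 0: direct class invocation (the local-prediction baseline)."""
    start = time.perf_counter()
    model.predict(rows)
    return time.perf_counter() - start

def time_json_roundtrip(model, rows):
    """Approximates the HTTP API layer: serialize the request body to JSON,
    parse it back, predict, then serialize the response."""
    start = time.perf_counter()
    body = json.dumps({"columns": ["a", "b"], "data": rows})
    parsed = json.loads(body)
    result = model.predict(parsed["data"])
    json.dumps({"prediction": result})
    return time.perf_counter() - start

rows = [[float(i), float(i + 1)] for i in range(100_000)]
model = DummyModel()
t_direct = time_direct(model, rows)
t_json = time_json_roundtrip(model, rows)
print(f"direct: {t_direct:.4f}s, json round-trip: {t_json:.4f}s")
```

Running the same comparison against the real GPPI wrapper and the packager's JSON handler would show how much of the batch-prediction slowdown is serialization rather than network.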
==
Task:
Find where most of the overhead is located
Decide whether we can optimize (in case of suboptimal code, etc.)
Optimize, or report a proposal for API changes
Pre-conditions: we assume that the user should find a reasonable balance between the amount of data in the body of each API request and the number of such requests, in order to decrease network latency.
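The balance described above can be sketched as a chunked batch client: split the millions of rows into request-sized chunks, so each request body stays bounded while the request count stays manageable. The endpoint transport is injected as a function here (the real ODAHU model-invoke URL and payload schema are assumptions, so a fake in-process sender is used instead).

```python
import json

def chunked(rows, chunk_size):
    """Split a large batch into request-sized chunks."""
    for i in range(0, len(rows), chunk_size):
        yield rows[i:i + chunk_size]

def batch_predict(rows, send_request, chunk_size=10_000):
    """Send `rows` in chunks; `send_request` posts one JSON body to the
    model endpoint (injected so this sketch stays network-free)."""
    predictions = []
    for chunk in chunked(rows, chunk_size):
        body = json.dumps({"data": chunk})
        predictions.extend(send_request(body))
    return predictions

# Fake transport standing in for an HTTP POST to the model-invoke API.
def fake_send(body):
    rows = json.loads(body)["data"]
    return [sum(r) for r in rows]

rows = [[i, i + 1] for i in range(25_000)]
preds = batch_predict(rows, fake_send, chunk_size=10_000)
print(len(preds))
```

With `chunk_size=10_000`, the 25,000 rows go out as 3 requests; tuning that one parameter is exactly the request-size vs. request-count trade-off the pre-condition describes.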