Support API parallelism #590

ospillinger · 2019-11-20T02:02:14Z

Description

Replace Flask with another web framework (FastAPI?)

Motivation

Improve API concurrency, performance, and efficiency.

Additional Context

Currently, waitress serves requests with 4 threads (the default).
Ensure the readiness probe/health check is answered promptly; otherwise, pod status and load balancing may be affected (requests may start queuing in the load balancer if readiness probes do not respond in time).
https://www.reddit.com/r/MachineLearning/comments/dy8hjh/p_cortex_deploy_models_from_any_framework_as
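The readiness-probe concern above can be illustrated with a small standard-library sketch. It does not use waitress itself: `ThreadPoolExecutor` stands in for a fixed pool of 4 request threads, and `slow_predict`, `health_check`, and `probe_latency` are hypothetical names chosen for illustration. When every thread is busy with a slow prediction, the health check has to wait in the queue behind them.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def slow_predict(seconds: float) -> str:
    """Stand-in for a blocking model inference call (hypothetical)."""
    time.sleep(seconds)
    return "prediction"


def health_check() -> str:
    """Stand-in for a readiness probe handler; does no real work."""
    return "ok"


def probe_latency(pool_size: int, busy_requests: int, predict_seconds: float) -> float:
    """Time a health check submitted right after `busy_requests` slow predictions."""
    with ThreadPoolExecutor(max_workers=pool_size) as pool:
        for _ in range(busy_requests):
            pool.submit(slow_predict, predict_seconds)
        start = time.monotonic()
        pool.submit(health_check).result()
        return time.monotonic() - start


if __name__ == "__main__":
    # 4 threads, 4 slow requests: the probe waits until a thread frees up.
    print(f"saturated pool: {probe_latency(4, 4, 0.3):.2f}s")
    # 4 threads, 3 slow requests: one thread is idle, so the probe returns quickly.
    print(f"spare thread:   {probe_latency(4, 3, 0.3):.2f}s")
```

In the saturated case the probe latency is roughly the remaining prediction time, which is the queuing effect the note above warns about.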

Note from previous ticket #525:

Something like FastAPI would support multithreading, which may improve throughput

API refactor checklist

Revisit Python error wrapping
Expose multiple workers for parallelism
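As a rough illustration of the direction discussed here, below is a minimal sketch of a plain ASGI application, the interface that FastAPI and Starlette build on. It is not Cortex's implementation: the `/healthz` and `/predict` routes and the `blocking_predict` helper are assumptions made for the example. The blocking call is pushed to a thread so the event loop stays free to answer health checks.

```python
import asyncio
import json
import time


def blocking_predict(payload: bytes) -> dict:
    """Hypothetical stand-in for a blocking model call."""
    time.sleep(0.1)
    return {"received_bytes": len(payload)}


async def read_body(receive) -> bytes:
    """Collect the full request body from ASGI `http.request` messages."""
    body = b""
    while True:
        message = await receive()
        body += message.get("body", b"")
        if not message.get("more_body", False):
            return body


async def respond(send, status: int, payload: dict) -> None:
    """Send a JSON response as the two ASGI response messages."""
    data = json.dumps(payload).encode()
    await send({
        "type": "http.response.start",
        "status": status,
        "headers": [(b"content-type", b"application/json")],
    })
    await send({"type": "http.response.body", "body": data})


async def app(scope, receive, send) -> None:
    """Minimal ASGI application with a health route and a predict route."""
    if scope["type"] != "http":
        # Lifespan and websocket scopes are not handled in this sketch.
        raise RuntimeError("only HTTP is handled in this sketch")
    if scope["path"] == "/healthz":
        # Cheap route: answered directly on the event loop.
        await respond(send, 200, {"status": "ok"})
    elif scope["path"] == "/predict" and scope["method"] == "POST":
        body = await read_body(receive)
        loop = asyncio.get_running_loop()
        # Run the blocking call in the default thread pool executor.
        result = await loop.run_in_executor(None, blocking_predict, body)
        await respond(send, 200, result)
    else:
        await respond(send, 404, {"error": "not found"})
```

Assuming the file is saved as `app.py` (a hypothetical name), it could be served with several worker processes via `uvicorn app:app --workers 4`, which is one way the worker count might be exposed.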

kinoute · 2019-12-07T16:50:59Z

Starlette would be a nice alternative too. It seems faster than FastAPI:

https://fastapi.tiangolo.com/benchmarks/

ospillinger added the enhancement and research labels Nov 20, 2019

ospillinger added this to Prioritize in Cortex via automation Nov 20, 2019

deliahu moved this from Prioritize to Design in Cortex Nov 25, 2019

deliahu changed the title ~~Replace Flask~~ Replace Flask [.5] Nov 25, 2019

deliahu changed the title ~~Replace Flask [.5]~~ Replace Flask [0.5] Nov 25, 2019

deliahu assigned vishalbollu Nov 25, 2019

deliahu added the v0.12 label Nov 25, 2019

deliahu changed the title ~~Replace Flask [0.5]~~ Support API parallelism [0.5] Nov 29, 2019

deliahu moved this from Design to In progress in Cortex Dec 9, 2019

vishalbollu moved this from In progress to Prioritize in Cortex Dec 18, 2019

deliahu removed the v0.12 label Dec 19, 2019

deliahu changed the title ~~Support API parallelism [0.5]~~ Support API parallelism Dec 20, 2019

deliahu moved this from Prioritize to Design in Cortex Jan 22, 2020

deliahu added v0.14 and removed v0.14 labels Jan 22, 2020

vishalbollu mentioned this issue Mar 2, 2020

Update in-flight request counter, switch to FastAPI + Uvicorn #838

Merged

4 tasks

deliahu closed this as completed in #838 Mar 4, 2020

Cortex automation moved this from Design to Done Mar 4, 2020

deliahu added v0.14 and removed v0.15 labels Mar 20, 2020

deliahu added this to the v0.14 milestone Nov 26, 2020

ospillinger commented Nov 20, 2019 • edited by vishalbollu