feat(ray): support containerized model serving #114

heiruwu · 2024-03-13T17:02:56Z

Because

We are going to support containerized model serving with Instill Model

This commit

add deployment handle return
add modules for building and pushing model image
expose cpu/gpu/memory resource allocation configs
update `instill_deployable` decorator

heiruwu and others added 8 commits March 14, 2024 01:00

feat(ray): add deployment handle return for ray CLI (#107)

18044b3

Because
- `model-backend` needs the `Ray` CLI to deploy dockerized applications

This commit
- return deployment handle for the CLI to reference

feat(ray): support containerized model build and push (#108)

7c42ae7

Because
- we need to provide an easy-to-use script for users to build and push a containerized model to the desired registry

This commit
- add `docker` dependency
- add `build` module script for easy image building and pushing

fix(dockerfile): copy all files in the same model dir (#109)

db795c2

Because
- we need to copy model weight files along with the config and `model.py`

This commit
- update `dockerfile` to copy all files in the same directory

feat(ray): expose cpu/gpu resource allocation config (#110)

c263c9b

Because
- it is not practical to determine VRAM usage solely from model file size

This commit
- expose cpu/gpu/ram resource allocation config to the user
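For illustration, here is a minimal sketch of how user-supplied resource settings could be turned into Ray actor options. `ResourceConfig` and its field names are hypothetical and are not taken from this PR; only the `num_cpus`, `num_gpus`, and `memory` keys are the ones Ray accepts in `ray_actor_options`.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ResourceConfig:
    """Hypothetical holder for user-declared resources (not the SDK's class)."""

    num_cpus: float = 1.0
    num_gpus: float = 0.0  # fractional GPUs are allowed by Ray
    memory_bytes: Optional[int] = None  # None leaves the memory request unset

    def to_ray_actor_options(self) -> dict:
        """Build a dict suitable for Ray's `ray_actor_options`."""
        if self.num_cpus < 0 or self.num_gpus < 0:
            raise ValueError("resource counts must be non-negative")
        options = {"num_cpus": self.num_cpus, "num_gpus": self.num_gpus}
        if self.memory_bytes is not None:
            options["memory"] = self.memory_bytes
        return options


# Declare resources explicitly instead of inferring them from model file size.
options = ResourceConfig(
    num_cpus=2, num_gpus=0.5, memory_bytes=4 * 1024**3
).to_ray_actor_options()
```

The resulting dict could then be passed when configuring a Ray Serve deployment; how the SDK wires it in is not shown in this PR text.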

fix(ray): add forcerm to avoid missing packages

5499886

fix(ray): show build logs and add pip timeout (#111)

727c9d3

Because
- it is hard to know what went wrong without build logs
- pip install tends to time out when installing large packages

This commit
- print build logs
- add default timeout for pip package installation
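The two changes above can be sketched as follows. This is an assumption-laden illustration, not the code from this commit: it assumes build output arrives as decoded JSON chunks with `stream` and `error` keys (the shape the Docker build API emits), and it uses pip's `--default-timeout` option. The function names and the default of 1000 seconds are hypothetical.

```python
from typing import Iterable, List


def format_build_log(chunks: Iterable[dict]) -> List[str]:
    """Collect readable log lines from decoded Docker build output chunks."""
    lines = []
    for chunk in chunks:
        if "error" in chunk:
            # Surface the failure instead of silently ending the build.
            raise RuntimeError(f"image build failed: {chunk['error']}")
        text = chunk.get("stream", "").rstrip()
        if text:  # skip empty chunks and non-log chunks such as "aux"
            lines.append(text)
    return lines


def pip_install_command(packages: List[str], timeout_seconds: int = 1000) -> List[str]:
    """Build a pip install command with a raised socket timeout."""
    return ["pip", "install", f"--default-timeout={timeout_seconds}", *packages]


# Example with hand-written chunks; a real build would stream these.
for line in format_build_log([{"stream": "Step 1/3 : FROM python\n"}, {"aux": {}}]):
    print(line)
```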

docs(readme,notebook): update description (#112)

20992af

Because
- we removed the restriction on the model folder structure after deprecating Triton model support

This commit
- update custom model guide

feat(ray): separate build and push functionality (#113)

4c5da04

Because
- users may want to push to multiple registries, so defining the registry in `instill.yaml` is undesirable

This commit
- separate `build` and `push` script
- remove `registry` from model config
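To show why separating the two steps helps, here is a rough sketch that builds an image once and then derives tag and push commands for several registries. It only assembles standard `docker build`, `docker tag`, and `docker push` command lines; the helper names, image name, and registry hosts are made up for the example and do not reflect the SDK's actual scripts.

```python
from typing import List


def build_command(image: str, context_dir: str = ".") -> List[str]:
    """Command that builds the image once, independent of any registry."""
    return ["docker", "build", "--tag", image, context_dir]


def push_commands(image: str, registries: List[str]) -> List[List[str]]:
    """Commands that retag and push an already built image to each registry."""
    commands = []
    for registry in registries:
        target = f"{registry.rstrip('/')}/{image}"
        commands.append(["docker", "tag", image, target])
        commands.append(["docker", "push", target])
    return commands


# One build, many pushes; the registry is chosen at push time, not in the config.
build = build_command("my-model:v1")
pushes = push_commands(
    "my-model:v1", ["registry-a.example.com", "registry-b.example.com/"]
)
```

Each command list could be handed to `subprocess.run` as-is; nothing is executed here.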

droplet-bot added the instill core label Mar 13, 2024

heiruwu merged commit 05863a0 into main Mar 13, 2024

heiruwu deleted the dockerized branch March 13, 2024 17:16

droplet-bot mentioned this pull request Mar 13, 2024

chore(main): release 0.8.0 #106

Merged
