GitHub - MincYu/gillis-open-source

Gillis

A serverless-based ML model serving framework with automatic model partitioning.

Our paper: Minchen Yu, Zhifeng Jiang, Hok Chun Ng, Wei Wang, Ruichuan Chen, and Bo Li, ''Gillis: Serving Large Neural Networks in Serverless Functions with Automatic Model Partitioning,'' in the Proceedings of the 41st IEEE International Conference on Distributed Computing Systems (ICDCS'21), Virtual Conference, July 2021. (Best Paper Runner Up)

1. Intro

For a large DNN model, Gillis can divide it into multiple partitions using two partitioning algorithms, latency-optimal and SLO-aware, then automatically deploy model partitions on serverless platforms, including AWS Lambda, Google Cloud Functions and KNIX.

Currently, Gillis supports ONNX models and MXNet runtime.

2. Toy example

2.1 Prepare a model

cd partition
# download vgg-16
wget https://s3.amazonaws.com/onnx-model-zoo/vgg/vgg16/vgg16.onnx
mkdir -p models
mv vgg16.onnx models/

2.2 Partition a model using latency-optimal scheme

python main.py lo -n vgg16.onnx -p true

2.3 Deploy partitions on AWS Lambda

First, we copy the generated model partitions to the deployment directory, e.g., aws_lambda_deploy.

cd ..
mv vgg16_workspace/ aws_lambda_deploy/

Then, we deploy partitions on AWS Lambda.

cd aws_lambda_deploy
bash deploy.sh -j vgg16_workspace

Then you can follow the guides of aws-sam to finish the deployment.

2.4 After deployment

If everything is going well, you can see an API for model inference. Copy it and try the following command out!

curl [API]

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
aws_lambda_deploy		aws_lambda_deploy
google_function_deploy		google_function_deploy
knix_deploy		knix_deploy
partition		partition
tool		tool
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gillis

1. Intro

2. Toy example

2.1 Prepare a model

2.2 Partition a model using latency-optimal scheme

2.3 Deploy partitions on AWS Lambda

2.4 After deployment

About

Releases

Packages

Languages

MincYu/gillis-open-source

Folders and files

Latest commit

History

Repository files navigation

Gillis

1. Intro

2. Toy example

2.1 Prepare a model

2.2 Partition a model using latency-optimal scheme

2.3 Deploy partitions on AWS Lambda

2.4 After deployment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages