FaST-GShare

Introduction

FaST-GShare: Enabling Efficient Spatio-Temporal GPU Sharing in Serverless Computing for Deep Learning Inference.

FaST-GShare is a Kubernetes-based GPU sharing mechanism that enables fine-grained resource allocation across both temporal and spatial dimensions. Users can allocate corresponding GPU computing resources by configuring the appropriate temporal quota_limit and quota_req, as well as spatial sm_partition and memory, within the annotations. More details, please refer to the paper.

FaSTPod

FaSTPod is a Custom Resource with Controller that enables temporal and spacial GPU sharing. Users can specify fine-grained GPU resources for each Pod in the FaSTPod's YAML specification annotations, such as:

annotations: 
  fastgshare/gpu_quota_request: "0.7"
  fastgshare/gpu_quota_limit: "0.8"
  fastgshare/gpu_sm_partition: "30"
  fastgshare/gpu_mem: "2700000000"

Additionally, users can define the required replicas:

spec:
  replicas: 2

A sample FaSTPod deployment example is available in yaml/fastpod/testfastpod.yaml.

Deployment

Infrastruction Install

Install K8S infrastructure, NVIDIA Driver && Toolkit, and other prerequisite, please follow Installation Guide.

Install FaST-GShare FaSTPod

Deploy FaSTPod CRD (Custom Resource Definition)

$ kubectl apply -f ./yaml/crds/fastpod_crd.yaml

Deploy FaSTPod Controller Manager and GPU Resource Configurator
```
$ bash ./yaml/fastgshare/apply_deploy_ctr_mgr_node_daemon.sh
```

Test the FaSTPod example

$ kubectl apply -f ./yaml/fastpod/testfastpod.yaml

check the FaSTPods and correponding Pods deployed:

$ kubectl get pods -n fast-gshare
$ kubectl get fastpods -n fast-gshare

Uninstall the FaST-GShare FaSTPod deployment

$ bash ./yaml/fastgshare/clean_deploy_ctr_mgr_node_daemon.sh

Install and Uninstall FaST-GShare-Function (without Autoscaler)

The deployment of the FaST-GShare-Function in this project does not include the FaST-GShare-Autoscaler and only deploys with a fixed number of replicas. The complete FaST-GShare serverless version should include the FaST-GShare-Autoscaler, and the basic verion of Autoscaler plugin can be found at FaST-GShare-Autoscaler.

Install the FaST-GShare-Function Components

$ bash ./install/install_fast-gshare-fn.sh

Test if the FaST-GShare-Function is successfully deployed:

$ kubectl apply -f yaml/fastpod/test-fastgshare-fn.yaml

Uninstall the FaST-GShare-Function Components

$ bash ./install/uninstall_fast-gshare-fn.sh

Build FaST-GShare from Scratch based on the Code (for further Developing)

The detailed introduction to the FaST-GShare project's construction from the source code can be found in the ./develope directory and README.

Citation

If you use FaST-GShare for your research, please cite our, please cite our paper paper:

@inproceedings{gu2023fast,
  title={FaST-GShare: Enabling Efficient Spatio-Temporal GPU Sharing in Serverless Computing for Deep Learning Inference},
  author={Gu, Jianfeng and Zhu, Yichao and Wang, Puxuan and Chadha, Mohak and Gerndt, Michael},
  booktitle={Proceedings of the 52nd International Conference on Parallel Processing},
  pages={635--644},
  year={2023}
}

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
chart/fastgshare		chart/fastgshare
cmd		cmd
develop		develop
docker		docker
hack		hack
install		install
konton_test		konton_test
pkg		pkg
yaml		yaml
.gitmodules		.gitmodules
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
boilerplate.go.txt		boilerplate.go.txt
code-gen.sh		code-gen.sh
generate-groups.sh		generate-groups.sh
go.mod		go.mod
go.sum		go.sum
main.go		main.go
namespace.yaml		namespace.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FaST-GShare

Introduction

FaSTPod

Deployment

Infrastruction Install

Install FaST-GShare FaSTPod

Install and Uninstall FaST-GShare-Function (without Autoscaler)

Build FaST-GShare from Scratch based on the Code (for further Developing)

Citation

License

About

Releases

Packages

Languages

KontonGu/FaST-GShare

Folders and files

Latest commit

History

Repository files navigation

FaST-GShare

Introduction

FaSTPod

Deployment

Infrastruction Install

Install FaST-GShare FaSTPod

Install and Uninstall FaST-GShare-Function (without Autoscaler)

Build FaST-GShare from Scratch based on the Code (for further Developing)

Citation

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages