Deployable AI aims to enable quick inference serving in a local environment, in various styles of your choice.
To install this package, the easiest way is to run `pip install dpai`. If you prefer to install directly from this repo's code, clone it and run `make`.
- Save your model in `.joblib` format. Example:

  ```python
  from joblib import dump

  your_model_artifact = {
      "model": your_model,
      # other metadata
      "tokenizer": ...,
      "quantization": ...,
      # ...
  }
  dump(your_model_artifact, "MODEL_ARTIFACT_PATH.joblib")
  ```
- Create an inference script `inference.py` with two functions, `input_fn` and `predict_fn` (similar to how SageMaker inference works). Usually you'll create one inference file for each model you register (a concrete end-to-end example follows this list). Example:

  ```python
  def input_fn(data):
      processed_data_for_model_input = ...  # some transformation logic
      return processed_data_for_model_input

  def predict_fn(input, model):
      result = model(input)
      return result
  ```
- Register your model: run `deployaible register --name=your_model_name --model_path=your_model_path --inference_path=your_inference_path`
- Serve your model: run `deployaible serve --port=your_port`. You will get a backend running on `your_port` (default is 9000). A sample endpoint will be `localhost:9000/your_model_name/predict`.
- Format your data input in JSON style: `{"data": your_input_data}`. Make sure it aligns with the `input_fn` in your inference script.
- Test the endpoint: example request (a Python version of this request is sketched after this list):
  `curl -X POST -H "Content-Type: application/json" -d '{"data": ["val"]}' http://localhost:9100/GPT4/predict`
- You can also access the APIs via the Swagger UI at `http://localhost:your_port/docs`.
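
For concreteness, here is a minimal end-to-end sketch of the steps above using a scikit-learn classifier. The model name `iris_clf`, the file names, and the training data are illustrative assumptions, not anything this package requires. First, build and save the model artifact:

```python
# save_model.py -- illustrative only; any picklable model works
from joblib import dump
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000).fit(X, y)

# The artifact layout (a dict with a "model" key plus optional metadata)
# follows the .joblib example earlier in this README.
dump({"model": model}, "iris_clf.joblib")
```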
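A matching `inference.py` might look like the sketch below. Note the assumption that `model` is the loaded artifact (the dict saved above); if your setup passes the bare estimator instead, drop the dict lookup.

```python
# inference.py -- illustrative input_fn/predict_fn pair for the artifact above
import numpy as np

def input_fn(data):
    # `data` is the value of the "data" field from the JSON request body,
    # e.g. [[5.1, 3.5, 1.4, 0.2]]; convert it to a 2-D feature array.
    return np.asarray(data, dtype=float)

def predict_fn(input, model):
    # Assumption: `model` is the dict saved in save_model.py; fall back to
    # treating it as the estimator itself if it is not a dict.
    estimator = model["model"] if isinstance(model, dict) else model
    return estimator.predict(input).tolist()
```

You would then register and serve it with the commands above, e.g. `deployaible register --name=iris_clf --model_path=iris_clf.joblib --inference_path=inference.py` followed by `deployaible serve --port=9000` (again, the name and port are just examples).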
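If you prefer Python over `curl` for testing, an equivalent request could look like this sketch (the port, model name, and payload match the illustrative example above, not any package defaults):

```python
# test_request.py -- send a prediction request to the locally served model
import requests

url = "http://localhost:9000/iris_clf/predict"  # adjust to your --name / --port
payload = {"data": [[5.1, 3.5, 1.4, 0.2]]}      # must match what input_fn expects

response = requests.post(url, json=payload)     # sends Content-Type: application/json
response.raise_for_status()
print(response.json())  # response shape depends on your predict_fn
```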
- Supports multiple types of model serving
- Sample UI
- Works on Linux/MacOS/Windows
- Currently, the only supported request type is `application/json`.
See the doc here