Skip to content

A RESTful API to return predictions from a trained ML model, built with Python 3 and FastAPI

License

Notifications You must be signed in to change notification settings

jgbustos/ml_rest_fastapi

Repository files navigation

ml_rest_fastapi

CircleCI Build and publish Docker image Sonarcloud Status Known Vulnerabilities

A RESTful API to return predictions from a trained ML model, built with Python 3 and FastAPI.

Development set-up instructions

Windows

First, open Powershell and clone the GitHub repo in your workspace:

PS > cd $WORKSPACE_PATH$
PS > git clone https://github.com/jgbustos/ml_rest_fastapi
PS > cd ml_rest_fastapi

Create and activate a Python virtual environment, then install the required Python packages using pip:

PS > virtualenv venv
PS > venv\scripts\activate.ps1
(venv) PS > pip install -r requirements.txt

Once dependencies are installed, set up the project for development:

(venv) PS > pip install -e .

Finally, run the project:

(venv) PS > uvicorn ml_rest_fastapi.app:app --host 0.0.0.0 --port 8888 --reload

Linux

First, open a terminal and clone the GitHub repo in your workspace:

$ cd $WORKSPACE_PATH
$ git clone https://github.com/jgbustos/ml_rest_fastapi
$ cd ml_rest_fastapi

Create and activate a Python virtual environment, then install the required Python packages using pip:

$ virtualenv venv
$ source ./venv/bin/activate
(venv) $ pip install -r ./requirements.txt

Once dependencies are installed, set up the project for development:

(venv) $ pip install -e .

Finally, run the project:

(venv) $ gunicorn -c gunicorn.conf.py

Interfaces exposed

Open the URL http://localhost:8888/docs/ with your browser and see the Swagger UI page. Alternatively, open the URL http://localhost:8888/redoc/ to see the ReDoc UI.

OpenAPI JSON available from URL http://localhost:8888/openapi.json.

Health router

These two methods are meant to be used as the liveness and readiness probes in a Kubernetes deployment:

Model router

  • POST http://localhost:8888/model/predict will return a prediction using the ML model. The data_point structure shows the JSON argument that must be supplied, and example values for each of the fields. The service will validate that all the mandatory values are passed. Return values are:
    • 200/Predicted value based on JSON input
    • 422/"Validation Error" if any mandatory parameter is missing or if any wrong data type (e.g. str, int, bool, datetime...) is supplied
    • 500/"Internal Server Error" as catch-all exception handler
    • 503/"Not Ready" if model is not initialised

Config settings

Configuration parameters are contained in the file ml_rest_fastapi/settings.py, but they can also be overriden by setting env vars:

settings: Dict[str, Any] = {
    # Trained ML/AI model settings
    "TRAINED_MODEL_MODULE_NAME": "sample_model",
    # Module settings
    "EXPLAIN_PREDICTIONS": False,
    "DEBUG": True,
    "MULTITHREADED_INIT": True,
}
Parameter Values Details
TRAINED_MODEL_MODULE_NAME e.g.: sample_model Name of the Python module that initialises the ML model and returns predictions (see section below)
EXPLAIN_PREDICTIONS False/True When True, the model should return an explanation (using eli5, LIME, SHAP, etc.) along with the prediction
DEBUG False/True When Debug = True, Uvicorn will run in reload mode, and FastAPI will return a colourful traceback instead of a sober 500/"Internal Server Error"
MULTITHREADED_INIT False/True Multi-threaded initialisation of the trained model

Setting up the model

The trained ML model is meant to be initialised and invoked to make predictions in the context of a Python unit saved inside the directory ml_rest_fastapi/trained_model. The structure of this Python module is explained in this document.

Build automation

This project is built into a container image using GitHub Actions and pushed to the Docker Hub at https://hub.docker.com/r/jgbustos/ml-rest-fastapi/.

Running the Docker container

> docker run -d -p8888:8888 jgbustos/ml-rest-fastapi:latest

About

A RESTful API to return predictions from a trained ML model, built with Python 3 and FastAPI

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published