
Packaged Open Source Basic Image Classification Model





Open Source Basic Image Classification Repository | Python Model Template Repository | Documentation




Introduction

This repository contains a packaged basic image classification model sample, exposed as a web app microservice.

The model service accepts a job configuration object that tells the model which filesystem locations to read inputs from and write outputs to.

This is the Modzy-compatible version of the basic image classification repository.

A quick tour

Relevant files and directories:

File / directory      Description
flask_psc_model/*     A utility package that implements the container specification API with Flask.
model_lib/*           A sample model library package.
model_lib/model.py    Contains the BasicImageClassification class that wraps the model logic in an interface the flask_psc_model package can understand.
tests/*               The unit tests.
app.py                The model app that wraps the model in model_lib with the utilities from flask_psc_model.
Dockerfile            The app container definition.
entrypoint.sh         The script that starts the app server inside the container.
gunicorn.conf.py      The Gunicorn web server configuration file used in the Docker container.
model.yaml            The model metadata file with documentation and technical requirements.
requirements.txt      Pinned Python library dependencies for reproducible environments.

Installation

Install the pinned dependencies into a Python environment:

pip install -r requirements.txt

Usage

Once your environment is set up and the requirements are installed:

Run the download_weights.py script to download the model weights from the PyTorch Torchvision models library:

python download_weights.py
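
The script's contents are specific to this repository, but a minimal sketch of the idea, assuming a torchvision classification model such as resnet18 (the model choice and output filename here are assumptions):

import torch
from torchvision import models

# Download the pretrained weights so the container can be built and
# run without needing network access at inference time.
model = models.resnet18(pretrained=True)
torch.save(model.state_dict(), "resnet18_weights.pth")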

Build and run the container

Build the app server image:

docker build -t basic-image-classification .

Run the app server container on port 8080 and mount the local container-test directory onto the /data directory inside the container:

docker run --name basic-image-classification -e PSC_MODEL_PORT=8080 -p 8080:8080 -v $(pwd)/container-test:/data -d basic-image-classification:latest

Check the container’s status:

curl -s "http://localhost:8080/status"

Run an inference job using curl, sending the data in the container's /data directory to the model:

curl -s -X POST -H "Content-Type: application/json" \
    --data "{\"type\":\"file\",\"input\":\"/data\",\"output\":\"/data\"}" \
    "http://localhost:8080/run"
cat $(pwd)/container-test/results.json

Stop the app server:

curl -s -X POST "http://localhost:8080/shutdown"

Check that the exit code is 0:

docker inspect basic-image-classification --format="{{.State.Status}} {{.State.ExitCode}}"
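
A container that shut down cleanly prints:

exited 0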

Cleanup the exited Docker container:

docker rm basic-image-classification

Save the container to a TAR file:

docker save -o basic-image-classification.tar basic-image-classification:latest

Run the unit tests

Locally

python -m unittest

In Docker

docker run --rm --memory 512m --cpus 1 --shm-size 0m basic-image-classification:latest python -m unittest

The memory and cpus values must match the resources values in the model.yaml file and the resources later allotted to the container. shm-size is set to 0 to verify that the container does not rely on shared memory, which may be limited when deployed.

Adjust the values as needed when running the container, and remember to update the values in the model.yaml file to match.

In Docker with test files mounted as a volume

If the test files are large, it may be better to exclude them from the model container. If they are excluded, mount the tests directory as a volume into the application container and run the tests that way:

docker run --rm --memory 512m --cpus 1 --shm-size 0m -v $(pwd)/tests:/opt/app/tests basic-image-classification:latest python -m unittest

While the unit tests are very useful for ensuring that the model code works properly, they don't check that the container is configured correctly to communicate with the outside world.

You can manually test the container API using curl or another HTTP client, as shown in the usage steps above.
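
For example, a quick smoke test with Python's requests library, assuming the container is running on port 8080 with the volume mount shown earlier:

import requests

BASE = "http://localhost:8080"

# The model should report ready before any work is sent.
print(requests.get(BASE + "/status").json())

# Point the model at the mounted /data directory for input and output.
job = {"type": "file", "input": "/data", "output": "/data"}
print(requests.post(BASE + "/run", json=job).json())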

Minimal checklist to implement a new model

These are the basic steps needed to update this repository with your own model:

Create a copy of the repository or copy these files into an existing repository.

Update the model.yaml metadata file with information about the model. Ignore the resources and timeout sections until the containerized model is fully implemented.

Replace model_lib with the model’s code.

Update the requirements.txt file with any additional dependencies for the model.

Define a class that extends the flask_psc_model.ModelBase abstract base class and implements the required abstract methods:

- input_filenames
- output_filenames
- run

See model_lib/model.py for a sample implementation and the flask_psc_model.ModelBase docstrings for more info; a bare-bones sketch also follows this checklist.

Update app.py to configure the model app with the newly implemented model class.

Update and write new unit tests in tests/:

Add new test case data to tests/data/ with sample inputs and expected outputs.
- The examples directory should contain files that are expected to run successfully and their expected results.
- The validation-error directory should contain files that are not expected to run successfully and their expected error message text, to test the model’s error handling.

Add any model-specific unit tests to tests/test_model.py.

Update the application unit tests tests/test_app.py for the model. In particular, update the check_results function to validate that the actual application run results match the expected results.

Increase the timeout in the model.yaml file if the model needs more time to run in edge cases. The Gunicorn configuration file loads the timeout and uses it to stop the model if it takes too long to run.

Update the Dockerfile with all of the model app’s code, data, and runtime dependencies.

Use the Dockerfile to build the container image and test.

Use the container image to determine the final values for the resources and timeout sections of the model.yaml metadata file.
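
For orientation, here is a bare-bones sketch of the model class step above. Whether the filename lists are attributes or properties, the exact run signature, and the names used here are assumptions based on the description in this checklist, not the package's documented API:

from flask_psc_model import ModelBase

class MyImageClassifier(ModelBase):
    # Filenames (relative to the job's input and output directories)
    # that the app will read from and write to; names are hypothetical.
    input_filenames = ["image.jpg"]
    output_filenames = ["results.json"]

    def __init__(self):
        # Load model weights once at startup so that /status reflects
        # readiness; weight loading is omitted in this sketch.
        self.model = None

    def run(self, input_path, output_path):
        # Read the input file, run inference, and write the results
        # file to the output path.
        raise NotImplementedError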

Docker container specification

The Docker container must expose an HTTP API on the port specified by the PSC_MODEL_PORT environment variable that implements the /status, /run, and /shutdown routes detailed below.

The container must start the HTTP server process by default when run with no command argument:

docker run image

Define a CMD that starts the server process using the exec syntax in the Dockerfile, so that the server process receives operating-system signals directly:

COPY entrypoint.sh ./
CMD ["./entrypoint.sh"]

HTTP API Specification

The flask_psc_model package implements the HTTP API.

Response DTO:

The routes return an application/json MIME type with this format:

{
    "statusCode": 200,
    "status": "OK",
    "message": "The call went well or terribly."
}

If something went wrong, the message contains information to help address the issue.

Status [GET /status]

Returns the model’s status after initialization.

Response

  • Status 200: the model is ready to run.

  • Status 500: error loading the model.

Run [POST /run]

Runs the model inference on a given input.

Request Body

Contains the job configuration object with an application/json MIME type:

{
    "type": "file",
    "input": "/path/to/input/directory",
    "output": "/path/to/output/directory"
}

type (required)
    The input and output type; at this time the value needs to be "file".

input (required)
    The filesystem directory path where the model should read input data files.

output (required)
    The filesystem directory path where the model writes output data files.

The filenames for input and output files contained within the input and output directories are specified in the model metadata.

Response

  • Status 200: successful inference.

  • Status 400: invalid job configuration object:
    → The job configuration object is malformed, or the expected files do not exist, cannot be read, or cannot be written.
    When running on the platform this should not occur, but it may be useful for debugging.

  • Status 415: invalid media type:
    → The client did not post application/json in the HTTP body.
    When running on the platform this should not occur, but it may be useful for debugging.

  • Status 422: unprocessable input file:
    → The model cannot run inference on the input files. An input file may be in the wrong format, too large, too small, etc.

  • Status 500: error running the model.

Shutdown [POST /shutdown]

Instructs the model server process to exit with exit code 0.

Response

The model server is not required to send a response. It may simply drop the connection. However, a response is encouraged.

  • Status 202: request accepted:
    → the server process will exit after returning the response.

  • Status 500: unexpected error.
