UPDATE (06/09/23)

Heroku no longer has a free tier, which takes a lot of the "fun" out of this repo. Everything should still generally work as described, but you will need to pay for the dyno (which starts at $5 per month). Much of this repo is dated regardless, so I won't be updating it; if I did, I'd rewrite it completely. The general concept is still valid, though, so this may serve as a reference or, at worst, a relic of history.

At a high level, here's how I'd approach this now:

Use PyTorch instead of TensorFlow (sorry TensorFlow)
Use FastAPI instead of Flask (Flask is arguably still faster to work with, but FastAPI is simple enough and a solid choice)
Deploy the app to AWS Lambda (which can easily be served via a Function URL)

You can deploy this app to AWS Lambda in one of two general ways:

Create a Lambda container image, push it to ECR, and deploy the image to Lambda.
Package the Lambda without the model as a zip file and deploy it, push the model to S3, then have the Lambda fetch and load the model on initialization.

The container approach more closely matches what I did here originally, and keeping everything in a single image makes things a lot simpler. Such infrastructure should also fall within AWS's free tier (depending on usage, at least). I probably would have suggested it originally, but Lambda container images weren't available when I wrote this.
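For the second (zip plus S3) option, the fetch-and-load step could look roughly like the sketch below. It is not code from this repo: the bucket name, object key, local path, and the `get_model` and `download_from_s3` helpers are all hypothetical, and the `load` argument is a stand-in for whatever framework loader you would actually use. The sketch loads the model on first use and caches it at module level, so warm invocations skip the download. Calling `get_model()` at import time would move the work into initialization proper.

```python
import os
from typing import Any, Callable, Dict, Optional

# ASSUMPTION: bucket, key, and local path are hypothetical placeholders.
MODEL_BUCKET = os.environ.get("MODEL_BUCKET", "my-model-bucket")
MODEL_KEY = os.environ.get("MODEL_KEY", "mnist/model.pt")
LOCAL_PATH = "/tmp/model.pt"  # /tmp is the writable directory in Lambda

_model: Optional[Any] = None  # cached across warm invocations


def download_from_s3(bucket: str, key: str, dest: str) -> str:
    """Download the model file from S3 (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so this module loads without boto3

    boto3.client("s3").download_file(bucket, key, dest)
    return dest


def get_model(fetch: Callable[[str, str, str], str] = download_from_s3,
              load: Callable[[str], Any] = lambda path: path) -> Any:
    """Fetch and load the model once, then reuse the cached object.

    `load` is a stand-in: by default it just returns the file path.
    """
    global _model
    if _model is None:
        _model = load(fetch(MODEL_BUCKET, MODEL_KEY, LOCAL_PATH))
    return _model


def handler(event: Dict[str, Any], context: Any) -> Dict[str, Any]:
    """Lambda entry point; inference itself is omitted in this sketch."""
    model = get_model()
    return {"statusCode": 200, "body": f"model ready: {model is not None}"}
```

Passing `fetch` and `load` as arguments keeps the caching logic testable without AWS access, since a fake downloader can be swapped in locally.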

Deploying a Model

In this repo we set up the environment, train a model, build a Flask app to serve it, and deploy the app to Heroku.

Set Up the Environment

We first create a virtual environment, then install the dependencies found in requirements.txt.

python3 -m venv venv
source venv/bin/activate
pip install --upgrade pip && pip install -r requirements.txt

Train the Model

This trains a simple neural network to classify MNIST digits.

python3 ./train.py

The model is saved to mnist/. It takes a 28x28 grayscale image as input and returns a vector of length 10, where each value is the probability that the image shows the digit at its index.
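To make the output format concrete, here is a minimal sketch of reading such a vector. The helper name `predict_digit` and the sample values are illustrative only and are not taken from this repo.

```python
from typing import Sequence, Tuple


def predict_digit(probabilities: Sequence[float]) -> Tuple[int, float]:
    """Return (digit, probability) for the most likely class.

    Assumes `probabilities` has length 10, where index i holds the
    model's probability that the image shows the digit i.
    """
    if len(probabilities) != 10:
        raise ValueError("expected a vector of length 10")
    digit = max(range(10), key=lambda i: probabilities[i])
    return digit, probabilities[digit]


# Made-up output vector in which index 7 has the highest probability.
example = [0.01, 0.01, 0.02, 0.03, 0.01, 0.02, 0.05, 0.80, 0.03, 0.02]
digit, confidence = predict_digit(example)
print(digit, confidence)
```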

Build the Flask App

The Flask app is responsible for two things:

Serving the static front-end site.
Hosting the endpoint which we pass data to for inference.

The Flask app is found in app.py. The templates/ and static/ directories hold resources for the front-end, a simple Vue app we can use to interact with the server. The site is hosted at /, and you can POST grayscale image data to /infer to run it through the model for inference.
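As a rough illustration of calling the endpoint from a script rather than the Vue front-end, the sketch below builds a POST request to /infer with the standard library. The JSON body shape (`{"image": ...}`) and the helper name `build_infer_request` are assumptions made for this example; check app.py for the payload format the server actually expects.

```python
import json
import urllib.request
from typing import List


def build_infer_request(image: List[List[float]],
                        base_url: str = "http://localhost:5000") -> urllib.request.Request:
    """Build (but do not send) a POST request to the /infer endpoint.

    ASSUMPTION: the JSON body {"image": [[...], ...]} is hypothetical;
    the real payload format is defined in app.py.
    """
    if len(image) != 28 or any(len(row) != 28 for row in image):
        raise ValueError("expected a 28x28 grayscale image")
    body = json.dumps({"image": image}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/infer",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# A blank (all-zero) 28x28 image as placeholder input.
blank = [[0.0] * 28 for _ in range(28)]
request = build_infer_request(blank)
print(request.get_method(), request.full_url)

# With the app running locally, the request could be sent like this:
# with urllib.request.urlopen(request) as response:
#     print(response.read().decode("utf-8"))
```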

You can run it locally with python3 ./app.py and navigate to localhost:5000 to test it.

Deploy to Heroku

The container's configuration is found in Dockerfile. The image is built on the Ubuntu image: we set up the Python environment, install the project's requirements, then copy the model and the code into the container. The image's entrypoint is python3 app.py, which runs the Flask app.

We can build the image and directly deploy to Heroku with the following commands. You will need Docker and the Heroku CLI installed and authenticated.

heroku container:push web --app <app-name>
heroku container:release web --app <app-name>

Note: The first time you push the container to Heroku, you will have to push a layer containing TensorFlow, which is about 2 GB. This can be slow on Heroku, but later updates to the image skip this layer, so subsequent uploads are quick.

Once deployed, navigate to https://<app-name>.herokuapp.com/ (it may take a minute after release for the app to set up and load). Check out the demo, which may take a minute to load since free dynos have to wake up after periods of inactivity.

nathanmargaglio/Deployable-Model
