Homework

In this homework, we'll deploy the dino or dragon model we trained in the previous homework.

Download the model from here:

https://github.com/SVizor42/ML_Zoomcamp/releases/download/dino-dragon-model/dino_dragon_10_0.899.h5

Question 1

Now convert this model from Keras to TF-Lite format.
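One way to do the conversion is sketched below. It assumes TensorFlow is installed and that the downloaded `.h5` file is in the current directory. The function names and the output path are illustrative, not part of the homework.

```python
import os


def convert_keras_to_tflite(keras_path, tflite_path):
    """Load a Keras .h5 model and write it out in TF-Lite format."""
    # Imported here so the size helper below also works without TensorFlow
    import tensorflow as tf

    model = tf.keras.models.load_model(keras_path)
    converter = tf.lite.TFLiteConverter.from_keras_model(model)
    tflite_model = converter.convert()  # serialized model as bytes

    with open(tflite_path, "wb") as f_out:
        f_out.write(tflite_model)


def file_size_mb(path):
    """Return the file size in megabytes (1 MB taken as 1024 * 1024 bytes)."""
    return os.path.getsize(path) / (1024 * 1024)
```

A call such as `convert_keras_to_tflite("dino_dragon_10_0.899.h5", "dino_dragon.tflite")` followed by `file_size_mb("dino_dragon.tflite")` should give a number to compare against the options.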

What's the size of the converted model?

21 MB
43 MB
80 MB
164 MB

Question 2

To be able to use this model, we need to know the index of the input and the index of the output.
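The indices can be read from the interpreter's tensor details. The sketch below assumes the model has a single input and a single output, and uses the interpreter bundled with TensorFlow. The helper names are illustrative.

```python
def load_interpreter(model_path):
    """Create a TF-Lite interpreter for the model and allocate its tensors."""
    # Imported lazily; with the standalone runtime the equivalent import is
    # `import tflite_runtime.interpreter as tflite`
    import tensorflow.lite as tflite

    interpreter = tflite.Interpreter(model_path=model_path)
    interpreter.allocate_tensors()
    return interpreter


def get_io_indices(interpreter):
    """Return (input_index, output_index), assuming one input and one output."""
    input_index = interpreter.get_input_details()[0]["index"]
    output_index = interpreter.get_output_details()[0]["index"]
    return input_index, output_index
```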

What's the output index for this model?

3
7
13
24

Preparing the image

You'll need some code for downloading and resizing images. You can use this code:

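from io import BytesIO
from urllib import request

from PIL import Image

def download_image(url):
    """Download the image at `url` and return it as a PIL image."""
    with request.urlopen(url) as resp:
        buffer = resp.read()
    # Wrap the raw bytes in a file-like object so PIL can read them
    stream = BytesIO(buffer)
    img = Image.open(stream)
    return img


def prepare_image(img, target_size):
    """Convert the image to RGB and resize it to `target_size` (width, height)."""
    if img.mode != 'RGB':
        img = img.convert('RGB')
    # Nearest-neighbour resampling
    img = img.resize(target_size, Image.NEAREST)
    return img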

For that, you'll need to have pillow installed:

pip install pillow

Let's download and resize this image:

https://upload.wikimedia.org/wikipedia/commons/thumb/d/df/Smaug_par_David_Demaret.jpg/1280px-Smaug_par_David_Demaret.jpg

Based on the previous homework, what should be the target size for the image?

Question 3

Now we need to turn the image into a NumPy array and pre-process it.

Tip: Check the previous homework. What was the pre-processing we did there?
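As a rough sketch, the conversion could look like the code below. It assumes the pre-processing is a plain rescale by 1/255; check the previous homework to confirm this before relying on it. The function name `preprocess` is illustrative.

```python
import numpy as np


def preprocess(img):
    """Turn an image (PIL image or H x W x 3 array) into a batch of one."""
    x = np.array(img, dtype="float32")
    # ASSUMPTION: rescale pixel values from [0, 255] to [0, 1];
    # replace this with whatever the previous homework actually did
    x = x / 255.0
    # Add a leading batch dimension: (H, W, 3) -> (1, H, W, 3)
    return x[np.newaxis, ...]
```

With `X = preprocess(img)`, the R channel of the first pixel is `X[0, 0, 0, 0]`.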

After the pre-processing, what's the value in the first pixel, the R channel?

0.3353411
0.5529412
0.7458824
0.9654902

Question 4

Now let's apply this model to this image. What's the output of the model?

0.17049132
0.39009996
0.60146114
0.82448614
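For question 4, running the interpreter on the pre-processed image can be sketched as follows. It assumes an interpreter whose tensors are already allocated and a model with one input and one output. `run_model` is an illustrative name.

```python
def run_model(interpreter, X):
    """Feed the batch X to the TF-Lite interpreter and return its raw output."""
    input_index = interpreter.get_input_details()[0]["index"]
    output_index = interpreter.get_output_details()[0]["index"]

    interpreter.set_tensor(input_index, X)    # copy the input into the model
    interpreter.invoke()                      # run inference
    return interpreter.get_tensor(output_index)
```

The result is an array with one row per image in the batch, so for a single image the score is the first element of the first row.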

Prepare the lambda code

Now you need to copy all the code into a separate Python file. You will need this file for the next two questions.

Tip: you can test this file locally with IPython or Jupyter Notebook by importing it and calling its function.

Docker

For the next two questions, we'll use a Docker image that we already prepared. This is the Dockerfile that we used for creating the image:

FROM public.ecr.aws/lambda/python:3.9
COPY dino-vs-dragon-v2.tflite .

We pushed it to Docker Hub as svizor42/zoomcamp-dino-dragon-lambda:v2.

A few notes:

The image already contains a model, and it's not the same model as the one we used for questions 1-4.
The version of Python is 3.9, so you need to use the right wheel for TF-Lite. For TensorFlow 2.7.0, it's https://github.com/alexeygrigorev/tflite-aws-lambda/raw/main/tflite/tflite_runtime-2.7.0-cp39-cp39-linux_x86_64.whl

Question 5

Download the base image svizor42/zoomcamp-dino-dragon-lambda:v2. You can do this with the docker pull command.

So what's the size of this base image?

139 MB
329 MB
639 MB
929 MB

You can get this information by running docker images; it's in the "SIZE" column.

Question 6

Now let's extend this Docker image, install all the required libraries, and add the code for the Lambda function.

You don't need to include the model in the image. It's already included. The name of the file with the model is dino-vs-dragon-v2.tflite and it's in the current workdir in the image (see the Dockerfile above for the reference).

Now run the container locally.

Score this image: https://upload.wikimedia.org/wikipedia/en/e/e9/GodzillaEncounterModel.jpg
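A small client for the local container might look like the sketch below. It assumes the container was started with port 8080 published (for example `docker run -it --rm -p 8080:8080 <your-image>`) and that the handler reads the image address from a `"url"` key, which is a convention of this sketch. The endpoint is the default one exposed by the runtime interface emulator in the AWS Lambda base images.

```python
import json
from urllib import request

# Default local invocation endpoint of the Lambda runtime interface emulator
LOCAL_URL = "http://localhost:8080/2015-03-31/functions/function/invocations"


def build_request(image_url, endpoint=LOCAL_URL):
    """Build the POST request that carries the event payload."""
    # ASSUMPTION: the handler expects the image address under the "url" key
    payload = json.dumps({"url": image_url}).encode("utf-8")
    return request.Request(
        endpoint,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def invoke(image_url, endpoint=LOCAL_URL):
    """Send the event to the running container and return the parsed reply."""
    with request.urlopen(build_request(image_url, endpoint)) as resp:
        return json.loads(resp.read())
```

With the container running, `invoke("https://upload.wikimedia.org/wikipedia/en/e/e9/GodzillaEncounterModel.jpg")` returns whatever the handler sends back.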

What's the output from the model?

0.12
0.32
0.52
0.72

Publishing it to AWS

Now you can deploy your model to AWS!

Publish your image to ECR
Create a lambda function in AWS, use the ECR image
Give it more RAM and increase the timeout
Test it
Expose the lambda function using API Gateway

This is optional and not graded.

Publishing to Docker Hub

For reference, this is how we published our image to Docker Hub:

docker build -t zoomcamp-dino-dragon-lambda .
docker tag zoomcamp-dino-dragon-lambda:latest svizor42/zoomcamp-dino-dragon-lambda:v2
docker push svizor42/zoomcamp-dino-dragon-lambda:v2

Submit the results

Submit your results here: https://forms.gle/Pnx563ELg9jgjxHX6
You can submit your solution multiple times. In this case, only the last submission will be used.
If your answer doesn't match the options exactly, select the closest one.

Deadline

The deadline for submitting is 28 November 2022 (Monday), 23:00 CET (Berlin time).

After that, the form will be closed.
