
Chapter Two

Deployment of Machine Learning Model on Heroku using Flask



Introduction

Flask is a lightweight web framework written in Python. It is easy to use and easy for beginners to get started with. It is classified as a microframework because it does not require particular tools or libraries to work: it has no database abstraction layer, form validation, or other components for which pre-existing third-party libraries already provide common functions.
In this section of the tutorial, you will learn how to set up a Flask project and deploy the Machine Learning model you developed in the previous chapter. By the end of this section you will be able to deploy any model using Flask on Heroku.

All code files for this project are available here - https://github.com/ahmadkhan242/Transfer-Learning-Model-hosted-on-Heroku-using-React-Flask/tree/main/Webapp/Flask

Contents of this Section

To implement the complete project you will need the following:

  • Before installing Flask, we will create a separate Python environment for this project. If you are using Anaconda, open Anaconda Prompt; otherwise open Command Prompt on Windows. Linux users can open their terminal instead. Enter the following command to create an environment named venv.
python3 -m venv venv
  • To use this environment we need to activate it using the following command.
source venv/bin/activate
  • With the environment activated, we can now download all the packages required for the project.
    This project requires a few packages such as torch, numpy, pandas, and transformers (pickle ships with Python, so it does not need to be installed separately).
pip install flask torch torchvision numpy pandas transformers
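
Heroku will later install your dependencies from a requirements.txt file, so it is a good idea to record them now. One simple way to generate it (a sketch; you may want to trim the pinned versions it produces):

    pip freeze > requirements.txt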

As a best practice, we will use the following file structure.

─── Flask
    ├── ml_model
        ├── final_model.pth ------ (Final saved model)
        ├── predict.py ----------- (Python script to create the pipeline)
        └── vocab.pickle --------- (Contains the English vocab used to create word sequences)
    └── app.py -------------------- (Main Flask file)

We define our pipeline script in the predict.py file under the ml_model folder.
The final pipeline is as follows - Data -> Pre-processing -> Model -> Prediction -> Final Result

The final code is presented here.

All of this code is to be written in the ml_model/predict.py file.

    # Importing libraries
    import os
    import pickle
    import re

    import numpy as np
    import torch
    from transformers import RobertaModel

    cwd = os.getcwd()

    # Loading the saved vocabulary (a word -> index mapping built in the previous chapter)
    with open(os.path.join(cwd, 'ml_model', 'vocab.pickle'), 'rb') as f:
        b = pickle.load(f)
    vocab = b['vocab']

    # Defining pre-processing function
    def text_preprocess(text):
        text = str(text)
        FLAGS = re.MULTILINE | re.DOTALL
        eyes = r"[8:=;]"
        nose = r"['`\-]?"

        def re_sub(pattern, repl):
            return re.sub(pattern, repl, text, flags=FLAGS)
        text = re_sub(r"https?:\/\/\S+\b|www\.(\w+\.)+\S*", "<url>")
        text = re_sub(r"/"," / ")
        text = re_sub(r"@\w+", "<user>")
        text = re_sub(r"{}{}[)dD]+|[)dD]+{}{}".format(eyes, nose, nose, eyes), "<smile>")
        text = re_sub(r"{}{}p+".format(eyes, nose), "<lolface>")
        text = re_sub(r"{}{}\(+|\)+{}{}".format(eyes, nose, nose, eyes), "<sadface>")
        text = re_sub(r"{}{}[\/|l*]".format(eyes, nose), "<neutralface>")
        text = re_sub(r"<3","<heart>")
        text = re_sub(r"[-+]?[.\d]*[\d]+[:,.\d]*", "<number>")
        text = re_sub(r"([!?.]){2,}", r"\1 <repeat>")
        text = re_sub(r"\b(\S*?)(.)\2{2,}\b", r"\1\2 <elong>")
        return text
   
    # Creating RoBERTa Model 
    class ROBERTA(torch.nn.Module):
        def __init__(self, dropout_rate=0.3):
            super(ROBERTA, self).__init__()
            self.roberta = RobertaModel.from_pretrained('roberta-base')
            self.d1 = torch.nn.Dropout(dropout_rate)
            self.l1 = torch.nn.Linear(768, 64)
            self.bn1 = torch.nn.LayerNorm(64)
            self.d2 = torch.nn.Dropout(dropout_rate)
            self.l2 = torch.nn.Linear(64, 2)

        def forward(self, input_ids, attention_mask):
            _, x = self.roberta(input_ids=input_ids, attention_mask=attention_mask)
            x = self.d1(x)
            x = self.l1(x)
            x = self.bn1(x)
            x = torch.nn.Tanh()(x)
            x = self.d2(x)
            x = self.l2(x)
            return x

    # Initialising model and loading weights
    model = ROBERTA()
    state_dict = torch.load(os.path.join(cwd, 'ml_model', 'final_model.pth'), map_location=torch.device('cpu'))
    model.load_state_dict(state_dict, strict=False)

    # Final function defining whole pipeline
    def pred(text):
        print("Text Received =>", text)
        text = text_preprocess(text)
        word_seq = np.array([vocab[word] for word in text.split() 
                          if word in vocab.keys()])
        word_seq = np.expand_dims(word_seq,axis=0)
        t = torch.from_numpy(word_seq).to(torch.int64)
        mask = (t != 1).type(torch.uint8)

        output = model(t, attention_mask=mask)
        print("Got output - ",output)
        pro = torch.argmax(output, axis=-1).tolist()[0]
        status = "positive" if pro == 1 else "negative"
        return status  
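
Before moving on, it is worth sanity-checking the pipeline by calling pred() directly (a quick local test; the sample review text below is just an illustration):

    # Run from the Flask/ directory so the relative ml_model/ paths resolve
    from ml_model.predict import pred

    print(pred("This movie was absolutely wonderful!"))  # prints "positive" or "negative"

Let us now walk through this pipeline step by step.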
  1. Getting data.
    • Data will be received in JSON format (we will discuss later how to receive data).
    {
           "review": "Sample review"
     }
  2. Pre-Processing
    We pre-process the text by replacing URLs, user mentions, emoticons, and numbers with placeholder tokens such as <url>, <user>, and <number>, and by normalising repeated punctuation and elongated words. We will use the function given below.
    def text_preprocess(text):
        text = str(text)
        FLAGS = re.MULTILINE | re.DOTALL
        eyes = r"[8:=;]"
        nose = r"['`\-]?"
    
        def re_sub(pattern, repl):
            return re.sub(pattern, repl, text, flags=FLAGS)
        text = re_sub(r"https?:\/\/\S+\b|www\.(\w+\.)+\S*", "<url>")
        text = re_sub(r"/"," / ")
        text = re_sub(r"@\w+", "<user>")
        text = re_sub(r"{}{}[)dD]+|[)dD]+{}{}".format(eyes, nose, nose, eyes), "<smile>")
        text = re_sub(r"{}{}p+".format(eyes, nose), "<lolface>")
        text = re_sub(r"{}{}\(+|\)+{}{}".format(eyes, nose, nose, eyes), "<sadface>")
        text = re_sub(r"{}{}[\/|l*]".format(eyes, nose), "<neutralface>")
        text = re_sub(r"<3","<heart>")
        text = re_sub(r"[-+]?[.\d]*[\d]+[:,.\d]*", "<number>")
        text = re_sub(r"([!?.]){2,}", r"\1 <repeat>")
        text = re_sub(r"\b(\S*?)(.)\2{2,}\b", r"\1\2 <elong>")
        return text
  3. Defining the Machine Learning Model and loading weights.
    • For this problem we first define our model architecture, which is based on RoBERTa, and then load the pre-trained weights we saved in the previous chapter.
    • Important Since we saved a state file, which stores only the parameters in dictionary form rather than the complete model, we need to create the model again and load these values into it (see the save/load sketch after this list).
    class ROBERTA(torch.nn.Module):
        def __init__(self, dropout_rate=0.3):
            super(ROBERTA, self).__init__()
            self.roberta = RobertaModel.from_pretrained('roberta-base')
            self.d1 = torch.nn.Dropout(dropout_rate)
            self.l1 = torch.nn.Linear(768, 64)
            self.bn1 = torch.nn.LayerNorm(64)
            self.d2 = torch.nn.Dropout(dropout_rate)
            self.l2 = torch.nn.Linear(64, 2)

        def forward(self, input_ids, attention_mask):
            _, x = self.roberta(input_ids=input_ids, attention_mask=attention_mask)
            x = self.d1(x)
            x = self.l1(x)
            x = self.bn1(x)
            x = torch.nn.Tanh()(x)
            x = self.d2(x)
            x = self.l2(x)
            return x
    model = ROBERTA()
    state_dict = torch.load(os.path.join(cwd, 'ml_model', 'final_model.pth'), map_location=torch.device('cpu'))
    model.load_state_dict(state_dict, strict=False)
  4. Finally we wrap the whole pipeline in a single function given below.
    def pred(text):
        text = text_preprocess(text)                              # Preprocessing
        word_seq = np.array([vocab[word] for word in text.split() # Creating Word Sequence
                          if word in vocab.keys()])
        word_seq = np.expand_dims(word_seq,axis=0)
        t = torch.from_numpy(word_seq).to(torch.int64)            # Converting Numpy to torch tensor(int64)
        mask = (t != 1).type(torch.uint8)
        output = model(t, attention_mask=mask)
        print("Got output - ",output)
        pro = torch.argmax(output, axis=-1).tolist()[0]
        status = "positive" if pro == 1 else "negative"
        return status
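
For context, this assumes that in the previous chapter the weights were saved as a state_dict rather than as a full pickled model, roughly like the sketch below (the filename matches the one we load above):

    # In the previous chapter, after training:
    torch.save(model.state_dict(), 'final_model.pth')  # stores only the parameter dictionary

    # Which is why here we rebuild the architecture first, then load the weights:
    model = ROBERTA()
    model.load_state_dict(torch.load('final_model.pth', map_location='cpu'), strict=False)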

The final script is to be written in the app.py file. This file will handle all the HTTP requests we are going to use.
In this file we will import the pred() function we created in the pipeline section, then import Flask to create an app instance.

    app = Flask(__name__)
The final code for app.py is presented here.

All of this code is to be written in the app.py file.

    # Importing pred function from ml_model/predict.py file.
    from ml_model.predict import pred

    import os
    from flask import Flask, request
    
    # Create an app instance using Flask
    app = Flask(__name__)

    # Set up the main route
    @app.route('/predict', methods=['POST'])
    def home():
        print("Action Initiated")
        review = request.json['review']
        prediction = pred(review)
        print(prediction)
        return prediction

    if __name__ == '__main__':
        app.run()

We will use this instance to handle HTTP requests. First we define a route decorator @app.route() and pass it the endpoint name; here we use /predict. Since we will be receiving our text data on this route, we pass the method as POST.
If you don't know the basics of networking, learn about them here.

    @app.route('/predict', methods=['POST'])

Now we will define a function that will be executed when someone makes a POST request on the /predict route. In this function we take the data from the POST request and pass it to the pred() function we imported from the ml_model/predict.py file.

    def home():
        review = request.json['review']
        prediction = pred(review)
        print(prediction)
        return prediction

To run the Flask app, open a terminal in the project directory and type the following command.

    flask run
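
Because app.py calls app.run() under the __main__ guard, you can alternatively start the development server with:

    python app.py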

Output

  • Now we can test our API at http://127.0.0.1:5000/ using the /predict endpoint we created in the app.py file.
  • To test your final API you can use any API client (Postman, Insomnia, etc.).
    We are using Postman, as shown in the screenshot; you can learn about it here - https://www.postman.com/
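
If you prefer the command line, the same request can be made with curl (assuming the Flask development server is running on its default port 5000):

    curl -X POST http://127.0.0.1:5000/predict \
         -H "Content-Type: application/json" \
         -d '{"review": "Sample review"}'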

We assume you have a GitHub account and know how to create a repository. If not, learn here.

  1. Create a file named Procfile in the main directory. This file specifies which command to run at app startup. Since our Flask instance is named app and lives in app.py, write this command (and make sure gunicorn is also listed in requirements.txt):
    web: gunicorn --bind 0.0.0.0:$PORT app:app

IMPORTANT In your requirements.txt file, remove torch and torchvision and put these URLs instead.
> https://download.pytorch.org/whl/cpu/torch-1.6.0%2Bcpu-cp37-cp37m-linux_x86_64.whl
> https://download.pytorch.org/whl/cpu/torchvision-0.7.0%2Bcpu-cp37-cp37m-linux_x86_64.whl
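
After this change, a requirements.txt for this project might look roughly like this (a sketch based on the packages we installed earlier; exact pinned versions will vary):

    flask
    gunicorn
    numpy
    pandas
    transformers
    https://download.pytorch.org/whl/cpu/torch-1.6.0%2Bcpu-cp37-cp37m-linux_x86_64.whl
    https://download.pytorch.org/whl/cpu/torchvision-0.7.0%2Bcpu-cp37-cp37m-linux_x86_64.whl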

Explanation with Screenshot

The requirements.txt file is used by the Heroku server to download all packages. If we specify torch == 1.6.0, it will download the whole PyTorch library. Since we are using the free version of Heroku, we have only CPU support, not GPU. The full PyTorch library ships with files for both GPU and CPU support, so we need to download only the CPU-specific files. As Heroku provides only 500 MB of storage in the free version and the complete PyTorch library is more than 600 MB, we switch to the CPU version, which takes only about 160 MB of space.

(Screenshots: requirements.txt before and after the update.)

  1. Now create a repository and push all the code to GitHub. If you don't know how to do that, learn it here.

  2. Create a Heroku account at https://signup.heroku.com/ and click Create New App.


  3. Choose an app name and region.


  4. Link your GitHub account with Heroku, search for the repository where you pushed your code, and connect it.


  5. Choose a branch, enable automatic deploys so that Heroku rebuilds your app whenever you push changes to the repository, and hit Deploy Branch.


  6. You can watch the app build log. It will display any errors that occur.


  7. Finally, after a successful build, you can launch your app by clicking View.

    • Since we have not included a frontend, we will not see anything in the browser in this case; however, we can test our API the same way we tested it locally, just by replacing localhost with the new URL.

      http://127.0.0.1:5000/predict => http://NEW_URL/predict



In this blog we created a Flask app to deploy the RoBERTa model we built in the previous chapter. You have now seen the step-by-step process of creating a Flask app and deploying it on Heroku. You can reuse this blog for your own project: just adapt the pipeline to your machine learning model and then follow the steps written here.





Building Frontend using React.