
An E2E MLOps project focused on deploying and serving a fine-tuned transformer on an NLP classification task


E2E NLP MLOps


Introduction

  • The goal of this project is to implement an initial, moderately complex end-to-end project, better described as E2E MLOps.
  • Most of the scripts presented here are taken from this fantastic 8-week course called MLOps-Basics, but they have been heavily modified, especially their README.md files.
  • This project will cover the full MLOps cycle. We'll implement an NLP model based on a pre-trained transformer architecture, which will be fine-tuned, deployed and ultimately served.



How to use this 9-step tutorial

  • Each step builds on top of the previous one.
  • Do not assume that files with the same name have the same content across steps.

  • How to get the data?
  • How to process the data?
  • How to define a DataModule in ⚡ PyTorch Lightning, as opposed to vanilla PyTorch DataLoaders
  • How to build a model to fine-tune a pre-trained transformer on a classification task?
  • How to train the model on CPUs on your local machine or on GPUs on Google Colab?
  • How to run inference?
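A minimal sketch of these pieces, assuming the Hugging Face `datasets`/`transformers` stack together with ⚡ PyTorch Lightning (the dataset, model name and hyper-parameters below are illustrative, not necessarily the ones used in the tutorial):

```python
import pytorch_lightning as pl
import torch
from torch.utils.data import DataLoader
from datasets import load_dataset
from transformers import AutoModelForSequenceClassification, AutoTokenizer


class TextDataModule(pl.LightningDataModule):
    """Bundles data download, tokenisation and DataLoaders in one object."""

    def __init__(self, model_name="bert-base-uncased", batch_size=32):
        super().__init__()
        self.model_name = model_name
        self.batch_size = batch_size

    def setup(self, stage=None):
        tokenizer = AutoTokenizer.from_pretrained(self.model_name)
        dataset = load_dataset("glue", "cola")  # illustrative dataset choice

        def tokenize(batch):
            return tokenizer(batch["sentence"], truncation=True,
                             padding="max_length", max_length=128)

        dataset = dataset.map(tokenize, batched=True)
        dataset.set_format(type="torch",
                           columns=["input_ids", "attention_mask", "label"])
        self.train_data = dataset["train"]
        self.val_data = dataset["validation"]

    def train_dataloader(self):
        return DataLoader(self.train_data, batch_size=self.batch_size,
                          shuffle=True)

    def val_dataloader(self):
        return DataLoader(self.val_data, batch_size=self.batch_size)


class TextClassifier(pl.LightningModule):
    """Fine-tunes a pre-trained transformer with a classification head."""

    def __init__(self, model_name="bert-base-uncased", lr=2e-5):
        super().__init__()
        self.model = AutoModelForSequenceClassification.from_pretrained(
            model_name, num_labels=2)
        self.lr = lr

    def training_step(self, batch, batch_idx):
        outputs = self.model(input_ids=batch["input_ids"],
                             attention_mask=batch["attention_mask"],
                             labels=batch["label"])
        self.log("train/loss", outputs.loss)
        return outputs.loss

    def configure_optimizers(self):
        return torch.optim.AdamW(self.parameters(), lr=self.lr)


# `accelerator="auto"` trains on the CPU locally and on the GPU in Colab.
trainer = pl.Trainer(max_epochs=1, accelerator="auto")
trainer.fit(TextClassifier(), datamodule=TextDataModule())
```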

  • How to configure basic logging with W&B?
  • How to compute metrics and log them in W&B?
  • How to add plots in W&B?
  • How to add data samples to W&B?
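A sketch of how this typically looks with Lightning's `WandbLogger` (the project name and the logged table are made up for illustration):

```python
import wandb
from pytorch_lightning import Trainer
from pytorch_lightning.loggers import WandbLogger

# Point the Lightning Trainer at W&B; the project name is a placeholder.
wandb_logger = WandbLogger(project="mlops-nlp-project")
trainer = Trainer(logger=wandb_logger, max_epochs=1)

# Inside a LightningModule, anything passed to self.log(...) is sent to W&B:
#     self.log("valid/loss", loss, prog_bar=True)
#     self.log("valid/acc", acc, prog_bar=True)

# Plots and raw data samples can be logged as W&B media types, e.g. a table
# of wrongly-predicted validation sentences:
wandb_logger.experiment.log({
    "mispredicted": wandb.Table(
        columns=["sentence", "label", "prediction"],
        data=[["the cat sat mat on", 0, 1]]),  # illustrative row
})
```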

  • Basics of Hydra and how it differs from a plain YAML file
  • Overriding configurations at run time
  • Splitting configuration across multiple files
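A minimal sketch of the Hydra entry point, assuming a `configs/` directory whose `config.yaml` composes per-topic files such as `model/default.yaml` and `processing/default.yaml` (all names are illustrative):

```python
import hydra
from omegaconf import DictConfig, OmegaConf


@hydra.main(config_path="./configs", config_name="config")
def main(cfg: DictConfig):
    # Hydra has already merged config.yaml with the files listed under its
    # `defaults` key; the result is one nested DictConfig object.
    print(OmegaConf.to_yaml(cfg))
    print(cfg.model.name)  # hypothetical key


if __name__ == "__main__":
    main()
```

Unlike a plain YAML file read with `yaml.safe_load`, any value can then be overridden from the command line, e.g. `python train.py model.name=roberta-base training.max_epochs=3`.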

  • Basics of DVC
  • How DVC is similar to Git
  • Initialising DVC
  • Configuring Remote Storage
  • Saving Model to the Remote Storage
  • Versioning the models
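DVC is driven mostly from the command line (`dvc init`, `dvc remote add`, `dvc add`, `dvc push`), but versioned artefacts can also be fetched programmatically. A sketch using DVC's Python API; the file path and the Git tag are hypothetical:

```python
import dvc.api

# Read a DVC-tracked model as it existed at a given Git revision; DVC pulls
# the actual bytes from the configured remote storage.
with dvc.api.open("models/model.onnx", rev="v1.0", mode="rb") as f:
    model_bytes = f.read()

# Or just resolve where the remote copy of that version lives:
url = dvc.api.get_url("models/model.onnx", rev="v1.0")
print(url)
```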

  • Why do we need model packaging?
  • What is ONNX?
  • How to convert a trained model to ONNX format?
  • What is ONNX Runtime?
  • How to run ONNX converted model in ONNX Runtime?
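A sketch of both halves, export and inference. A freshly downloaded base model is used here only to keep the snippet self-contained; in the tutorial it would be the fine-tuned checkpoint, and the shapes and names are illustrative:

```python
import torch
import onnxruntime as ort
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")
model.eval()

# Export: trace the model once with a dummy batch.
input_ids = torch.ones(1, 128, dtype=torch.long)
attention_mask = torch.ones(1, 128, dtype=torch.long)
torch.onnx.export(
    model,
    (input_ids, attention_mask),
    "model.onnx",
    input_names=["input_ids", "attention_mask"],
    output_names=["logits"],
    dynamic_axes={"input_ids": {0: "batch"},       # allow variable batch size
                  "attention_mask": {0: "batch"}},
)

# Inference: ONNX Runtime runs the graph without needing PyTorch at all.
session = ort.InferenceSession("model.onnx")
(logits,) = session.run(None, {"input_ids": input_ids.numpy(),
                               "attention_mask": attention_mask.numpy()})
print(logits.shape)
```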

  • FastAPI wrapper
  • How to create an app with FastAPI
  • Basics of Docker
  • Building Docker Container
  • Docker Compose
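A minimal FastAPI sketch; `predict_label` is a placeholder standing in for the real tokenise-and-run-ONNX call:

```python
from fastapi import FastAPI

app = FastAPI(title="NLP classifier")


def predict_label(text: str) -> dict:
    # Placeholder: in the real app this tokenises `text` and runs the
    # ONNX Runtime session from the previous step.
    return {"text": text, "label": "acceptable", "score": 0.97}


@app.get("/")
async def home():
    return {"message": "NLP classifier is up"}


@app.get("/predict")
async def predict(text: str):
    return predict_label(text)
```

Served locally with `uvicorn app:app --host 0.0.0.0 --port 8000`; the same command then becomes the `CMD` of the Docker image, and `docker compose up` wires the container(s) together.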

  • Basics of GitHub Actions
  • First GitHub Action
  • Creating Google Service Account
  • Giving access to Service account
  • Configuring DVC to use Google Service account
  • Configuring the GitHub Action
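A skeleton workflow, hedged: the file name, job layout and secret name below are assumptions, not necessarily this repo's actual workflow. It checks out the code and uses a Google service-account credential (stored as a GitHub secret) so that DVC can pull the model:

```yaml
# .github/workflows/build.yaml (illustrative)
name: Build
on: [push]
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v3
      - uses: actions/setup-python@v4
        with:
          python-version: "3.9"
      - name: Pull model with DVC
        env:
          # DVC reads the service-account JSON from this environment variable.
          GDRIVE_CREDENTIALS_DATA: ${{ secrets.GDRIVE_CREDENTIALS_DATA }}
        run: |
          pip install "dvc[gdrive]"
          dvc pull
```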

  • Container registry
  • Basics of S3
  • Programmatic access to S3
  • Configuring AWS S3 as remote storage in DVC
  • Basics of ECR
  • Differences between AWS S3 and AWS ECR
  • Configuring GitHub Actions to use S3 and ECR
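Programmatic S3 access in a sketch (bucket and key names are placeholders):

```python
import boto3

s3 = boto3.client("s3")  # credentials come from the environment / IAM role

# Upload the trained model and pull it back down.
s3.upload_file("models/model.onnx", "my-model-bucket", "models/model.onnx")
s3.download_file("my-model-bucket", "models/model.onnx", "/tmp/model.onnx")
```

For DVC, pointing the remote at the same bucket is one command: `dvc remote add -d storage s3://my-model-bucket`. ECR, by contrast, stores Docker images rather than files; a CI job logs in with `aws ecr get-login-password`, tags the image, and pushes it.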

  • Basics of Serverless
  • Basics of AWS Lambda
  • Triggering Lambda with API Gateway
  • Deploying Container using Lambda
  • Automating deployment to Lambda using GitHub Actions
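A sketch of the Lambda entry point behind an API Gateway proxy integration; `predict_label` again stands in for the real ONNX inference call:

```python
import json


def predict_label(text: str) -> dict:
    # Placeholder for the real tokenise-and-run-ONNX call.
    return {"text": text, "label": "acceptable", "score": 0.97}


def lambda_handler(event, context):
    """Invoked by API Gateway; `event` carries the HTTP request details."""
    params = event.get("queryStringParameters") or {}
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(predict_label(params.get("text", ""))),
    }
```

When the Lambda is deployed as a container image, `lambda_handler` is what the image's entry point invokes, and a GitHub Actions job can rebuild and push that image to ECR on every merge.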

Step #9: Prediction Monitoring - Kibana

Refer to the Blog Post here

Monitoring systems can help give us confidence that our systems are running smoothly and, in the event of a system failure, can quickly provide appropriate context when diagnosing the root cause.

The things we want to monitor during training and during inference are different. During training, we are concerned with whether the loss is decreasing, whether the model is overfitting, and so on.

During inference, we want confidence that our model is making correct predictions.

There are many reasons why a model can fail to make useful predictions:

  • The underlying data distribution has shifted over time and the model has gone stale, i.e. the characteristics of the inference data differ from those of the data used to train the model.

  • The inference data stream contains edge cases not seen during model training. In these scenarios, the model might perform poorly or even raise errors.

  • The model was misconfigured in its production deployment (configuration issues are common).

In all of these scenarios, the model could still make a successful prediction from a service perspective, but the predictions will likely not be useful. Monitoring machine learning models can help us detect such scenarios and intervene (e.g. trigger a model retraining/deployment pipeline).
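Catching these scenarios starts with logging every prediction in a structured form. A sketch (field names are illustrative): anything a Lambda function writes to stdout ends up in CloudWatch Logs, from where it can be streamed to Elasticsearch and explored in Kibana:

```python
import json
import logging

logger = logging.getLogger()
logger.setLevel(logging.INFO)


def log_prediction(text: str, label: str, score: float) -> None:
    # Inside Lambda this line lands in CloudWatch Logs; emitting JSON makes
    # it straightforward to build index patterns and charts in Kibana.
    logger.info(json.dumps({
        "event": "prediction",
        "input_length": len(text),
        "label": label,
        "score": score,
    }))
```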

In this step, I will be going through the following topics:

  • Basics of CloudWatch Logs

  • Creating an Elasticsearch Cluster

  • Configuring CloudWatch Logs with Elasticsearch

  • Creating Index Patterns in Kibana

  • Creating Kibana Visualisations

  • Creating a Kibana Dashboard

Docker

References

