Skip to content

This project is intended to provide an example of MLOps architecture.

License

Notifications You must be signed in to change notification settings

esalvucci/thesis-mlops-reference-architecture

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Thesis MLOps reference architecture

License: GPL v3 GitHub issues

About

This project is intended to provide an example of MLOps architecture. It uses the code of a Kaggle Notebook as use case example. The original code has been edited such to adapt it for the example in this project.

You can find the documentation about how each technology is used in the doc folder

In this project is used preferably Free Software (except for Google Cloud Build and Google Cloud Functions).

Technologies

Use the following links to read the detailed documentation about how each technology is used in this project.

  • MLFlow - Tracks the experiments log, the model versions and to store them in a Model Registry
  • Kubeflow - Orchestrates the ML workflow
  • BentoML - Used as serving framework
  • Google Cloud Platform

Advantages of using MLOps

As MLOps can really improve your ML lifecycle not all the possible benefits are met in this project and highlighted here.

This project shows the following advantages and challenges you can cope by using MLOps. Each item of the list is followed by the name (or the logo) of the technologies that address that challenge.

  • Kubeflow Approach to ML as a process instead of only a product
  • Kubeflow Reproduce the whole pipeline
  • Kubeflow MLFlow Reproduce the model building
  • Kubeflow Automate the whole workflow
  • Kubeflow Google Cloud Functions Google Cloud Build Auto retrain
  • Kubeflow Validate the model and the data (as steps of the pipeline)
  • Kubeflow Google Cloud Functions Data Drift
  • MLFlow Increase collaboration between teams
  • MLFlow Track the parameters used for the model training, the metrics and the model itself
  • MLFlow Version your model
  • CI/CD + CT
    • Kubeflow Google Cloud Build Continuous Integration
    • Kubeflow Google Cloud Functions Continuous Training

Architecture

Project Architecture

Licence

This project is licensed under the GPLv3 Licence - see the LICENSE file for details. Any comment, feedback or suggestion will be appreciated