Open Source MLOps

This is the Fuzzy Labs guide to the universe of free and open source MLOps tools.

What is MLOps anyway?

MLOps (machine learning operations) is a discipline that helps people to train, deploy and run machine learning models successfully in production environments. Because this is a new and rapidly-evolving field, there are a lot of tools out there, and new ones appear all the time. If we've missed any, then please do raise a pull request!

Data version control

Just like code, data grows and evolves over time. Data versioning tools help you to keep track of these changes.

You might wonder why you can't just store data in Git (or equivalent). There are a few reasons this doesn't work, but the main one is size: Git is designed for small text files, and typical datasets used in machine learning are just too big. Some tools, like DVC, store the data externally, but also integrate with Git so that data versions can be linked to code versions.

DVC - one of the most popular general-purpose data versioning tools.
Delta Lake - data versioning for data warehouses.
LakeFS - Transform your object storage into a Git-like repository.
Git LFS - while this doesn't specialise in machine learning use-cases, it's another popular way to version datasets.

Experiment tracking

Machine learning involves a lot of experimentation. We end up training a lot of models, most of which are never intended to go into production, but represent progressive steps towards having something production-worthy. Experiment tracking tools are there to help us keep track of each experiment. What exactly do we need to track? typically this includes the code version, data version, input parameters, training performance metrics, as well as the final model assets.

Model training

Feature stores

Feast

Model deployment and serving

Model serving is the process of taking a trained model and presenting it behind a REST API, and this enables other software components to interact with a model. To make deployment of these model servers as simple as possible, it's commonplace to run them inside Docker containers and deploy them to a container orchestration system such as Kubernetes.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Open Source MLOps

Contents

What is MLOps anyway?

Data version control

Experiment tracking

Model training

Feature stores

Model deployment and serving

Model monitoring

Full stacks

More resources

About

Releases

Packages

License

elenasamuylova/awesome-open-mlops

Folders and files

Latest commit

History

Repository files navigation

Open Source MLOps

Contents

What is MLOps anyway?

Data version control

Experiment tracking

Model training

Feature stores

Model deployment and serving

Model monitoring

Full stacks

More resources

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages