Comparison of ML Life Cycle Management (Experiment Tracking, Model Management, etc.): MLflow, DVC, Pachyderm, Sacred, Polyaxon, Allegro Trains, VertaAI ModelDB, Kubeflow Katib, Guild AI, Kubeflow Metadata, Weights & Biases, Neptune.ai, Valohai, Comet

Minyus/Tools_for_ML_Lifecycle_Management

Tools_for_ML_Lifecycle_Management

Comparison of tools for ML experiment tracking & model management for companies

Free tools:

  • MLflow
  • DVC
  • Pachyderm (Community)
  • Sacred
  • Polyaxon (Community)
  • Allegro Trains
  • VertaAI ModelDB
  • Kubeflow Katib
  • Guild AI
  • Kubeflow Metadata

Paid tools:

  • MLflow (Managed)
  • Pachyderm (Enterprise / Hub)
  • Polyaxon (Enterprise / Cloud)
  • Weights & Biases (Enterprise)
  • Verta (Enterprise, SaaS)
  • Neptune.ai (Enterprise)
  • Valohai (Enterprise, Pro)
  • Comet (Enterprise)

Notes:

  • Editions limited to a single user, which might be useful for Kaggle competitions and academic research, are excluded.
  • ? means unclear/uncertain.
  • Tools are listed in descending order of the number of stars on their GitHub repositories.

 

Free tools

| Tool | Monthly Fee | On-prem supported | GitHub stars (rounded to 100s) | URL |
|---|---|---|---|---|
| MLflow | Free | Yes | 8000 | https://github.com/mlflow/mlflow |
| DVC | Free | Yes | 6900 | https://github.com/iterative/dvc |
| Pachyderm (Community) | Free | Yes | 4800 | https://github.com/pachyderm/pachyderm |
| Sacred | Free | Yes | 3200 | https://github.com/IDSIA/sacred |
| Polyaxon (Community) | Free | Yes | 2600 | https://github.com/polyaxon/polyaxon |
| Allegro Trains | Free | Yes | 2000 | https://github.com/allegroai/trains |
| VertaAI ModelDB | Free | Yes | 1100 | https://github.com/VertaAI/modeldb |
| Kubeflow Katib | Free | Yes | 900 | https://github.com/kubeflow/katib |
| Guild AI | Free | Yes | 500 | https://github.com/guildai/guildai |
| Kubeflow Metadata | Free | Yes | 100 | https://github.com/kubeflow/metadata |

| Tool | Backend Metadata Database | Backend Artifact Storage | K8s | Server Source Code | Client Source Code |
|---|---|---|---|---|---|
| MLflow | SQL DB (PostgreSQL / MySQL / MS SQL / SQLite) | Object (S3, etc.), File System, HDFS, FTP, SFTP | Optional | Open (JavaScript) | Open (Python) |
| DVC | Git Repository | Object (S3, etc.), File System | Optional | No server | Open (Python) |
| Pachyderm (Community) | etcd | Object (S3) | Required | Open (Go) | Open (Python) |
| Sacred | MongoDB | MongoDB | Optional | Open (JavaScript) | Open (Python) |
| Polyaxon (Community) | PostgreSQL | Object (S3, etc.), File System | Required | Open (Python, Java, Go) | Open (Python) |
| Allegro Trains | Elasticsearch & MongoDB | Object (S3, etc.), File System | Optional | Open (Python) | Open (Python) |
| VertaAI ModelDB | PostgreSQL | Object (S3, etc.), File System | Optional | Open (Java) | Open (Python) |
| Kubeflow Katib | MySQL | - | Required | Open (Go, JavaScript) | Open (Python) |
| Guild AI | File System | File System | Optional | No server | Open (Python) |
| Kubeflow Metadata | MySQL / SQLite | Object (GCS), File System | Optional | Open (JavaScript, TypeScript, Go, C++) | Open (Python) |
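
As an illustration of the MLflow row above, a self-hosted tracking server is usually pointed at a SQL metadata database and an artifact store with two command-line flags. The sketch below only assembles the command and does not run it. The database host, credentials, and bucket name are placeholders (assumptions, not values from this comparison), and `mlflow_server_command` is a hypothetical helper.

```python
import shlex


def mlflow_server_command(backend_store_uri, artifact_root,
                          host="127.0.0.1", port=5000):
    """Assemble (but do not execute) an `mlflow server` command line."""
    return [
        "mlflow", "server",
        "--backend-store-uri", backend_store_uri,   # metadata database
        "--default-artifact-root", artifact_root,   # artifact storage
        "--host", host,
        "--port", str(port),
    ]


# Placeholder connection details; replace with real ones before use.
cmd = mlflow_server_command(
    backend_store_uri="postgresql://user:password@db.example.com:5432/mlflow",
    artifact_root="s3://example-bucket/mlflow-artifacts",
)
print(shlex.join(cmd))
```

The printed line can be pasted into a shell, or the list can be passed to `subprocess.run` once the placeholders are filled in.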

| Tool | Experiment Tracking | Artifact Repository | Model Registry | Integrated auto-logging | Hyper-parameter Tuning | Distributed Hyper-parameter Tuning |
|---|---|---|---|---|---|---|
| MLflow | Yes | Yes | Yes | Yes | No | No |
| DVC | No | Yes | No | No | No | No |
| Pachyderm (Community) | No | Yes | No | No | No | No |
| Sacred | Yes | Yes | No | No | No | No |
| Polyaxon (Community) | Yes | Yes | No | No | Yes | No |
| Allegro Trains | Yes | Yes | No | Limited | Yes | No |
| VertaAI ModelDB | Yes | Yes | Yes | No | No | No |
| Kubeflow Katib | No | No | No | No | Yes | Yes |
| Guild AI | Yes | No | No | No | Yes | No |
| Kubeflow Metadata | Yes | Yes | No | No | No | No |
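
The feature matrix above can also be queried programmatically when shortlisting tools. The following is a minimal sketch that copies the free-tools rows into a dictionary and filters by required features. The feature key names and the `tools_with` helper are made up for this example, and "Limited" auto-logging (Allegro Trains) is treated as "No".

```python
# Column order of the feature table (key names are illustrative).
FEATURES = ("experiment_tracking", "artifact_repository", "model_registry",
            "auto_logging", "hp_tuning", "distributed_hp_tuning")

# True = "Yes", False = "No"; "Limited" is treated as False in this sketch.
FREE_TOOLS = {
    "MLflow":                (True,  True,  True,  True,  False, False),
    "DVC":                   (False, True,  False, False, False, False),
    "Pachyderm (Community)": (False, True,  False, False, False, False),
    "Sacred":                (True,  True,  False, False, False, False),
    "Polyaxon (Community)":  (True,  True,  False, False, True,  False),
    "Allegro Trains":        (True,  True,  False, False, True,  False),
    "VertaAI ModelDB":       (True,  True,  True,  False, False, False),
    "Kubeflow Katib":        (False, False, False, False, True,  True),
    "Guild AI":              (True,  False, False, False, True,  False),
    "Kubeflow Metadata":     (True,  True,  False, False, False, False),
}


def tools_with(*required):
    """Return the tools (in table order) that support every required feature."""
    columns = [FEATURES.index(name) for name in required]
    return [tool for tool, flags in FREE_TOOLS.items()
            if all(flags[c] for c in columns)]


print(tools_with("experiment_tracking", "model_registry"))
# ['MLflow', 'VertaAI ModelDB']
print(tools_with("hp_tuning"))
# ['Polyaxon (Community)', 'Allegro Trains', 'Kubeflow Katib', 'Guild AI']
```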

Integrated auto-logging

  • MLflow: PyTorch Lightning, TensorFlow/Keras, Gluon, scikit-learn, LightGBM, XGBoost, Spark 3+
  • Allegro Trains: PyTorch Lightning, PyTorch Ignite, Keras Tuner, AutoKeras
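
"Integrated auto-logging" here means the tracking tool hooks into a framework's training call so that parameters and metrics are recorded without explicit logging statements (MLflow, for example, exposes this through `mlflow.autolog()`). The toy sketch below shows the general idea using only the standard library. `ToyModel` and this `autolog` function are hypothetical stand-ins and not the API of any tool listed above.

```python
import functools


class ToyModel:
    """Stand-in for a framework estimator; not a real library class."""

    def __init__(self, learning_rate=0.1, epochs=3):
        self.learning_rate = learning_rate
        self.epochs = epochs

    def fit(self, data):
        # Pretend "training": report the mean of the data as a score.
        return {"score": sum(data) / len(data)}


def autolog(cls, run_log):
    """Patch cls.fit so every call appends its params and metrics to run_log."""
    original_fit = cls.fit

    @functools.wraps(original_fit)
    def patched_fit(self, *args, **kwargs):
        metrics = original_fit(self, *args, **kwargs)
        run_log.append({"params": dict(vars(self)), "metrics": dict(metrics)})
        return metrics

    cls.fit = patched_fit


runs = []
autolog(ToyModel, runs)                      # one call enables logging
ToyModel(learning_rate=0.01).fit([1, 2, 3])  # no explicit logging code here
print(runs)
# [{'params': {'learning_rate': 0.01, 'epochs': 3}, 'metrics': {'score': 2.0}}]
```

Real integrations follow the same pattern at a larger scale, which is why support is listed per framework.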

| Tool | Web UI | REST API | CLI | Python API | Bypass server by CLI / Python API | Native Access Control |
|---|---|---|---|---|---|---|
| MLflow | Yes | HTTP | Yes | Yes | Yes | No |
| DVC | No (GitLab / GitHub only) | No server | Yes | Yes | Yes | No |
| Pachyderm (Community) | No | ProtoBuf | Yes | Yes | No? | No |
| Sacred | Yes (Omniboard) | No | Yes | Yes | No? | No |
| Polyaxon (Community) | Yes | HTTP | Yes | Yes | No? | No |
| Allegro Trains | Yes | No? | No? | Yes | No? | No |
| VertaAI ModelDB | Yes | No | No? | Yes | No | No |
| Kubeflow Katib | Yes (Kubeflow UI) | gRPC | Yes | Yes | No | No |
| Guild AI | Limited (TensorBoard only) | No server | Yes | ? | Yes | No |
| Kubeflow Metadata | Limited (Kubeflow UI) | gRPC | Yes | Yes | No | No |
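
To make the "HTTP" entry in the REST API column concrete, the sketch below builds a request against MLflow's `runs/create` REST endpoint using only the standard library. It constructs the request without sending it. The tracking server address `http://localhost:5000` and experiment ID `"0"` are assumptions for illustration, and `build_create_run_request` is a hypothetical helper.

```python
import json
import time
import urllib.request


def build_create_run_request(tracking_uri, experiment_id):
    """Build (but do not send) a POST request that would create an MLflow run."""
    payload = {
        "experiment_id": experiment_id,
        "start_time": int(time.time() * 1000),  # milliseconds since epoch
    }
    return urllib.request.Request(
        url=f"{tracking_uri.rstrip('/')}/api/2.0/mlflow/runs/create",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Assumed local tracking server and default experiment.
request = build_create_run_request("http://localhost:5000", experiment_id="0")
print(request.get_method(), request.full_url)
# POST http://localhost:5000/api/2.0/mlflow/runs/create
```

With a tracking server running at that address, the request could be sent with `urllib.request.urlopen(request)`.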

 

Paid tools

| Tool | Monthly Fee | On-prem supported | GitHub stars (rounded to 100s) | URL |
|---|---|---|---|---|
| MLflow (Managed) | Paid | No | 8000 | https://databricks.com/product/managed-mlflow |
| Pachyderm (Enterprise / Hub) | Paid | Yes (Enterprise) | 4800 | https://www.pachyderm.com/platform/#enterprise |
| Polyaxon (Enterprise / Cloud) | Paid | Yes (Enterprise) | 2600 | https://polyaxon.com/polyaxon-ee/ |
| Weights & Biases (Enterprise) | Paid | Yes | 2500 | https://docs.wandb.com/self-hosted |
| Verta (Enterprise, SaaS) | Paid | Yes (Enterprise) | 1100 | https://www.verta.ai/pricing |
| Neptune.ai (Enterprise) | Paid | Yes | 100 | https://github.com/neptune-ai/neptune-client |
| Valohai (Enterprise, Pro) | Paid | Yes (Enterprise) | 0 | https://github.com/valohai/valohai-cli |
| Comet (Enterprise) | Paid | | - | https://www.comet.ml/site/pricing/ |

| Tool | Backend Metadata Database | Backend Artifact Storage | K8s | Server Source Code | Client Source Code |
|---|---|---|---|---|---|
| MLflow (Managed) | SQL DB (PostgreSQL / MySQL / MS SQL / SQLite) | Object (S3, etc.), File System, HDFS, FTP, SFTP | Optional | Not Open | Open (Python) |
| Pachyderm (Enterprise / Hub) | etcd | Object (S3) | Required | Not Open | Not Open |
| Polyaxon (Enterprise / Cloud) | PostgreSQL | Object (S3, etc.), File System | Required | Not Open | No |
| Weights & Biases (Enterprise) | MySQL | Object (S3, etc.) | ? | Not Open | Open (Python) |
| Verta (Enterprise, SaaS) | PostgreSQL | Object (S3, etc.), File System | Optional | Not Open | Open |
| Neptune.ai (Enterprise) | ? | ? | ? | Not Open | Open (Python) |
| Valohai (Enterprise, Pro) | ? | ? | ? | Not Open | Open (Python) |
| Comet (Enterprise) | ? | ? | Optional | Not Open | No |

| Tool | Experiment Tracking | Artifact Repository | Model Registry | Integrated auto-logging | Hyper-parameter Tuning | Distributed Hyper-parameter Tuning |
|---|---|---|---|---|---|---|
| MLflow (Managed) | Yes | Yes | Yes | Yes | No | No |
| Pachyderm (Enterprise / Hub) | No | Yes | No | No | No | No |
| Polyaxon (Enterprise / Cloud) | Yes | Yes | Yes? | No | Yes | Yes |
| Weights & Biases (Enterprise) | Yes | Yes | No | Yes | Yes | Yes |
| Verta (Enterprise, SaaS) | Yes | Yes | Yes | No | No | No |
| Neptune.ai (Enterprise) | Yes | Yes | No? | Yes | No | No |
| Valohai (Enterprise, Pro) | Yes | Yes | Yes? | No? | No? | No? |
| Comet (Enterprise) | Yes | Yes | Yes | Limited | Yes | No? |

Integrated auto-logging

  • MLflow: PyTorch Lightning, TensorFlow/Keras, Gluon, scikit-learn, LightGBM, XGBoost, Spark 3+
  • Weights & Biases: PyTorch Lightning, PyTorch Ignite, Fast.ai, Skorch, TensorFlow, Keras, scikit-learn, LightGBM, XGBoost, Ray Tune, Catalyst, Kubeflow Arena, Kubeflow Pipelines
  • Neptune.ai: PyTorch Lightning, PyTorch Ignite, Fast.ai, Catalyst, LightGBM, XGBoost, Optuna, Skopt
  • Comet: PyTorch Lightning

| Tool | Web UI | REST API | CLI | Python API | Bypass server by CLI / Python API | Native Access Control |
|---|---|---|---|---|---|---|
| MLflow (Managed) | Yes | HTTP | Yes | Yes | No? | Yes |
| Pachyderm (Enterprise / Hub) | Yes | ProtoBuf | Yes | Yes | No? | Yes |
| Polyaxon (Enterprise / Cloud) | Yes | HTTP | Yes | Yes | No | Yes |
| Weights & Biases (Enterprise) | Yes | No? | Yes | Yes | No? | ? |
| Verta (Enterprise, SaaS) | Yes | No | No? | Yes | No | No |
| Neptune.ai (Enterprise) | Yes | No? | ? | Yes | No? | ? |
| Valohai (Enterprise, Pro) | Yes | No? | No? | Yes | No? | ? |
| Comet (Enterprise) | Yes | HTTP | Yes | Yes | No? | Yes |

 

MLflow

MLflow Architecture

[Figure: mlflow_architecture]

MLflow Experiment Tracking

[Figure: mlflow_experiment_tracking]

MLflow Model Registry

[Figure: mlflow_model_registry]

Polyaxon Architecture

[Figure: polyaxon_architecture]

Allegro Trains Architecture

[Figure: trains_architecture]

VertaAI ModelDB Architecture

[Figure: modeldb_architecture]

Google ML Metadata (Core of Kubeflow Metadata) Architecture

[Figure: ml_metadata_architecture]
