Comparison of tools for ML experiment tracking & model management for companies
Free tools:
- MLflow
- DVC
- Pachyderm (Community)
- Sacred
- Polyaxon (Community)
- Allegro Trains
- VertaAI ModelDB
- Kubeflow Katib
- Guild AI
- Kubeflow Metadata
Paid tools:
- MLflow (Managed)
- Pachyderm (Enterprise / Hub)
- Polyaxon (Enterprise / Cloud)
- Weights & Biases (Enterprise)
- Verta (Enterprise, SaaS)
- Neptune.ai (Enterprise)
- Valohai (Enterprise, Pro)
- Comet (Enterprise)
Notes:
- Editions limited to a single user, which might be useful for Kaggle competitions and academic research, are excluded.
?
means unclear/uncertain.- Tools are ordered by the number of stars in the GitHub repository.
Tool | Monthly Fee | On-prem supported | # Stars on GitHub repository (100s) | URL |
---|---|---|---|---|
MLflow | Free | Yes | 8000 | https://github.com/mlflow/mlflow |
DVC | Free | Yes | 6900 | https://github.com/iterative/dvc |
Pachyderm (Community) | Free | Yes | 4800 | https://github.com/pachyderm/pachyderm |
Sacred | Free | Yes | 3200 | https://github.com/IDSIA/sacred |
Polyaxon (Community) | Free | Yes | 2600 | https://github.com/polyaxon/polyaxon |
Allegro Trains | Free | Yes | 2000 | https://github.com/allegroai/trains |
VertaAI ModelDB | Free | Yes | 1100 | https://github.com/VertaAI/modeldb |
Kubeflow Katib | Free | Yes | 900 | https://github.com/kubeflow/katib |
Guild AI | Free | Yes | 500 | https://github.com/guildai/guildai |
Kubeflow Metadata | Free | Yes | 100 | https://github.com/kubeflow/metadata |
Tool | Backend Metadata Database | Backend Artifact Storage | K8s | Server Source Code | Client Source Code |
---|---|---|---|---|---|
MLflow | SQL DB (PostgreSQL / MySQL / MS SQL / SQLite) | Object (S3, etc), File System, HDFS, FTP, SFTP | Optional | Open (JavaScript) | Open (Python) |
DVC | Git Repository | Object (S3, etc), File System | Optional | No server | Open (Python) |
Pachyderm (Community) | etcd | Object (S3) | Required | Open (Go) | Open (Python) |
Sacred | MongoDB | MongoDB | Optional | Open (JavaScript) | Open (Python) |
Polyaxon (Community) | PostgreSQL | Object (S3, etc), File System | Required | Open (Python, Java, Go) | Open (Python) |
Allegro Trains | Elasticsearch & MongoDB | Object (S3, etc), File System | Optional | Open (Python) | Open (Python) |
VertaAI ModelDB | PostgreSQL | Object (S3, etc), File System | Optional | Open (Java) | Open (Python) |
Kubeflow Katib | MySQL | - | Required | Open (Go, JavaScript) | Open (Python) |
Guild AI | File System | File System | Optional | No server | Open (Python) |
Kubeflow Metadata | MySQL / SQLite | Object (GCS), File System | Optional | Open (JavaScript, TypeScript, Go, C++) | Open (Python) |
Tool | Experiment Tracking | Artifact Repository | Model Registry | Integrated auto-logging | Hyper-parameter Tuning | Distributed Hyper-parameter Tuning |
---|---|---|---|---|---|---|
MLflow | Yes | Yes | Yes | Yes | No | No |
DVC | No | Yes | No | No | No | No |
Pachyderm (Community) | No | Yes | No | No | No | No |
Sacred | Yes | Yes | No | No | No | No |
Polyaxon (Community) | Yes | Yes | No | No | Yes | No |
Allegro Trains | Yes | Yes | No | Limited | Yes | No |
VertaAI ModelDB | Yes | Yes | Yes | No | No | No |
Kubeflow Katib | No | No | No | No | Yes | Yes |
Guild AI | Yes | No | No | No | Yes | No |
Kubeflow Metadata | Yes | Yes | No | No | No | No |
Integrated auto-logging
- MLflow: PyTorch Lightning, Tensorflow/Keras, Gluon, Scikit-learn, LightGBM, XGboost, Spark 3+
- Allegro Trains: PyTorch Lightning, PyTorch Ignite, Keras Tuner, AutoKeras
Tool | Web UI | REST API | CLI | Python API | Bypass server by CLI / Python API | Native Access Control |
---|---|---|---|---|---|---|
MLflow | Yes | HTTP | Yes | Yes | Yes | No |
DVC | No (GitLab / GitHub only) | No server | Yes | Yes | Yes | No |
Pachyderm (Community) | No | ProtoBuf | Yes | Yes | No? | No |
Sacred | Yes (Omniboard) | No | Yes | Yes | No? | No |
Polyaxon (Community) | Yes | HTTP | Yes | Yes | No? | No |
Allegro Trains | Yes | No? | No? | Yes | No? | No |
VertaAI ModelDB | Yes | No | No? | Yes | No | No |
Kubeflow Katib | Yes (Kubeflow UI) | gRPC | Yes | Yes | No | No |
Guild AI | Limited (TensorBoard only) | No server | Yes | ? | Yes | No |
Kubeflow Metadata | Limited (Kubeflow UI) | gRPC | Yes | Yes | No | No |
Tool | Monthly Fee | On-prem supported | # Stars on GitHub repository (100s) | URL |
---|---|---|---|---|
MLflow (Managed) | Paid | No | 8000 | https://databricks.com/product/managed-mlflow |
Pachyderm (Enterprise / Hub) | Paid | Yes (Enterprise) | 4800 | https://www.pachyderm.com/platform/#enterprise |
Polyaxon (Enterprise / Cloud) | Paid | Yes (Enterprise) | 2600 | https://polyaxon.com/polyaxon-ee/ |
Weights & Biases (Enterprise) | Paid | Yes | 2500 | https://docs.wandb.com/self-hosted |
Verta (Enterprise, SaaS) | Paid | Yes (Enterprise) | 1100 | https://www.verta.ai/pricing |
Neptune.ai (Enterprise) | Paid | Yes | 100 | https://github.com/neptune-ai/neptune-client |
Valohai (Enterprise, Pro) | Paid | Yes (Enterprise) | 0 | https://github.com/valohai/valohai-cli |
Comet (Enterprise) | Paid | - | https://www.comet.ml/site/pricing/ |
Tool | Backend Metadata Database | Backend Artifact Storage | K8s | Server Source Code | Client Source Code |
---|---|---|---|---|---|
MLflow (Managed) | SQL DB (PostgreSQL / MySQL / MS SQL / SQLite) | Object (S3, etc), File System, HDFS, FTP, SFTP | Optional | Not Open | Open (Python) |
Pachyderm (Enterprise / Hub) | etcd | Object (S3) | Required | Not Open | Not Open |
Polyaxon (Enterprise / Cloud) | PostgreSQL | Object (S3, etc), File System | Required | Not Open | No |
Weights & Biases (Enterprise) | MySQL | Object (S3, etc.) | ? | Not Open | Open (Python) |
Verta (Enterprise, SaaS) | PostgreSQL | Object (S3, etc), File System | Optional | Not Open | Open |
Neptune.ai (Enterprise) | ? | ? | ? | Not Open | Open (Python) |
Valohai (Enterprise, Pro) | ? | ? | ? | Not Open | Open (Python) |
Comet (Enterprise) | ? | ? | Optional | Not Open | No |
Tool | Experiment Tracking | Artifact Repository | Model Registry | Integrated auto-logging | Hyper-parameter Tuning | Distributed Hyper-parameter Tuning |
---|---|---|---|---|---|---|
MLflow (Managed) | Yes | Yes | Yes | Yes | No | No |
Pachyderm (Enterprise / Hub) | No | Yes | No | No | No | No |
Polyaxon (Enterprise / Cloud) | Yes | Yes | Yes? | No | Yes | Yes |
Weights & Biases (Enterprise) | Yes | Yes | No | Yes | Yes | Yes |
Verta (Enterprise, SaaS) | Yes | Yes | Yes | No | No | No |
Neptune.ai (Enterprise) | Yes | Yes | No? | Yes | No | No |
Valohai (Enterprise, Pro) | Yes | Yes | Yes? | No? | No? | No? |
Comet (Enterprise) | Yes | Yes | Yes | Limited | Yes | No? |
Integrated auto-logging
- MLflow: PyTorch Lightning, Tensorflow/Keras, Gluon, Scikit-learn, LightGBM, XGboost, Spark 3+
- Weights & Biases: PyTorch Lightning, PyTorch Ignite, Fast.ai, Skorch, Tensorflow, Keras, Scikit-learn, LightGBM, XGBoost, Ray Tune, Catalyst, Kubeflow Arena, Kubeflow Pipelines
- Neptune.ai: PyTorch Lightning, PyTorch Ignite, Fast.ai, Catalyst, LightGBM, XGBoost, Optuna, Skopt
- Comet: PyTorch Lightning
Tool | Web UI | REST API | CLI | Python API | Bypass server by CLI / Python API | Native Access Control |
---|---|---|---|---|---|---|
MLflow (Managed) | Yes | HTTP | Yes | Yes | No? | Yes |
Pachyderm (Enterprise / Hub) | Yes | ProtoBuf | Yes | Yes | No? | Yes |
Polyaxon (Enterprise / Cloud) | Yes | HTTP | Yes | Yes | No | Yes |
Weights & Biases (Enterprise) | Yes | No? | Yes | Yes | No? | ? |
Verta (Enterprise, SaaS) | Yes | No | No? | Yes | No | No |
Neptune.ai (Enterprise) | Yes | No? | ? | Yes | No? | ? |
Valohai (Enterprise, Pro) | Yes | No? | No? | Yes | No? | ? |
Comet (Enterprise) | Yes | HTTP | Yes | Yes | No? | Yes |