Model creation must be
- scalable
- collaborative
- reproduceble
Set of rules that ensure a high quality of data to train models
- Versioning
- Model Tracking
- Feature Generation
-
ML (Experiment)
- Data Acquisition
- Business Undertanding
- Initial Modeling
-
Develop
- Modeling + Testing
- Continuous Integration
- Continuous Deployment
-
Operate
- Continuous Delivery
- Data Feedback Loop
- System + Model Monitoring
- Continuous Training
- Business Understanding
- Use Case Identification
- Data Understanding
- Feasibility Study
- Data Preparation
- Learning Algorithms
- Model Building/Training
- Model Experimentation
- Model Evaluation
- Model Serving
- Deploy
- Automate
- Operate
- Monitor
- Optimize
MLOps Parts CRISP-ML
Part | Objective | Software |
---|---|---|
Feature Store | ||
Data Versioning | ||
Metadata Store | ||
Model Versioning | ||
Model Registration | ||
Model Serving | ||
Model Monitoring | ||
Recycling of models | ||
CI/CD |
Stage | Definition |
---|---|
Stage 1 | Model and Data Version Control |
Stage 2 | AutoML + Model and Data Version Control |
Stage 3 | AutoML + Model and Data Version Control + Model Serving |
Stage 4 | AutoML + Model and Data Version Control + Model Serving + Monitoring, Governance and Retraining (Fig 2) |
- Libraries
- Jupyter Notebook
- Docker
Jupyter Notebook environment (Conda)
$ conda install -n [env] ipykernel
$ python -m ipykernel install --user --name [env] --display-name "Python (mlops)"