# Machine Learning Operations

### Framework for Building Machine Learning Models

- The generic MLOps workflow brings together data engineering, DevOps, and machine learning.
- It is generally composed of two parts: the MLOps pipeline and its drivers.

#### *MLOps Pipeline*

- The MLOps pipeline performs operations including building, deploying and monitoring models.
- All models trained, deployed, and monitored using the MLOps method are end-to-end traceable. Their lineage is logged so that the origins of each model can be traced, including the source code used to train it, the data used to train and test it, and the parameters used to converge it.
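As a minimal sketch of what a logged lineage entry might contain, the example below records a code version, fingerprints of the training and test data, and the training parameters. The names `LineageRecord`, `fingerprint`, and `build_lineage` are hypothetical and chosen only for illustration; a real setup would typically rely on an experiment-tracking or model-registry service.

```python
import hashlib
import json
from dataclasses import asdict, dataclass, field


def fingerprint(payload: bytes) -> str:
    """Return a SHA-256 hex digest identifying the exact bytes used."""
    return hashlib.sha256(payload).hexdigest()


@dataclass
class LineageRecord:
    """Hypothetical lineage entry: code, data, and parameters behind a model."""
    model_name: str
    code_version: str       # assumed to be something like a commit hash
    train_data_hash: str
    test_data_hash: str
    parameters: dict = field(default_factory=dict)

    def to_json(self) -> str:
        # Sorted keys keep the serialized record stable across runs.
        return json.dumps(asdict(self), sort_keys=True)


def build_lineage(model_name, code_version, train_bytes, test_bytes, parameters):
    """Assemble a lineage record from the raw inputs of a training run."""
    return LineageRecord(
        model_name=model_name,
        code_version=code_version,
        train_data_hash=fingerprint(train_bytes),
        test_data_hash=fingerprint(test_bytes),
        parameters=dict(parameters),
    )
```

Hashing the data rather than storing it keeps the record small while still making it possible to check later whether a given dataset is the one the model was trained on.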

#### *Drivers*

- The key drivers for the MLOps pipeline include data, code, artifacts, middleware and infrastructure.

**Data**
- In ML applications, data is managed through these steps: data acquisition, data annotation, data cataloging, data preparation, data quality checking, data sampling, and data augmentation.
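The last four of these steps (preparation, quality checking, sampling, and augmentation) can be sketched as small functions chained in order. This is a toy illustration on a list of dictionaries; the row layout (`text`, `label`) and all function names are assumptions, and the augmentation rule (reversing word order) is only a placeholder for a real technique.

```python
import random


def prepare(rows):
    """Data preparation: trim whitespace and lowercase the text field."""
    return [{**row, "text": row["text"].strip().lower()} for row in rows]


def quality_check(rows):
    """Data quality checking: drop rows with empty text or a missing label."""
    return [row for row in rows if row["text"] and row.get("label") is not None]


def sample(rows, k, seed=0):
    """Data sampling: pick up to k rows reproducibly using a fixed seed."""
    if k >= len(rows):
        return list(rows)
    return random.Random(seed).sample(rows, k)


def augment(rows):
    """Data augmentation (toy): add a word-reversed copy of multi-word rows."""
    augmented = list(rows)
    for row in rows:
        words = row["text"].split()
        if len(words) > 1:
            augmented.append({**row, "text": " ".join(reversed(words))})
    return augmented


def run_data_steps(rows, k):
    """Apply the four steps in the order listed above."""
    return augment(sample(quality_check(prepare(rows)), k))
```

Keeping each step a separate function makes it straightforward to test the steps individually and to run them from a data pipeline.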

**Code**
- Three essential modules of code drive the MLOps pipeline: training code, testing code, and application code.
- These scripts are executed through the CI/CD and data pipelines to keep the MLOps pipeline working robustly.
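To make the three modules concrete, here is a minimal sketch in which each module is a single function: `train` fits a one-variable linear model by least squares, `evaluate` scores it on held-out data, and `predict` is what an application would call. The function names and the choice of model are assumptions made for illustration, not part of any particular MLOps framework.

```python
def train(xs, ys):
    """Training code: fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = cov_xy / var_x
    return {"slope": slope, "intercept": mean_y - slope * mean_x}


def evaluate(model, xs, ys):
    """Testing code: mean squared error of the model on held-out data."""
    errors = [(predict(model, x) - y) ** 2 for x, y in zip(xs, ys)]
    return sum(errors) / len(errors)


def predict(model, x):
    """Application code: serve a single prediction from a trained model."""
    return model["slope"] * float(x) + model["intercept"]
```

In a CI/CD setting, a job could run `train`, fail the build if `evaluate` exceeds an agreed threshold, and only then ship the code that calls `predict`.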

**Artifacts**
- The MLOps pipeline generates artifacts such as data, serialized models, code snippets, system logs, and ML model training and testing metrics.
- All these artifacts are useful for the successful working of the MLOps pipeline, ensuring its traceability and sustainability. 
- These artifacts are managed using middleware services such as the model registry, workspaces, logging services, source code management services, databases, and so on.
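A model registry, one of the middleware services named above, can be pictured as a versioned store of serialized models and their metrics. The in-memory `ModelRegistry` class below is a hypothetical sketch that uses `pickle` for serialization; production registries persist artifacts to durable storage and add access control.

```python
import pickle


class ModelRegistry:
    """Hypothetical in-memory registry of serialized models and metrics."""

    def __init__(self):
        self._store = {}  # model name -> list of version entries

    def register(self, name, model, metrics):
        """Serialize the model, store it with its metrics, return the version."""
        versions = self._store.setdefault(name, [])
        entry = {
            "version": len(versions) + 1,
            "blob": pickle.dumps(model),
            "metrics": dict(metrics),
        }
        versions.append(entry)
        return entry["version"]

    def load(self, name, version=None):
        """Deserialize a given version, or the latest one if none is given."""
        versions = self._store[name]
        entry = versions[-1] if version is None else versions[version - 1]
        return pickle.loads(entry["blob"])

    def metrics(self, name, version):
        """Return a copy of the metrics recorded for a given version."""
        return dict(self._store[name][version - 1]["metrics"])
```

Because every registration gets a new version number, an earlier model can always be reloaded, which supports the traceability the artifacts are meant to provide.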

**Middleware**
- Middleware is software that provides services to applications beyond those available from the operating system.
- Middleware services enable multiple applications to automate and orchestrate processes for the MLOps pipeline.
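As a rough illustration of orchestration, the sketch below runs named steps in order, passes a shared context between them, and logs each step. The `run_pipeline` function and the step names used with it are assumptions for this example only; real orchestration middleware also handles scheduling, retries, and distributed execution.

```python
import logging

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("orchestrator")


def run_pipeline(steps, context):
    """Run (name, function) steps in order, threading a context dict through."""
    for name, step in steps:
        log.info("starting step %s", name)
        context = step(context)  # each step returns the updated context
        context.setdefault("completed", []).append(name)
    return context
```

A usage example with three placeholder steps that mirror the build, deploy, and monitor operations of the MLOps pipeline:

```python
steps = [
    ("build", lambda ctx: {**ctx, "model": "m1"}),
    ("deploy", lambda ctx: {**ctx, "endpoint": "local"}),
    ("monitor", lambda ctx: {**ctx, "healthy": True}),
]
result = run_pipeline(steps, {})
```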

**Infrastructure**
- Infrastructure essentially refers to the storage and computing resources that ensure the successful working of the MLOps pipeline.
- There are various infrastructure options, such as on-premises resources or infrastructure as a service (IaaS), that is, cloud services.

- A fully automated MLOps workflow can be achieved through the optimization and synergy of the drivers with the MLOps pipeline.
- An advantage of an automated MLOps workflow is that it increases the efficiency of the IT team by reducing the time spent on repeatable tasks.