Agogosml is a data processing pipeline project that addresses the common need for operationalizing ML models. The project enables you to deploy models in production at scale and aspires to provide scoring and monitoring of models on the same infrastructure (coming soon).
- Re-usable/canonical data processing pipeline supporting multiple data streaming technologies (Kafka and Azure EventHub) and deployment to Kubernetes.
- CI/CD pipeline using Azure DevOps to deploy versioned and immutable pipeline.
- Blue/Green deployments, automatic role-backs or redeployment of a specific version.
Quick Install & Run
The following quick install instructions assumes you have the azure-cli, Python 3.7 (with C Compiler tools), Docker and Terraform installed.
# 1. Installing the CLI pip install agogosml_cli # 2. Create a directory for your project mkdir hello-agogosml cd hello-agogosml # 3. Init the project agogosml init # 4. Fill in the manifest.json (Docker Container Registry, Azure Subscription, etc). vi manifest.json # 5. Generate the code for the projects agogosml generate
The generated folder structure consists of the input reader, customer app and output writer as well as the Azure DevOps pipelines for CI/CD.
For more detailed information, see the User Guide
The agogosml package was developed to provide a Data Engineer with a simple configurable data pipeline consisting of three components: an input reader, app (that holds a trained ML model) and an output writer. The three components are instrumented using one Docker container per component.
The input reader acts as the data receiver and obtains the data required as input for the ML model. The package supports both Kafka and EventHub.
The output writer receives the scored data from the app and sends it onto a streaming client (a Kafka or Eventhub instance).
The app receives data from the input reader and feeds it to the ML model for scoring. Once scored the data is sent onto the output writer.
For more information about the design, see the Design Documentation
- User Guide - Getting Started
- Developer Guide
- Microsoft Open Source Code of Conduct
- Design and Architecture
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.
When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.