RohitRanga12/ML_Operations_Azure


Project Title: Operationalizing Machine Learning

Project Overview:

  • In this project, I use the Bank Marketing dataset.
  • The goal is to use Azure to configure a cloud-based production ML model, deploy it, and consume it.
  • I also create, publish, and consume a pipeline that uses an AutoML step.
  • Finally, I document the work in this README and in a screencast video.

Architectural Diagram



Key Steps

Step 1: Authentication

  • I skipped this step because I used the Udacity lab environment, which does not allow creating a Service Principal

Step 2: Auto ML Experiment

  • Upload the data (CSV file) to the default datastore in Azure ML Studio and register it as a dataset
  • Create a compute instance to be used later by the Jupyter notebook
  • Create a compute cluster for the AutoML experiment, with minimum nodes set to 1 and maximum nodes set to 5
  • Create an AutoML experiment using the registered dataset and the compute cluster
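The same setup can be sketched with the Azure ML Python SDK. This is illustrative only: the dataset name `bankmarketing`, cluster name `automl-cluster`, experiment name, and the settings values are assumptions, not necessarily the exact ones used in the project.

```python
# Illustrative sketch using the azureml-sdk package; requires config.json
# in the working directory, a registered dataset named "bankmarketing",
# and a compute cluster named "automl-cluster" (all placeholder names).

def automl_settings(experiment_timeout_minutes=20):
    """Settings dict passed to AutoMLConfig (illustrative values)."""
    return {
        "experiment_timeout_minutes": experiment_timeout_minutes,
        "primary_metric": "accuracy",
        "n_cross_validations": 5,
        "enable_early_stopping": True,
    }

if __name__ == "__main__":
    from azureml.core import Workspace, Experiment, Dataset
    from azureml.train.automl import AutoMLConfig

    ws = Workspace.from_config()                    # reads config.json
    dataset = Dataset.get_by_name(ws, "bankmarketing")
    config = AutoMLConfig(
        task="classification",
        training_data=dataset,
        label_column_name="y",                      # target column of the Bank Marketing data
        compute_target="automl-cluster",
        **automl_settings(),
    )
    run = Experiment(ws, "automl-bankmarketing").submit(config)
    run.wait_for_completion(show_output=True)
```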





Step 3: Deploy Best Model

  • Once the AutoML experiment completes, deploy the best model to Azure Container Instances (ACI) with authentication enabled
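In the project this deployment was done from the Studio UI; as a rough SDK equivalent, it might look like the sketch below. The model name, service name, entry script, and environment are placeholders.

```python
# ACI deployment settings; auth_enabled=True is the key flag for this step.
DEPLOY_SETTINGS = {"cpu_cores": 1, "memory_gb": 1, "auth_enabled": True}

if __name__ == "__main__":
    from azureml.core import Workspace, Environment
    from azureml.core.model import InferenceConfig, Model
    from azureml.core.webservice import AciWebservice

    ws = Workspace.from_config()
    model = Model(ws, name="bankmarketing-best")     # registered best model (placeholder name)
    env = Environment.get(ws, "AzureML-AutoML")      # curated AutoML environment
    inference_config = InferenceConfig(entry_script="score.py", environment=env)
    service = Model.deploy(
        ws, "bank-marketing-model", [model], inference_config,
        AciWebservice.deploy_configuration(**DEPLOY_SETTINGS),
    )
    service.wait_for_deployment(show_output=True)
```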

Step 4: Enable Logging

  • Create a Python virtual/conda environment with the libraries required to run the scripts, and activate it
  • Download config.json from the Azure ML portal and store it in the same folder as logs.py
  • In logs.py, insert the deployment name; then, from the terminal, run "python logs.py" to enable logging
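A minimal sketch of what logs.py does: enable Application Insights on the deployed service and print its logs. The service name below is a placeholder for the actual deployment name.

```python
# logs.py sketch: turn on Application Insights for the deployed web service
# and print its logs. SERVICE_NAME is a placeholder.
SERVICE_NAME = "bank-marketing-model"

def main():
    from azureml.core import Workspace
    from azureml.core.webservice import Webservice

    ws = Workspace.from_config()                 # reads config.json
    service = Webservice(workspace=ws, name=SERVICE_NAME)
    service.update(enable_app_insights=True)     # enable logging
    print(service.get_logs())

if __name__ == "__main__":
    main()
```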




Step 5: Swagger Documentation

  • Copy the swagger.json URL from the deployment and use wget to download the file into the 'swagger' folder
  • From the swagger directory, run "bash swagger.sh" to launch the Swagger UI Docker container on localhost port 10000; the Swagger UI is then available at localhost:10000
  • Run "python serve.py", then enter "http://localhost:8000/swagger.json" in the search bar of the Swagger UI; this displays the /score endpoint



Step 6: Consume Model Endpoint

  • In endpoint.py, insert the scoring_uri and key from the deployment, then run "python endpoint.py"
  • Benchmark the endpoint: check that Apache Benchmark (ab) is installed, insert the appropriate key and scoring_uri in benchmark.sh, and run "bash benchmark.sh"




Step 7: Create, Publish, Consume Pipeline

  • Run the cells in the aml-pipelines-with-automated-machine-learning-step.ipynb notebook after inserting the correct cluster name, experiment name, and dataset name. This creates and publishes the pipeline.
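The consume part of the notebook boils down to something like this sketch: trigger the published pipeline through its REST endpoint. The pipeline id and experiment name are placeholders.

```python
# Trigger a published Azure ML pipeline through its REST endpoint.
def rest_payload(experiment_name):
    """JSON body naming the experiment the pipeline run is logged under."""
    return {"ExperimentName": experiment_name}

if __name__ == "__main__":
    import requests
    from azureml.core import Workspace
    from azureml.core.authentication import InteractiveLoginAuthentication
    from azureml.pipeline.core import PublishedPipeline

    ws = Workspace.from_config()
    published = PublishedPipeline.get(ws, id="<pipeline-id>")   # placeholder id
    auth_header = InteractiveLoginAuthentication().get_authentication_header()
    response = requests.post(
        published.endpoint,              # the pipeline's REST endpoint URL
        headers=auth_header,
        json=rest_payload("pipeline-rest-endpoint"),
    )
    print("Run id:", response.json().get("Id"))
```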







How to improve the project in the future

  • The Bank Marketing dataset could be prepared further for better results (e.g. feature engineering, balancing the imbalanced classes, gathering more data)
  • The training time and the performance of the compute cluster backing the AutoML run could be increased to get better results from AutoML
  • Deep learning could be enabled (if a GPU is available on the cluster) to possibly achieve even better results

Screen Recording

Click here for Screencast

Standout Suggestions

  • Benchmarked the model endpoint using Apache Benchmark, even though this step was optional
  • Created an environment.yml file listing all the packages required to run the project. This is good practice because it makes the project easy to reproduce on another machine: when creating a conda environment, the YAML file can be used to install all the packages.
