machine learning pipeline: the challenges and verification with blockchain

For a typical machine learning development process, it will go through two stages: training and production stages. During the two stages, tangled with various platforms, it will introduce room for errors. Such as mismatch config files, trained model object or test data corruption. This can also happen during file transmission. This code is to demonstrate the typical model train --> production framework and where it can go wrong. Most importantly, how to fix this problem with a block chain process. I have build a block chain class to track the process and a demo included.

A diagram for typical machine learning model development and deployment:

Training data process--

Training data -> training --> persisted model ~~]]]
##Production
Streaming Input -------------------------------->Prediction --> Results.

##Block chain transaction process (like a linked list):
Stage 1: Configuration --> raw data Blockchain [0|signature of config] -->[signature of config | sigature of raw data]
Stage 2: Configuration --> raw data --> clean data --> persisted model
Stage 3: Configuration --> persisted model --> *****

Tutorial

first run main.py uncomment line 23-29 to generate golden model outputs
reverse the comment/uncomment in step 1, rerun it and the blockchain verification will be valid. You'll get a produciton results.
uncomment line 24 and comment 23 to generate random data to demostrate the issue. When code changes, all the pipeline messed up.
Check the ./logs/main.log for verification using blockchain class. You'll see the error message since the data is random during training process.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
ML_in_Tandem		ML_in_Tandem
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

machine learning pipeline: the challenges and verification with blockchain

Training data process--

Tutorial

About

Uh oh!

Releases

Packages

Languages

eulertech/machine_learning_blockchain_verification_framework

Folders and files

Latest commit

History

Repository files navigation

machine learning pipeline: the challenges and verification with blockchain

Training data process--

Tutorial

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages