Skip to content

hoangdesu/Spark-MongoDB-MLflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Data Pipeline using Spark, MongoDB and MLflow

Table of Contents

About

This pipeline can be found in the notebook hoang.ipynb, consisting of the following tasks:

  • Task 1: MongoDB
  • Task 2: Data ingestion and data cleaning/transformation
  • Task 3: Model training and tracking with data pipeline and MLflow
  • Task 4: Visualisation

Final visualiaztion on MongoDB charts:

MongoDB charts

Connect with me

If you find this project useful, you can let me know. I would love to hear about it! 🍣

Notes: the model training with MLflow is incomplete.

About

Data Pipeline using Spark, MongoDB and MLflow

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published