Skip to content

Implemented random forest machine learning algorithm using pyspark on AWS EMR to classify the wines. The model is then deployed in docker container.

Notifications You must be signed in to change notification settings

Chan2k20/Wine-Prediction-Prediction-Model-On-AWS-EMR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project Title: Wine Quality Prediction on AWS EMR

Description/Problem Statement:Developed a wine quality prediction machine learning (ML) model using Spark on AWS Elastic MapReduce (EMR). The project involved training the model in parallel on multiple EC2 instances and subsequently deploying it to predict wine quality in an application running on a single EC2 instance.

Skills Utilized:

  • AWS (EMR, EC2)
  • Spark
  • Machine Learning
  • Model Deployment

Solution:

  • Utilized AWS EMR to create a Spark cluster for distributed data processing.
  • Employed Spark to build a wine quality prediction ML model, leveraging its parallel computing capabilities across multiple EC2 instances for efficient model training.
  • Trained the ML model on a dataset of wine features and quality ratings.
  • Implemented model saving and loading functionalities to ensure seamless integration into the prediction application.
  • Developed an application running on a single EC2 instance to perform wine quality prediction using the trained model.

Program Output: The application provided predictions for wine quality based on input features, allowing users to assess the quality of wine samples.

About

Implemented random forest machine learning algorithm using pyspark on AWS EMR to classify the wines. The model is then deployed in docker container.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published