Skip to content

Using machine learning to predict/classify the quality of a given wine.

License

Notifications You must be signed in to change notification settings

noah-kg/predicting-wine-quality

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Predicting Wine Quality Using Multiclass Classification Algorithms

In this project, I am to tackle the wine data set from the UCI Machine Learning Repository. Though there are two sets of data (one for red wines and the other for white wines), this project will only focus on the red wine data set (for now). The dataset contains about 1,600 wines of varying qualities ranging from 0-10 (though they all fall within the 3-8 range). I aim to train and test multiple models to see which one can most accurately predict the wine's quality score. This data set is wildly imbalanced when it comes to quality, and that is a big hurdle I tackle in this project through the use of SMOTE.

Project Goals:

  • Exploratory Data Analysis
  • Data Visualization of different compounds in wine
  • Create pipeline structure to easily train/test multiple models
  • Train/Test multiple models both before and after balancing out the data with SMOTE
  • Use GridSearchCV to assist in hyperparameter tuning of the top performing models

To view the notebook, click on the .ipynb file above, or view it here.