Goals

Our goal is to create classification models using a collection of wine reviews on Kaggle, originally from Wine Enthusiast.

Lindsey will create a classification model that attempts to discern whether a wine is white or red.

Harrison will test whether the length of a review can predict a wine's score or price, using a classification model after grouping scores and prices into categories.
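As a rough illustration of that setup, the sketch below derives a review-length feature and turns scores and prices into categories with pandas. The column names (`description`, `points`, `price`), the toy rows, and the category edges are all assumptions made for this example; they are not taken from the notebooks.

```python
import pandas as pd

# Toy rows standing in for the Kaggle reviews. Column names are assumed.
reviews = pd.DataFrame({
    "description": [
        "Crisp and bright.",
        "Ripe black fruit with firm tannins and a long, smoky finish.",
        "Simple, light and easy to drink.",
        "Dense, layered and structured, with dark cherry, cedar and spice.",
    ],
    "points": [84, 91, 86, 95],
    "price": [12.0, 45.0, 18.0, 150.0],
})

# Predictor: review length in characters.
reviews["review_length"] = reviews["description"].str.len()

# Targets: hypothetical category edges, chosen only for illustration.
# pd.cut uses right-closed intervals by default, e.g. (79, 85].
reviews["score_category"] = pd.cut(
    reviews["points"], bins=[79, 85, 90, 100], labels=["low", "mid", "high"]
)
reviews["price_category"] = pd.cut(
    reviews["price"],
    bins=[0, 20, 100, float("inf")],
    labels=["budget", "mid", "premium"],
)

print(reviews[["review_length", "score_category", "price_category"]])
```

With the targets expressed as categories, `review_length` can be passed to any classifier as a single feature.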

Results

Lindsey iterated through many different types of classification models to arrive at an XGBoost model, which predicted whether a wine was white or red with 80% accuracy on the test data.

Harrison found that review length as a predictor of point value was not much better than random chance. Review length was a much better predictor of a wine's price, but only for wines under $100! For wines above $100, review length was not a good predictor of price.
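One way to make "not much better than random chance" concrete is to compare a model's accuracy against a naive baseline that always predicts the most common category. The sketch below does this in plain Python. The labels are invented for illustration and are not results from this project, and a majority-class baseline is an assumed choice of reference point; the notebooks may define chance differently.

```python
from collections import Counter


def majority_baseline_accuracy(labels):
    """Accuracy obtained by always predicting the most common class."""
    counts = Counter(labels)
    return counts.most_common(1)[0][1] / len(labels)


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up category labels, purely for illustration.
y_true = ["low", "mid", "mid", "high", "mid", "low"]
y_pred = ["mid", "mid", "mid", "high", "low", "low"]

baseline = majority_baseline_accuracy(y_true)
model_acc = accuracy(y_true, y_pred)
lift = model_acc - baseline

print(f"baseline={baseline:.2f} model={model_acc:.2f} lift={lift:.2f}")
```

A lift near zero means the predictor adds little beyond guessing the most frequent category.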

Harrison also worked through a supplemental dataset on wine composition to see the effects of the random forest classifier in a different context.
