Skip to content

judemoon/wine

Repository files navigation

Explore and Summarize White Wine Quality Data Using R

July 2017, by Jude Moon

Overview

In this project, I explored a white wine quality data set for Portuguese "Vinho Verde" wine, whose data were collected and published by Cortez et al (2009). This tidy data set contains 4,898 white wines with 11 variables on quantifying the physicochemical properties of each wine. At least 3 wine experts rated the quality of each wine, providing a rating between 0 (very bad) and 10 (very excellent). Details and units of the variables are provided here. The objectives of the analysis is to summarize the data to determine (1) whether any relationship between the physicochemical properties and sensory quality exists and (2) if so, which physicochemical properties are the most effective to describe quality, in other words, which properties are highly correlated with the quality score.

Files

  • wineQualityWhites.csv: main data file
  • wineQualityReds.csv: supplementary data file
  • wine_project.Rmd: data exploration and answers to the questions
  • wine_project.html: html knitted from wine_project.Rmd
  • wine_project1.html: html knitted from wine_project.Rmd, 1st project submission draft
  • 3d_pca.JPG: image file of snapshot of 3d interacting plot
  • 3d_pca_subset.JPG: image file of snapshot of 3d interacting plot
  • chisq_all.jpg: image file of snapshot of interacting plot
  • chisq_rem_1.jpg: image file of snapshot of interacting plot
  • chisq_trans.jpg: image file of snapshot of interacting plot
  • chisq_trans_nosug.jpg: image file of snapshot of interacting plot

Releases

No releases published

Packages

No packages published

Languages