# Wine Data Analysis   

Download Link : https://archive.ics.uci.edu/ml/datasets/wine+quality  

Citation : P. Cortez, A. Cerdeira, F. Almeida, T. Matos and J. Reis. Modeling wine preferences by data mining from physicochemical properties. In Decision Support Systems, Elsevier, 47(4):547-553, 2009.

In [1]:
import pandas as pd

red_wine_df = pd.read_csv('winequality-red.csv', delimiter=';')
white_wine_df = pd.read_csv('winequality-white.csv', delimiter=';')

In [2]:
red_wine_df.columns

Index(['fixed acidity', 'volatile acidity', 'citric acid', 'residual sugar',
       'chlorides', 'free sulfur dioxide', 'total sulfur dioxide', 'density',
       'pH', 'sulphates', 'alcohol', 'quality'],
      dtype='object')

In [3]:
white_wine_df.columns

Index(['fixed acidity', 'volatile acidity', 'citric acid', 'residual sugar',
       'chlorides', 'free sulfur dioxide', 'total sulfur dioxide', 'density',
       'pH', 'sulphates', 'alcohol', 'quality'],
      dtype='object')
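Since both tables expose the same twelve columns, they can be stacked into a single frame for side-by-side comparison. The following is a minimal sketch, not part of the original notebook: the helper name `combine_wines` and the added `wine_type` column are assumptions introduced here for illustration.

```python
import pandas as pd


def combine_wines(red: pd.DataFrame, white: pd.DataFrame) -> pd.DataFrame:
    """Stack the red and white tables, tagging each row with its wine type.

    `wine_type` is a hypothetical label column; it is not in the dataset.
    """
    # Refuse to stack tables whose columns differ in name or order.
    if list(red.columns) != list(white.columns):
        raise ValueError("red and white tables have different columns")
    return pd.concat(
        [red.assign(wine_type="red"), white.assign(wine_type="white")],
        ignore_index=True,
    )
```

With the frames loaded above, `combine_wines(red_wine_df, white_wine_df)` would return one table that can be grouped or filtered by `wine_type`.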

## Columns Description  
- **Fixed Acidity** : Amount of tartaric acid in the wine, measured in g/dm<sup>3</sup>.
- **Volatile Acidity** : Amount of acetic acid in the wine, measured in g/dm<sup>3</sup>.
- **Citric Acid** : Amount of citric acid in the wine, measured in g/dm<sup>3</sup>. Contributes to the crispness of the wine.
- **Residual Sugar** : Amount of sugar left in the wine after fermentation, measured in g/dm<sup>3</sup>.
- **Chlorides** : Amount of sodium chloride (salt) in the wine, measured in g/dm<sup>3</sup>.
- **Free Sulfur Dioxide** : Amount of SO<sub>2</sub> in free form, measured in mg/dm<sup>3</sup>.
- **Total Sulfur Dioxide** : Total amount of SO<sub>2</sub>, measured in mg/dm<sup>3</sup>. SO<sub>2</sub> acts as an antioxidant and antimicrobial agent, but too much of it can lead to a pungent smell.
- **Density** : Density of the wine in g/cm<sup>3</sup>.
- **pH** : pH of the wine on a scale of 0-14, where 0 is highly acidic and 14 is highly basic.
- **Sulphates** : Amount of potassium sulphate in the wine, measured in g/dm<sup>3</sup>. Contributes to the formation of SO<sub>2</sub>.
- **Alcohol** : Alcohol content of the wine, in % by volume.
- **Quality** : Wine quality graded on a scale of 0-10 (higher is better).
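The descriptions above imply some hard limits on the values, which can serve as a quick sanity check after loading. This is a rough sketch under stated assumptions: the helper `out_of_range_counts` and the `EXPECTED_RANGES` table are hypothetical names, and the bounds are simply the theoretical limits of the pH scale and of a volume percentage, not ranges observed in the data.

```python
import pandas as pd

# Illustrative bounds: pH is defined on 0-14, alcohol is a percentage.
EXPECTED_RANGES = {"pH": (0, 14), "alcohol": (0, 100)}


def out_of_range_counts(df: pd.DataFrame, ranges: dict = EXPECTED_RANGES) -> dict:
    """Count rows per column that fall outside the inclusive [low, high] bounds.

    Missing values (NaN) are counted as out of range.
    Columns absent from `df` are skipped.
    """
    counts = {}
    for col, (low, high) in ranges.items():
        if col in df.columns:
            counts[col] = int((~df[col].between(low, high)).sum())
    return counts
```

Calling `out_of_range_counts(red_wine_df)` would be expected to report zero for every column if the file matches the descriptions; any non-zero count is a hint to inspect those rows.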

# Questions We Can Try to Answer  
- Which factors, or combinations of factors, affect the quality of red wine and of white wine?
- Do different factors affect quality in red and white wines?
- Are there any interesting trends in columns other than Quality?
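One possible starting point for the first two questions is to rank the physicochemical columns by their linear correlation with `quality`. The sketch below is an assumption about how one might begin, not the analysis itself: `quality_correlations` is a hypothetical helper, it uses Pearson correlation (the pandas default), and a strong correlation does not by itself show that a factor affects quality.

```python
import pandas as pd


def quality_correlations(df: pd.DataFrame, target: str = "quality") -> pd.Series:
    """Return each feature's Pearson correlation with `target`,
    ordered from strongest to weakest absolute correlation."""
    # Correlate every non-target column with the target column.
    corr = df.drop(columns=target).corrwith(df[target])
    # Sort by magnitude while keeping the sign of each coefficient.
    order = corr.abs().sort_values(ascending=False).index
    return corr.reindex(order)
```

Running `quality_correlations(red_wine_df)` and `quality_correlations(white_wine_df)` separately gives two rankings that can be compared to see whether the same columns stand out for both wine types.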