# Interpretation of Best Fitting Model # 

We have found the best model through forward selection to be the one with the following 8 covariates: 
- `fixed acidity`
- `volatile acidity`
- `residual sugar`
- `chlorides`
- `total sulfur dioxide`
- `density`
- `sulphates`
- `alcohol`

To answer our research question, *"Do residual sugar and alcohol level have an association with wine quality?"*, we look more closely at this model, focusing on our two variables of interest.

In [None]:
rw <- read.csv("data/winequality-red.csv")
# Recode quality as a binary response: 1 = high quality (score of 7 or above), 0 = otherwise
rw$quality <- ifelse(rw$quality >= 7, 1, 0)


In [8]:
rw_reg8 <- glm(quality ~ fixed.acidity + volatile.acidity + residual.sugar + chlorides +
                 total.sulfur.dioxide + density + sulphates + alcohol,
               data = rw, family = binomial)
summary(rw_reg8)


Call:
glm(formula = quality ~ fixed.acidity + volatile.acidity + residual.sugar + 
    chlorides + total.sulfur.dioxide + density + sulphates + 
    alcohol, family = binomial, data = rw)

Deviance Residuals: 
    Min       1Q   Median       3Q      Max  
-3.0158  -0.4314  -0.2220  -0.1255   2.9883  

Coefficients:
                       Estimate Std. Error z value Pr(>|z|)    
(Intercept)           2.268e+02  9.163e+01   2.475 0.013336 *  
fixed.acidity         2.812e-01  8.029e-02   3.502 0.000462 ***
volatile.acidity     -2.913e+00  6.467e-01  -4.504 6.66e-06 ***
residual.sugar        2.328e-01  7.009e-02   3.322 0.000893 ***
chlorides            -8.441e+00  3.259e+00  -2.590 0.009593 ** 
total.sulfur.dioxide -1.360e-02  3.447e-03  -3.946 7.95e-05 ***
density              -2.409e+02  9.202e+01  -2.618 0.008835 ** 
sulphates             3.699e+00  5.287e-01   6.997 2.62e-12 ***
alcohol               7.823e-01  1.120e-01   6.983 2.88e-12 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.0

In [25]:
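# Exponentiating a logistic regression coefficient gives an odds ratio:
# the multiplicative change in the odds of high quality per one-unit increase
# in the covariate. It is not itself an odds or a probability, so
# OR / (1 + OR) should not be read as a probability.
or_sugar <- exp(coef(rw_reg8)["residual.sugar"])
or_alcohol <- exp(coef(rw_reg8)["alcohol"])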


From the summary we can see that **all of the coefficients are statistically significant at the 5% significance level** (every p-value is below 0.05). Thus, we conclude that residual sugar and alcohol level are each associated with wine quality after adjusting for the other covariates in the model.
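
The significance claim can be checked mechanically rather than by eye. The sketch below is only illustrative: the p-values are transcribed by hand from the `Pr(>|z|)` column of the summary output, and the names `p_values`, `alpha`, `all_significant`, and `largest_p` are introduced here and do not appear in the original analysis.

```r
# p-values transcribed from the Pr(>|z|) column of the summary output.
# With the fitted model available, the same vector could be taken from
# coef(summary(rw_reg8))[, "Pr(>|z|)"] instead of being typed by hand.
p_values <- c(
  "(Intercept)"        = 0.013336,
  fixed.acidity        = 0.000462,
  volatile.acidity     = 6.66e-06,
  residual.sugar       = 0.000893,
  chlorides            = 0.009593,
  total.sulfur.dioxide = 7.95e-05,
  density              = 0.008835,
  sulphates            = 2.62e-12,
  alcohol              = 2.88e-12
)

alpha <- 0.05  # significance level used in the text

# TRUE only if every coefficient clears the 5% threshold
all_significant <- all(p_values < alpha)

# Name of the term with the weakest evidence (largest p-value)
largest_p <- names(which.max(p_values))

print(all_significant)
print(largest_p)
```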

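Residual sugar has an estimated coefficient of 0.233: holding all other variables constant, a 1 g/L increase in residual sugar is associated with a 0.233 increase in the log-odds of a wine being rated high quality (a score of 7 or above). Exponentiating gives an odds ratio of exp(0.233) ≈ 1.26, so each additional g/L of residual sugar is associated with roughly 26% higher odds of a high rating. The estimated coefficient of alcohol is 0.782, meaning that, holding all other variables constant, a one-percentage-point increase in alcohol level is associated with an odds ratio of exp(0.782) ≈ 2.19, or roughly 119% higher odds of being highly rated. These are multiplicative changes in the odds, not probabilities: the corresponding change in probability depends on the wine's starting probability, and therefore on the values of the other covariates.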
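
To make the odds-scale reading concrete, here is a minimal sketch that works only from the two estimates printed in the summary output, so it does not need the data file. The baseline probabilities are hypothetical values chosen for illustration, and the names `beta`, `odds_ratio`, `pct_change_odds`, `baseline_p`, and `new_p_alcohol` are assumptions introduced here.

```r
# Estimates transcribed from the summary output (log-odds scale)
beta <- c(residual.sugar = 0.2328, alcohol = 0.7823)

# Odds ratio: multiplicative change in the odds of high quality
# per one-unit increase in the covariate, other covariates held fixed
odds_ratio <- exp(beta)

# The same quantity expressed as a percentage change in the odds
pct_change_odds <- 100 * (odds_ratio - 1)

# The change in probability depends on where a wine starts.
# Hypothetical baseline probabilities of a high rating:
baseline_p <- c(0.05, 0.20, 0.50)

# Move to the log-odds scale, add the alcohol coefficient, move back
new_p_alcohol <- plogis(qlogis(baseline_p) + beta[["alcohol"]])

print(round(odds_ratio, 3))
print(round(pct_change_odds, 1))
print(round(cbind(baseline_p, new_p_alcohol), 3))
```

The same one-unit increase in alcohol moves the probability by a different amount at each baseline. The quantity exp(b) / (1 + exp(b)) is the new probability only in the special case where the baseline probability is exactly 0.5.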
