# Exploration of Evaluation Metrics

Evaluation of Model Performance based on Accuracy:

$$
Accuracy = \frac{\text{Correctly Classified Examples}}{\text{Total Examples}}
$$

$$
Accuracy = P(Correct) = P(Correct\cap Disease) + P(Correct\cap Normal)
$$

In terms of Conditional Probability:

$$
\begin{aligned}
\because\ & P(A\cap B) = P(A | B)\, P(B) \\
\therefore\ & P(Correct) = P(Correct|Disease)\,P(Disease) + P(Correct|Normal)\,P(Normal)
\end{aligned}
$$

$$
\boxed{Accuracy = P(+|disease)P(disease) + P(-|normal)P(normal)}
$$

## Sensitivity (True Positive rate)

Given that a patient has the disease, sensitivity is the probability that the model predicts positive.

$$
Sensitivity = P(+|Disease) = \frac{No.(+ \text{ and } Disease)}{No.(Disease)}
$$

## Specificity (True Negative rate)

Given that a patient is normal, specificity is the probability that the model predicts negative.

$$
Specificity = P(-| Normal) = \frac{No.(- \text{ and } Normal)}{No.(Normal)}
$$

## Prevalence

Prevalence is the probability that a patient in the population has the disease.

$$
Prevalence = P(Disease) = \frac{No.(Disease)}{No.(\text{Total examples})}
$$

Then, $P(Normal) = 1 - Prevalence$.

Accuracy can now be written in these terms:

$$
\boxed{Accuracy = Sensitivity \times Prevalence + Specificity \times (1-Prevalence)}
$$
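The boxed identity can be sketched in a few lines of Python. The function name `accuracy_from_rates` and the numeric values below are hypothetical, chosen only to illustrate how prevalence weights the two rates.

```python
def accuracy_from_rates(sensitivity: float, specificity: float, prevalence: float) -> float:
    """Accuracy = Sensitivity * Prevalence + Specificity * (1 - Prevalence)."""
    return sensitivity * prevalence + specificity * (1.0 - prevalence)


# Hypothetical values, not taken from any real model.
acc_rare = accuracy_from_rates(sensitivity=0.9, specificity=0.8, prevalence=0.1)
acc_common = accuracy_from_rates(sensitivity=0.9, specificity=0.8, prevalence=0.5)

print(f"prevalence 0.1 -> accuracy {acc_rare:.2f}")
print(f"prevalence 0.5 -> accuracy {acc_common:.2f}")
```

With a rare disease, the specificity term carries most of the weight, so the same model can report a different accuracy purely because the prevalence changed.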

## Positive and Negative Predictive Value

During diagnosis, it may be of interest to calculate:

**PPV** : Given the model prediction is positive, the probability that a patient has a disease

$$
PPV = P(Disease| +) = \frac{No.(+ \text{ and } Disease)}{No.(+)}
$$

**NPV** : Given the model prediction is negative, the probability that a patient is normal

$$
NPV = P(Normal| -) = \frac{No.(- \text{ and } Normal)}{No.(-)}
$$

## Confusion Matrix

![](./Imgs/confusion_matrix.png)

@ [ImgSrc](https://towardsdatascience.com/model-evaluation-techniques-for-classification-models-eac30092c38b)

A confusion matrix summarizes the performance of a classifier in the form of a table. The rows represent the ground truth and the columns represent the model's predictions.

**Model Output Example**

| Ground Truth/ Model Output | +ve | -ve |
| -------------------------- | --- | --- |
| Disease                    | 2   | 1   |
| Normal                     | 2   | 5   |
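
As a rough check, the metrics defined so far can be computed directly from the four counts in the example table. The variable names (`tp`, `fn`, `fp`, `tn`) are introduced here for illustration and do not appear elsewhere in these notes.

```python
# Counts from the example table (rows: ground truth, columns: model output).
tp, fn = 2, 1  # Disease row: predicted +ve, predicted -ve
fp, tn = 2, 5  # Normal row:  predicted +ve, predicted -ve

total = tp + fn + fp + tn

sensitivity = tp / (tp + fn)     # P(+ | Disease)
specificity = tn / (tn + fp)     # P(- | Normal)
prevalence = (tp + fn) / total   # P(Disease)
ppv = tp / (tp + fp)             # P(Disease | +)
npv = tn / (tn + fn)             # P(Normal | -)
accuracy = (tp + tn) / total     # fraction of correct predictions

# The decomposition Sensitivity * Prevalence + Specificity * (1 - Prevalence)
# should agree with the directly counted accuracy.
decomposed = sensitivity * prevalence + specificity * (1 - prevalence)

print(f"sensitivity={sensitivity:.3f} specificity={specificity:.3f}")
print(f"prevalence={prevalence:.3f} ppv={ppv:.3f} npv={npv:.3f}")
print(f"accuracy={accuracy:.3f} decomposed={decomposed:.3f}")
```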

## Calculating the PPV in terms of Sensitivity, Specificity and Prevalence

- $pos$ : Actual Positive

- $\widehat{pos}$ :  Predicted Positive

$\because PPV = P(Disease|+)$  above :arrow_up:,  $\therefore \boxed{PPV = P(pos|\widehat{pos})}$ 

Now, using Bayes rule:

$$
PPV = \frac{P(\widehat{pos}|pos) \times P(pos)}{P(\widehat{pos})}
$$

We can also express the following in terms of actual and predicted positives:

- $Sensitivity = P(\widehat{pos}|pos)$ : $P(+|Disease)$ :arrow_up:

- $Prevalence = P(pos)$ : $P(Disease)$ :arrow_up:

Measured as fractions of all examples, the model's total positive predictions are the sum of its correct positive predictions and its incorrect positive predictions, i.e.

$$
P(\widehat{pos}) = TruePos + FalsePos
$$

Now,

$$
TruePos = P(\widehat{pos}|pos) \times P(pos) = Sensitivity \times Prevalence
$$

$$
FalsePos = P(\widehat{pos}|neg) \times P(neg) = (1-Specificity) \times (1-Prevalence)
$$

After substitution, the PPV becomes

$$
\boxed{PPV = 
\frac{Sensitivity \times Prevalence}
     {Sensitivity \times Prevalence + (1-Specificity) \times (1-Prevalence)}}
$$
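
A minimal sketch of the boxed formula follows. The function name `ppv_from_rates` and the 0.9 / 0.01 / 0.5 inputs are hypothetical; the first call reuses the rates implied by the example table in the Confusion Matrix section.

```python
def ppv_from_rates(sensitivity: float, specificity: float, prevalence: float) -> float:
    """PPV via Bayes' rule from sensitivity, specificity and prevalence."""
    true_pos = sensitivity * prevalence                # P(pos_hat | pos) * P(pos)
    false_pos = (1 - specificity) * (1 - prevalence)   # P(pos_hat | neg) * P(neg)
    # Assumes the model predicts positive at least sometimes,
    # otherwise the denominator is zero.
    return true_pos / (true_pos + false_pos)


# Rates from the example table: sensitivity 2/3, specificity 5/7, prevalence 3/10.
ppv_table = ppv_from_rates(2 / 3, 5 / 7, 3 / 10)

# Hypothetical rates showing how strongly prevalence drives PPV.
ppv_rare = ppv_from_rates(0.9, 0.9, 0.01)
ppv_common = ppv_from_rates(0.9, 0.9, 0.5)

print(f"table: {ppv_table:.3f}, rare: {ppv_rare:.3f}, common: {ppv_common:.3f}")
```

The table-based value matches the count-based definition $No.(+ \text{ and } Disease) / No.(+) = 2/4$, which is a quick way to confirm the substitution.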

## Threshold

When taking a model's predicted probability for a given disease into account, a threshold (or operating point) can be defined such that:

$$
Diagnosis = 
\begin{cases}
    \text{ Positive if } P_{model} \geq Threshold \\
    \text{ Negative if } P_{model} < Threshold
  \end{cases}
$$

**Choice of Threshold**:

Given Sensitivity $P(+|Disease)$ and Specificity $P(-|Normal)$:

- If $t = 0$ : Classify all examples as Positive (Sensitivity: 1, Specificity: 0)

- If $t = 1$ : Classify all examples as Negative (Sensitivity: 0, Specificity: 1)
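
The effect of the threshold can be sketched with a small, made-up set of labels and model probabilities. Everything in this block (the data, the helper names `classify` and `sensitivity_specificity`, and the choice to treat a score exactly at the threshold as positive) is an assumption made for illustration.

```python
def classify(probabilities, threshold):
    """Return True (positive) where the model probability reaches the threshold."""
    return [p >= threshold for p in probabilities]


def sensitivity_specificity(labels, predictions):
    """Compute (sensitivity, specificity) from boolean labels and predictions."""
    tp = sum(1 for y, p in zip(labels, predictions) if y and p)
    fn = sum(1 for y, p in zip(labels, predictions) if y and not p)
    tn = sum(1 for y, p in zip(labels, predictions) if not y and not p)
    fp = sum(1 for y, p in zip(labels, predictions) if not y and p)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical data: True = Disease, False = Normal.
labels = [True, True, True, False, False, False, False]
probs = [0.9, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]

results = {}
for t in (0.0, 0.5, 1.0):
    results[t] = sensitivity_specificity(labels, classify(probs, t))
    sens, spec = results[t]
    print(f"t={t:.1f}: sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Raising the threshold trades sensitivity for specificity; none of the made-up scores equals 1.0, so $t = 1$ labels every example negative here.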

## Confidence Interval

Reporting the variability in our estimate:

- The accuracy of the model on the whole population $(p_{unknown})$ is infeasible to measure; instead, it is estimated from a sample of the population $(\widehat{p}_{known})$.

- Using the sample, we report that the model's accuracy $(p_{unknown})$ lies in an interval $[x, y]$ with a stated confidence level (e.g. 95%).

- In repeated sampling, this method produces intervals that include the population accuracy in about 95% of samples.

- As the sample size increases, the width of the interval decreases (i.e. a better estimate of the population accuracy); with a small sample, the interval is wider.
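
These notes do not say how the interval is computed. One common choice, assumed here, is the normal-approximation (Wald) interval $\widehat{p} \pm z\sqrt{\widehat{p}(1-\widehat{p})/n}$ with $z = 1.96$ for 95% confidence. The function name and the sample counts below are hypothetical.

```python
import math


def accuracy_confidence_interval(n_correct: int, n_total: int, z: float = 1.96):
    """Wald (normal-approximation) interval for accuracy; z=1.96 gives ~95%."""
    p_hat = n_correct / n_total
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n_total)
    # Clip to [0, 1] since accuracy is a probability.
    return max(0.0, p_hat - half_width), min(1.0, p_hat + half_width)


# Same sample accuracy (70%), two hypothetical sample sizes.
small = accuracy_confidence_interval(70, 100)
large = accuracy_confidence_interval(7000, 10000)

print(f"n=100:   [{small[0]:.3f}, {small[1]:.3f}]")
print(f"n=10000: [{large[0]:.3f}, {large[1]:.3f}]")
```

The larger sample yields a narrower interval around the same point estimate. The Wald interval is known to behave poorly for very small samples or accuracies near 0 or 1, where alternatives such as the Wilson interval are often preferred.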

## ROC Curve