<h1>Evaluation Metrics</h1>

<h3>Accuracy</h3>

<p>Accuracy is the percent of predictions that are correct.</p>
<p>For example, if you have 100 datapoints and predict 70 of them correctly and 30 incorrectly, the accuracy is 70%.</p>

<p>Accuracy is straightforward and easy to understand, but it’s not always the best metric.</p>

<p><strong>For example,</strong> let’s say we have a model to predict whether a credit card charge is fraudulent. Of 10000 credit card charges, 9900 are legitimate and 100 are fraudulent. We could build a model that predicts every single charge is legitimate, and it would get 9900/10000 (99%) of the predictions correct without catching a single fraudulent charge!</p>

<p><strong>As another example,</strong> let’s say you have a model to predict spam email. Your training set has 1000 emails: 950 are legitimate and 50 are spam. If you build a model that predicts every email is legitimate, you get an accuracy of 950/1000 (95%).</p>

<strong>This means accuracy is a good measure if our classes are evenly split, but is very misleading if we have imbalanced classes.</strong>

<p>Therefore, always use caution with accuracy. You need to know the distribution of the classes to know how to interpret the value.</p>
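<p>The credit card example above can be reproduced in a few lines. This is a minimal sketch: the label encoding (0 for legitimate, 1 for fraudulent) and the helper name <code>accuracy</code> are assumptions made for illustration, not part of the original lesson.</p>

```python
# Labels for the credit card example: 0 = legitimate, 1 = fraudulent.
actual = [0] * 9900 + [1] * 100

# A "model" that ignores its input and always predicts legitimate.
predicted = [0] * len(actual)

def accuracy(actual, predicted):
    """Fraction of predictions that match the actual labels."""
    correct = sum(a == p for a, p in zip(actual, predicted))
    return correct / len(actual)

baseline_accuracy = accuracy(actual, predicted)

# Number of fraudulent charges the model actually flags.
fraud_caught = sum(a == 1 and p == 1 for a, p in zip(actual, predicted))

print(baseline_accuracy)  # 0.99
print(fraud_caught)       # 0
```

<p>The accuracy looks excellent, yet the model never flags a fraudulent charge, which is exactly why the class distribution matters when reading this number.</p>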

<h3>Confusion Matrix</h3>

<p>As we saw in the previous part, we care not only about how many datapoints we classify correctly overall, but also about how many of the positive datapoints and how many of the negative datapoints we predict correctly.</p>

<p>We can see all the important values in what is called the Confusion Matrix (also called the Error Matrix or the Table of Confusion).</p>

<p>The Confusion Matrix is a table showing four values:</p>
<ul>
    <li><strong>True Positive (TP):</strong> Datapoints we predicted positive that are actually positive.</li>
    <li><strong>False Negative (FN):</strong> Datapoints we predicted negative that are actually positive.</li>
    <li><strong>False Positive (FP):</strong> Datapoints we predicted positive that are actually negative.</li>
    <li><strong>True Negative (TN):</strong> Datapoints we predicted negative that are actually negative.</li>
</ul>

<p>The terms can be a little hard to keep track of. The way to remember is that the second word is what our prediction is (positive or negative) and the first word is whether that prediction was correct (true or false).</p>
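<p>The naming rule (second word is the prediction, first word is whether it was correct) can be sketched in code. The function name <code>confusion_counts</code>, the 1/0 encoding for positive/negative, and the toy labels below are all hypothetical and chosen only for illustration.</p>

```python
def confusion_counts(actual, predicted):
    """Return (TP, FP, FN, TN) for binary labels where 1 is the positive class."""
    tp = fp = fn = tn = 0
    for a, p in zip(actual, predicted):
        if p == 1 and a == 1:
            tp += 1   # predicted positive, and that was correct
        elif p == 1 and a == 0:
            fp += 1   # predicted positive, but that was wrong
        elif p == 0 and a == 1:
            fn += 1   # predicted negative, but that was wrong
        else:
            tn += 1   # predicted negative, and that was correct
    return tp, fp, fn, tn

# Made-up toy labels, for illustration only.
actual    = [1, 1, 1, 0, 0, 0, 0, 1]
predicted = [1, 0, 1, 0, 1, 0, 0, 1]

tp, fp, fn, tn = confusion_counts(actual, predicted)
print(tp, fp, fn, tn)  # 3 1 1 3
```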

<p>In our Titanic dataset, we have 887 passengers: 342 survived (positive) and 545 didn’t survive (negative). The model we built in the previous module has the following confusion matrix.</p>

<table border="1">
  <tr>
      <th></th>
      <th>Actual Positive</th>
      <th>Actual Negative</th>
  </tr>
  <tr style="background-color: white;">
      <th>Predicted Positive</th>
      <td style="background-color: lightblue;">233</td>
      <td>65</td>
  </tr>
  <tr>
      <th>Predicted Negative</th>
      <td>109</td>
      <td style="background-color: lightblue;">480</td>
  </tr>
</table>

<p>The blue shaded squares are the counts of the predictions that we got correct. So of the 342 passengers that survived, we predicted 233 of them correctly (and 109 of them incorrectly). Of the 545 passengers that didn’t survive, we predicted 480 correctly (and 65 incorrectly).</p>

<p>We can use the confusion matrix to compute the accuracy. As a reminder, the accuracy is the number of datapoints predicted correctly divided by the total number of datapoints.</p>

<pre><code># accuracy = correct predictions / all predictions
print((233+480)/(233+65+109+480))
# 0.8038331454340474</code></pre>


<strong>The confusion matrix fully describes how a model performs on a dataset, though it is difficult to use for comparing models.</strong>

<h3>True Positives, True Negatives, False Positives, False Negatives</h3>


<p>In general, the four values are laid out in the confusion matrix as follows:</p>

<table border="1">
  <tr>
      <th></th>
      <th>Actual Positive</th>
      <th>Actual Negative</th>
  </tr>
  <tr style="background-color: white;">
      <th>Predicted Positive</th>
      <td style="background-color: lightblue;">TP</td>
      <td>FP</td>
  </tr>
  <tr>
      <th>Predicted Negative</th>
      <td>FN</td>
      <td style="background-color: lightblue;">TN</td>
  </tr>
</table>

<p>The four values of the confusion matrix (TP, TN, FP, FN) are used to compute several different metrics that we’ll use later on.</p>
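<p>As a quick check of how the four values relate, the Titanic counts from the earlier table can be plugged into this layout. This is only a sketch: the helper name <code>accuracy_from_counts</code> is an assumption for illustration, and accuracy is the one metric defined so far.</p>

```python
# Counts from the Titanic confusion matrix shown earlier.
tp, fp = 233, 65    # row: predicted positive
fn, tn = 109, 480   # row: predicted negative

# Each column of the matrix sums to the number of actual cases in that class.
actual_positive = tp + fn   # passengers who survived
actual_negative = fp + tn   # passengers who did not survive
total = tp + fp + fn + tn

def accuracy_from_counts(tp, fp, fn, tn):
    """Accuracy is the correct predictions (TP + TN) over all predictions."""
    return (tp + tn) / (tp + fp + fn + tn)

print(actual_positive, actual_negative, total)  # 342 545 887
print(accuracy_from_counts(tp, fp, fn, tn))     # about 0.8038
```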