# Question 2:

Assume we have a classifier that produces a score between 0 and 1 for the probability of a particular loan application being fraudulent. In this scenario: a) what are false positives, b) what are false negatives, and c) what are the trade-offs between them in terms of dollars and how should the model be weighted accordingly?

## Classification:

A classifier is a function $f : \mathcal{X} \to \mathcal{Y}$ from an input space $\mathcal{X}$, e.g. loan applications, to a space of categorical (here binary) outcomes $\mathcal{Y} = \{ 0, 1 \}$, e.g. whether the application is fraudulent, where $y = 1$ denotes a fraudulent application.

## Answer:

Given an application $x \in \mathcal{X}$ with true label $y \in \mathcal{Y}$, a classifier $f$ produces the prediction

\begin{equation*}
    \hat{y} = f(x)
\end{equation*}

a) A false positive is the case where $y=0$ and $\hat{y}=1$: an honest application is flagged as fraudulent.

b) A false negative is the case where $y=1$ and $\hat{y}=0$: a fraudulent application is accepted as honest.

c) Assume that the cost of accepting a fraudulent loan application is $C_F$ and the cost of rejecting an honest application is $C_H$.

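For a given classifier $f$, let $N_{a,b}$ denote the number of applications whose true class is $a$ and whose predicted class is $b$, with $H$ for honest and $F$ for fraud. The confusion matrix is then

| Predicted \ Actual      | Actually honest | Actually fraudulent |
| ----------------------- | --------------- | ------------------- |
| $f(x) = \text{Honest}$  | $N_{H,H}$       | $N_{F,H}$           |
| $f(x) = \text{Fraud}$   | $N_{H,F}$       | $N_{F,F}$           |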

\begin{align*}
    \text{Sensitivity} &= \frac{N_{F,F}}{N_{F,F} + N_{F,H}} \\
    &= \text{Probability that an application is classified as fraudulent given that it is fraudulent.}
\end{align*}

\begin{align*}
    \text{Specificity} &= \frac{N_{H,H}}{N_{H,H} + N_{H,F}} \\
    &= \text{Probability that an application is classified as honest given that it is honest.}
\end{align*}
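
As a minimal sketch of the two definitions above, the snippet below computes sensitivity and specificity from the four confusion-matrix counts. The counts are hypothetical and chosen only for illustration; the function and variable names are not part of the original answer.

```python
def sensitivity(n_ff, n_fh):
    """Share of fraudulent applications classified as fraud: N_FF / (N_FF + N_FH)."""
    return n_ff / (n_ff + n_fh)


def specificity(n_hh, n_hf):
    """Share of honest applications classified as honest: N_HH / (N_HH + N_HF)."""
    return n_hh / (n_hh + n_hf)


# Hypothetical counts (assumed for illustration only).
n_hh, n_hf = 900, 50   # honest applications: accepted, rejected
n_fh, n_ff = 10, 40    # fraudulent applications: accepted, rejected

sens = sensitivity(n_ff, n_fh)   # 40 / 50
spec = specificity(n_hh, n_hf)   # 900 / 950
print(f"sensitivity={sens:.3f}, specificity={spec:.3f}")
```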

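Weighting the false negative rate $(1 - \text{Sensitivity})$ by $C_F$ and the false positive rate $(1 - \text{Specificity})$ by $C_H$, the cost is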

\begin{align*}
    C &= \left( 1 - \frac{N_{F,F}}{N_{F,F} + N_{F,H}} \right) C_F \, + \left( 1- \frac{N_{H,H}}{N_{H,H} + N_{H,F}} \right) C_H \\
    &= \left( \frac{N_{F,H}}{N_{F,F} + N_{F,H}} \right) C_F \, + \left( \frac{N_{H,F}}{N_{H,H} + N_{H,F}} \right) C_H \\
    &= \frac{N_{F,H} C_F}{N_{F,F} + N_{F,H}} + \frac{N_{H,F} C_H }{N_{H,H} + N_{H,F}}
\end{align*}
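
The cost expression above can be evaluated directly from the confusion-matrix counts. The following is a sketch under assumed inputs: the counts and the dollar amounts for $C_F$ and $C_H$ are hypothetical, and the function name `weighted_error_cost` is introduced here only for illustration.

```python
def weighted_error_cost(n_hh, n_hf, n_fh, n_ff, c_f, c_h):
    """Cost C = FNR * C_F + FPR * C_H, following the expression above."""
    false_negative_rate = n_fh / (n_ff + n_fh)   # 1 - sensitivity
    false_positive_rate = n_hf / (n_hh + n_hf)   # 1 - specificity
    return false_negative_rate * c_f + false_positive_rate * c_h


# Hypothetical counts and dollar costs (assumptions, not from the question).
cost = weighted_error_cost(
    n_hh=900, n_hf=50, n_fh=10, n_ff=40,
    c_f=10_000.0,   # assumed loss from accepting a fraudulent application
    c_h=200.0,      # assumed loss from rejecting an honest application
)
print(f"C = {cost:.2f}")
```

With a large $C_F$ relative to $C_H$, as assumed here, the false negative term dominates even though false negatives are rarer than false positives in these counts.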

Assume that an application $x \in \mathcal{X}$ is mapped to some real number $r \in \mathbb{R}$, such as the fraud score. A simple classifier can divide this space using a single threshold value $\tau$ such that

\begin{align*}
    f(r) = \begin{cases}
        \text{Honest}, & \text{if } r \leq \tau \\
        \text{Fraud}, & \text{if } r > \tau
    \end{cases}
\end{align*}
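
The threshold rule above translates into a one-line function. This is a minimal sketch; the scores and the value $\tau = 0.5$ are assumptions chosen for illustration.

```python
def classify(r, tau):
    """Return "Fraud" if the score r exceeds the threshold tau, else "Honest"."""
    return "Fraud" if r > tau else "Honest"


# Hypothetical scores in [0, 1] and an assumed threshold.
scores = [0.05, 0.40, 0.70, 0.95]
labels = [classify(r, tau=0.5) for r in scores]
print(labels)
```

Note that a score exactly equal to $\tau$ is classified as $\text{Honest}$, matching the $r \leq \tau$ case.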

Notice that increasing $\tau$ corresponds to increasing 

\begin{align*}
    \frac{N_{H,H} + N_{F,H}}{N_{H,H} + N_{F,H} + N_{H,F} + N_{F,F}}
\end{align*}

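since a larger proportion of all applications will be classified as $\text{Honest}$. Consequently, the number of false positives $N_{H,F}$ cannot increase and the number of false negatives $N_{F,H}$ cannot decrease, so raising $\tau$ trades false positives for false negatives.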
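
To illustrate the effect of $\tau$, the sketch below sweeps a few thresholds over a small, made-up set of scored applications, reports the fraction classified as $\text{Honest}$, and evaluates the cost $C$ defined earlier at each threshold. The data, the dollar costs, and the candidate thresholds are all hypothetical; picking the candidate with the lowest cost is one plausible way to set $\tau$, not a prescribed method.

```python
# Hypothetical scores and true labels (1 = fraud, 0 = honest).
scores = [0.05, 0.10, 0.20, 0.35, 0.45, 0.60, 0.75, 0.90]
y_true = [0, 0, 0, 1, 0, 1, 0, 1]


def honest_fraction(scores, tau):
    """Fraction of all applications classified as Honest (score <= tau)."""
    return sum(1 for r in scores if r <= tau) / len(scores)


def cost_at(scores, y_true, tau, c_f, c_h):
    """Cost C = FNR * C_F + FPR * C_H for the threshold classifier at tau."""
    pairs = list(zip(scores, y_true))
    n_fh = sum(1 for r, y in pairs if y == 1 and r <= tau)  # fraud accepted
    n_ff = sum(1 for r, y in pairs if y == 1 and r > tau)   # fraud rejected
    n_hf = sum(1 for r, y in pairs if y == 0 and r > tau)   # honest rejected
    n_hh = sum(1 for r, y in pairs if y == 0 and r <= tau)  # honest accepted
    return n_fh / (n_ff + n_fh) * c_f + n_hf / (n_hh + n_hf) * c_h


C_F, C_H = 10_000.0, 200.0            # assumed dollar costs
taus = [0.0, 0.25, 0.5, 0.75, 1.0]    # assumed candidate thresholds

fractions = [honest_fraction(scores, t) for t in taus]
costs = [cost_at(scores, y_true, t, C_F, C_H) for t in taus]
best_tau = min(taus, key=lambda t: cost_at(scores, y_true, t, C_F, C_H))

for t, frac, c in zip(taus, fractions, costs):
    print(f"tau={t:.2f}  honest fraction={frac:.3f}  cost={c:.2f}")
print("lowest-cost candidate:", best_tau)
```

Because $C_F$ is assumed to be much larger than $C_H$, the lowest-cost candidate is a low threshold that accepts some false positives in order to avoid false negatives.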