## Sports Betting

### Fractional Odds

- **Definition**: Ratio of potential profit to stake
- **Format**: O_A:1
- **Interpretation**: Win profit of O_A units for every 1 unit staked
- **Example**: If O_A = 3 and you bet 1 unit, potential profit is 3 units, total return is 4 units
- **Implied Probability**: $P(A) = \frac{1}{O_A+1}$
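
The conversion between fractional odds and implied probability can be sketched in a few lines. The helper names `implied_probability` and `fractional_odds` are illustrative only and do not come from any library.

```python
def implied_probability(odds: float) -> float:
    """Implied probability for fractional odds of `odds`:1."""
    return 1.0 / (odds + 1.0)


def fractional_odds(probability: float) -> float:
    """Fractional odds O (quoted as O:1) implied by a win probability."""
    return (1.0 - probability) / probability


# Odds of 3:1 imply a 25% chance; a 25% chance implies odds of 3:1.
print(implied_probability(3.0))
print(fractional_odds(0.25))
```

The second helper is just the implied-probability formula solved for $O_A$, which is the conversion needed whenever a bookmaker's quote is given as a probability.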

## Arbitrage in Sports Betting

Arbitrage occurs when bets placed across all outcomes guarantee a profit regardless of which outcome occurs.

### Scenario

- Bet $x$ on A with odds O_A:1
- Bet $n-x$ on B with odds O_B:1, where $n$ is the total amount staked

### Profit Equations

- A wins: $-(n-x) + O_A \times x$
- B wins: $O_B \times (n-x) - x$

### Arbitrage Conditions

Set profits equal for fixed profit $p^*$:

$-(n-x) + O_A \times x = O_B \times (n-x) - x$

Solve for optimal bet $x^*$ and fixed profit $p^*$:

- $x^* = \frac{n(1+O_B)}{2+O_A+O_B}$
- $p^* = \frac{n(O_A \times O_B - 1)}{2+O_A+O_B}$

### Positive Profit Condition 

$p^* > 0$ when $O_A \times O_B > 1$
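
As a quick numerical check of the formulas above, the sketch below computes $x^*$ and $p^*$ and confirms that the profit is the same whichever side wins. The function name `arbitrage_stakes` and the example odds are assumptions made for illustration.

```python
def arbitrage_stakes(n: float, odds_a: float, odds_b: float):
    """Return (x_star, p_star): stake on A and the equalized profit."""
    denom = 2.0 + odds_a + odds_b
    x_star = n * (1.0 + odds_b) / denom
    p_star = n * (odds_a * odds_b - 1.0) / denom
    return x_star, p_star


# Hypothetical numbers: 100 units split across odds of 1.5:1 and 1:1.
n, odds_a, odds_b = 100.0, 1.5, 1.0
x_star, p_star = arbitrage_stakes(n, odds_a, odds_b)

# Profit in each outcome, using the profit equations from the notes.
profit_if_a = odds_a * x_star - (n - x_star)
profit_if_b = odds_b * (n - x_star) - x_star

print(f"x* = {x_star:.2f}, p* = {p_star:.2f}")
print(f"profit if A wins = {profit_if_a:.2f}, if B wins = {profit_if_b:.2f}")
```

In terms of implied probabilities, the condition $O_A \times O_B > 1$ is equivalent to $P(A) + P(B) < 1$, that is, the two quotes together leave some probability unaccounted for.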


### Question 1

You are considering placing bets on the upcoming Super Bowl between the Kansas City Chiefs and the San Francisco 49ers. Two different bookmakers are offering fair bets that imply the following probabilities:

- Bookmaker A: Chiefs win with an implied probability of 65%
- Bookmaker B: 49ers win with an implied probability of 45%

You have a total betting budget of $240.

Assuming you can split your bets between the two bookmakers (Chiefs with Bookmaker B, 49ers with Bookmaker A), what are the minimum and maximum amounts you can bet on the Chiefs while still guaranteeing a risk-free profit regardless of the game's outcome? What is the maximum guaranteed risk-free profit you can achieve?


In [14]:
## Code Solution
n = 240
p_A_chiefs = .65
p_B_49ers = .45

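# Convert each implied probability into fractional odds on the opposite outcome.
# Bookmaker A implies 65% for the Chiefs, so 35% for the 49ers: odds 0.65/0.35.
o_A_49ers = p_A_chiefs/(1-p_A_chiefs)
print(o_A_49ers)
# Bookmaker B implies 45% for the 49ers, so 55% for the Chiefs: odds 0.45/0.55.
o_B_chiefs = p_B_49ers/(1-p_B_49ers)


# Range of Chiefs stakes x (placed with Bookmaker B) that keeps both profits non-negative:
#   Chiefs win: o_B_chiefs*x - (n-x) >= 0
#   49ers win:  o_A_49ers*(n-x) - x >= 0
x_min = n / (1+o_B_chiefs)
x_max = n*o_A_49ers / (1+o_A_49ers)

# Chiefs stake that equalizes both profits, and the resulting guaranteed profit
x_star = n*(1+o_A_49ers) / (2+o_B_chiefs+o_A_49ers)
p_star = n*(o_B_chiefs*o_A_49ers-1) / (2+o_B_chiefs+o_A_49ers)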

print(f"Minimum bet on Chiefs: ${x_min:.2f}")
print(f"Maximum bet on Chiefs: ${x_max:.2f}")
print(f"Optimal bet on 49ers with Bookmaker A: ${n-x_star:.2f}")
print(f"Optimal bet on Chiefs with Bookmaker B: ${x_star:.2f}")
print(f"Maximum guaranteed profit: ${p_star:.2f}")

1.8571428571428574
Minimum bet on Chiefs: $132.00
Maximum bet on Chiefs: $156.00
Optimal bet on 49ers with Bookmaker A: $93.33
Optimal bet on Chiefs with Bookmaker B: $146.67
Maximum guaranteed profit: $26.67
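
The code uses bounds on the Chiefs stake that the notes above do not derive. One way to obtain them, writing $x$ for the stake on the Chiefs at odds $O_C$:1 and $n - x$ for the stake on the 49ers at odds $O_F$:1 (the symbols $O_C$ and $O_F$ are introduced here only for illustration), is to require both profits to be non-negative:

$$
O_C\,x - (n - x) \geq 0 \iff x \geq \frac{n}{1 + O_C}, \qquad O_F\,(n - x) - x \geq 0 \iff x \leq \frac{n\,O_F}{1 + O_F}
$$

With $O_C = 0.45/0.55$ and $O_F = 0.65/0.35$ this gives $0.55\,n \leq x \leq 0.65\,n$, which for the $n = 240$ used in the code matches the printed bounds of 132 and 156.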


# Probability Review: Discrete and Continuous Variables

## Overview
- **Discrete Random Variables**: Finite/countably infinite values
- **Continuous Random Variables**: Continuous range of values

## Probability Functions
- **Probability Mass Function (PMF)**: For discrete RV $X$, $P(X=x)$
- **Cumulative Distribution Function (CDF)**: For RV $X$, $F(x) = P(X \leq x)$
- **Probability Density Function (PDF)**: For continuous RV $X$, $f(x) = F'(x)$

## Expected Value (Mean)
- **Discrete RV** $X$: $E(X) = \sum_{i} x_i P(X = x_i)$
- **Continuous RV** $X$: $E(X) = \int_{-\infty}^{\infty} x f(x) \, dx$

## Variance
- For RV $X$: $Var(X) = E[(X - E(X))^2] = E(X^2) - [E(X)]^2$

## Properties
- $P(a \leq X \leq b) = F(b) - F(a)$ for continuous RV $X$
- $P(a \leq X \leq b) = \int_{a}^{b} f(x) \, dx$ for continuous RV $X$
- $\int_{-\infty}^{\infty} f(x) \, dx = 1$ for valid PDF $f(x)$

### Question 2

Consider a continuous random variable $X$ with the following probability density function (PDF):

$$
f(x) = \begin{cases}
cx(1-x), & 0 \leq x \leq 1 \\
0, & \text{otherwise}
\end{cases}
$$

a) Find the value of the constant $c$ that makes $f(x)$ a valid PDF.

b) Using the value of $c$ found in part (a), calculate the expected value of $X$, $E(X)$.

c) Calculate the variance of $X$, $Var(X)$, using the formula:

$$
Var(X) = E(X^2) - [E(X)]^2
$$

### Solution


a) For $f(x)$ to be a valid PDF, $\int_{-\infty}^{\infty} f(x) \, dx = 1$.

$$
\int_{0}^{1} cx(1-x) \, dx = c \left[\frac{x^2}{2} - \frac{x^3}{3}\right]_{0}^{1} = c \left(\frac{1}{2} - \frac{1}{3}\right) = \frac{c}{6} = 1
$$

Therefore, $c = 6$.

b) $E(X) = \int_{0}^{1} x \cdot 6x(1-x) \, dx = 6 \int_{0}^{1} (x^2 - x^3) \, dx = 6 \left[\frac{x^3}{3} - \frac{x^4}{4}\right]_{0}^{1} = \frac{1}{2}$

c) $E(X^2) = \int_{0}^{1} x^2 \cdot 6x(1-x) \, dx = 6 \int_{0}^{1} (x^3 - x^4) \, dx = 6 \left[\frac{x^4}{4} - \frac{x^5}{5}\right]_{0}^{1} = \frac{3}{10}$

$Var(X) = E(X^2) - [E(X)]^2 = \frac{3}{10} - \left(\frac{1}{2}\right)^2 = \frac{1}{20}$
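
The three results can be cross-checked numerically. The sketch below uses `scipy.integrate.quad` for the integrals; the helper names `shape` and `pdf` are assumptions made for this example.

```python
from scipy.integrate import quad


def shape(x: float) -> float:
    """Unnormalized density x(1 - x) on [0, 1]."""
    return x * (1.0 - x)


# Normalizing constant: c = 1 / (integral of x(1 - x) over [0, 1]).
area, _ = quad(shape, 0.0, 1.0)
c = 1.0 / area


def pdf(x: float) -> float:
    """Normalized density c * x(1 - x) on [0, 1]."""
    return c * shape(x)


mean, _ = quad(lambda x: x * pdf(x), 0.0, 1.0)
second_moment, _ = quad(lambda x: x**2 * pdf(x), 0.0, 1.0)
variance = second_moment - mean**2

print(f"c = {c:.4f}, E(X) = {mean:.4f}, Var(X) = {variance:.4f}")
```

The printed values should agree with $c = 6$, $E(X) = 1/2$ and $Var(X) = 1/20$ up to numerical integration error.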

## Normal Distribution
- Defined by parameters: mean ($\mu$), standard deviation ($\sigma$)
- PDF: $f(x|\mu,\sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$
- Symmetric around mean

- Standard Normal: $Z \sim N(0,1)$, $Z = \frac{X-\mu}{\sigma}$
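
Standardization means that any normal probability can be read off the standard normal CDF. A minimal sketch with `scipy.stats.norm`, using hypothetical parameters chosen only for illustration:

```python
from scipy.stats import norm

# Hypothetical parameters, chosen only for illustration.
mu, sigma = 100.0, 15.0
x = 115.0

z = (x - mu) / sigma                         # standardized value
p_direct = norm.cdf(x, loc=mu, scale=sigma)  # P(X <= x) for X ~ N(mu, sigma^2)
p_standard = norm.cdf(z)                     # same probability via Z ~ N(0, 1)

print(f"z = {z:.2f}")
print(f"P(X <= {x}) = {p_direct:.4f} (direct), {p_standard:.4f} (standardized)")
```

Both routes give the same probability, which is why the later solutions only need the standard normal distribution.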

## Central Limit Theorem (CLT)
- Let $X_1, X_2, \ldots, X_n$ be i.i.d. random variables with $E(X_i) = \mu$ and $\text{Var}(X_i) = \sigma^2 < \infty$
- Define $\bar{X}_n = \frac{1}{n}\sum_{i=1}^n X_i$ and $Z_n = \frac{\bar{X}_n - \mu}{\sigma/\sqrt{n}}$
- Then, as $n \to \infty$, $Z_n \xrightarrow{d} N(0,1)$
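
A small simulation can make the theorem concrete. The sketch below draws from an Exponential(1) distribution (an arbitrary choice of a clearly non-normal distribution with $\mu = \sigma = 1$) and checks that the standardized sample means behave like a standard normal variable. The sample size, number of trials, and seed are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, chosen arbitrarily

n = 200               # sample size behind each sample mean
trials = 20_000       # number of sample means to simulate
mu, sigma = 1.0, 1.0  # mean and standard deviation of Exponential(1)

samples = rng.exponential(scale=1.0, size=(trials, n))
z = (samples.mean(axis=1) - mu) / (sigma / np.sqrt(n))

print(f"mean of Z_n: {z.mean():.3f}")
print(f"std of Z_n:  {z.std():.3f}")
print(f"P(Z_n <= 1): {(z <= 1.0).mean():.3f}  (standard normal: 0.841)")
```

The mean and standard deviation of the simulated $Z_n$ should be close to 0 and 1, and the empirical probability close to the standard normal value.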


### Question 3

A factory produces widgets that are packaged into boxes. The weights of the widgets are independent and identically distributed random variables with a mean of 50 grams and a standard deviation of 5 grams. Each box contains 100 widgets.

The factory ships an order of 500 boxes. Using the Central Limit Theorem, estimate the probability that the total weight of the widgets in the 500 boxes is greater than 2,501 kilograms.



### Solution


Step 1: Determine the total number of widgets.
- Total number of widgets: $n = 100 \text{ widgets/box} \times 500 \text{ boxes} = 50,000 \text{ widgets}$

Step 2: Apply the Central Limit Theorem to the total weight of the widgets.
- The total weight of the widgets is the sum of a large number of independent and identically distributed random variables.
- By the Central Limit Theorem, the distribution of the total weight is approximately normal with:
  - Mean: $\mu_{\text{total}} = n \times \mu = 50,000 \times 50 = 2,500,000 \text{ grams} = 2,500 \text{ kilograms}$
  - Standard deviation: $\sigma_{\text{total}} = \sqrt{n} \times \sigma = \sqrt{50,000} \times 5 \approx 1,118.03 \text{ grams}$

Step 3: Calculate the z-score for the target weight.
- Target weight: $x = 2,501 \text{ kilograms} = 2,501,000 \text{ grams}$
- Z-score: $z = \frac{x - \mu_{\text{total}}}{\sigma_{\text{total}}} = \frac{2,501,000 - 2,500,000}{1,118.03} \approx 0.894$

Step 4: Calculate the probability using the standard normal distribution.
- Probability: $P(\text{Total weight} > 2,501 \text{ kg}) = 1 - P(Z \leq z) = 1 - P(Z \leq 0.894) \approx 0.1855$


In [46]:
from scipy.stats import norm

mu = 50
n = 100 * 500
sigma = 5
x = 2501 * 1000

z_score = (x - n * mu) / (sigma * (n ** 0.5))
probability = 1 - norm.cdf(z_score)

print(f"The probability that the total weight is greater than 2,501 kg is: {probability:.4f}")

The probability that the total weight is greater than 2,501 kg is: 0.1855


### Taylor Series

The Taylor Series is a powerful tool in mathematics that allows us to represent a function as an infinite sum of terms calculated from the function's derivatives at a single point. This series is named after the mathematician Brook Taylor, who introduced it in 1715.

#### Taylor Series for Functions of One Variable

##### Definition
Let $f(x)$ be a function that is infinitely differentiable at a point $a$. The Taylor Series of $f$ at $a$ is defined as follows, with equality holding wherever the series converges to $f$:

$$
f(x) = \sum_{n=0}^{\infty} \frac{f^{(n)}(a)}{n!}(x - a)^n
$$

where:
- $f^{(n)}(a)$ is the $n$-th derivative of $f$ evaluated at the point $a$,
- $n!$ is the factorial of $n$,
- $(x - a)^n$ is the $n$-th power of $(x - a)$.
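
To see the definition at work, the sketch below sums the first few Taylor terms of $e^x$, chosen because every derivative of $e^x$ at $a$ equals $e^a$. The function name `exp_taylor` is an assumption made for this example.

```python
import math


def exp_taylor(x: float, order: int, a: float = 0.0) -> float:
    """Partial Taylor sum of exp about `a`, up to the given order."""
    # Every derivative of exp evaluated at a equals exp(a).
    return sum(math.exp(a) * (x - a) ** n / math.factorial(n)
               for n in range(order + 1))


x = 0.5
for order in (1, 2, 4, 8):
    approx = exp_taylor(x, order)
    error = abs(approx - math.exp(x))
    print(f"order {order}: {approx:.8f}  error {error:.2e}")
```

The error shrinks quickly as more terms are included, which is what makes low-order truncations useful as approximations near $a$.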

#### Taylor Series for Functions of Two Variables

Let $f(x, y)$ be a function of two variables that is infinitely differentiable at a point $(a, b)$. The third-order Taylor Series expansion of $f$ around $(a, b)$ is:


\begin{align*}
f(x, y) &\approx f(a, b) + \frac{\partial f}{\partial x}(a,b)(x - a) + \frac{\partial f}{\partial y}(a,b)(y - b)\\
&+\frac{1}{2!}\left( \frac{\partial^2 f}{\partial x^2}(a,b)(x - a)^2 + 2\frac{\partial^2 f}{\partial x \partial y}(a,b)(x - a)(y - b) + \frac{\partial^2 f}{\partial y^2}(a,b)(y - b)^2 \right)\\
&+ \frac{1}{3!}\left( \frac{\partial^3 f}{\partial x^3}(a,b)(x - a)^3 + 3\frac{\partial^3 f}{\partial x^2 \partial y}(a,b)(x - a)^2(y - b) + 3\frac{\partial^3 f}{\partial x \partial y^2}(a,b)(x - a)(y - b)^2 + \frac{\partial^3 f}{\partial y^3}(a,b)(y - b)^3 \right)
\end{align*}



This expansion includes terms up to the third-order partial derivatives of $f$.




### Question 4

Consider the function $f(x, y) = \ln(2x + 3y + 1)$. Find the second-order Taylor Series approximation of $f(x, y)$ around the point $(1, 1)$.

#### Solution

Given:
- $f(x, y) = \ln(2x + 3y + 1)$
- The point of expansion is $(a, b) = (1, 1)$

Step 1: Calculate the necessary partial derivatives of $f(x, y)$ at $(1, 1)$.

- $f(1, 1) = \ln(2\cdot 1 + 3\cdot 1 + 1) = \ln(6)$
- $\frac{\partial f}{\partial x}(x, y) = \frac{2}{2x + 3y + 1}$, so $\frac{\partial f}{\partial x}(1, 1) = \frac{2}{6} = \frac{1}{3}$
- $\frac{\partial f}{\partial y}(x, y) = \frac{3}{2x + 3y + 1}$, so $\frac{\partial f}{\partial y}(1, 1) = \frac{3}{6} = \frac{1}{2}$
- $\frac{\partial^2 f}{\partial x^2}(x, y) = -\frac{2^2}{(2x + 3y + 1)^2}$, so $\frac{\partial^2 f}{\partial x^2}(1, 1) = -\frac{4}{6^2} = -\frac{1}{9}$
- $\frac{\partial^2 f}{\partial y^2}(x, y) = -\frac{3^2}{(2x + 3y + 1)^2}$, so $\frac{\partial^2 f}{\partial y^2}(1, 1) = -\frac{9}{6^2} = -\frac{1}{4}$
- $\frac{\partial^2 f}{\partial x \partial y}(x, y) = -\frac{2 \cdot 3}{(2x + 3y + 1)^2}$, so $\frac{\partial^2 f}{\partial x \partial y}(1, 1) = -\frac{6}{6^2} = -\frac{1}{6}$

Step 2: Substitute the partial derivatives into the second-order Taylor Series expansion formula.

$$
f(x, y) \approx \ln(6) + \frac{1}{3}(x - 1) + \frac{1}{2}(y - 1)
$$
$$
+\frac{1}{2!}\left( -\frac{1}{9}(x - 1)^2 - 2\cdot \frac{1}{6}(x - 1)(y - 1) - \frac{1}{4}(y - 1)^2 \right)
$$

Step 3: Simplify the expression.

$$
f(x, y) \approx \ln(6) + \frac{1}{3}(x - 1) + \frac{1}{2}(y - 1) - \frac{1}{18}(x - 1)^2 - \frac{1}{6}(x - 1)(y - 1) - \frac{1}{8}(y - 1)^2.
$$
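
The quality of this approximation can be checked numerically at points near $(1, 1)$. In the sketch below, the function names and the test points are assumptions made for illustration.

```python
import math


def f(x: float, y: float) -> float:
    """The original function ln(2x + 3y + 1)."""
    return math.log(2 * x + 3 * y + 1)


def f_taylor2(x: float, y: float) -> float:
    """Second-order Taylor approximation of f about (1, 1)."""
    dx, dy = x - 1.0, y - 1.0
    return (math.log(6)
            + dx / 3 + dy / 2
            - dx**2 / 18 - dx * dy / 6 - dy**2 / 8)


for x, y in [(1.0, 1.0), (1.1, 0.9), (1.3, 1.2)]:
    exact, approx = f(x, y), f_taylor2(x, y)
    print(f"({x}, {y}): exact {exact:.6f}, approx {approx:.6f}, "
          f"error {abs(exact - approx):.2e}")
```

The error is zero at the expansion point and grows as $(x, y)$ moves away from it, as expected for a truncated series.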



### Transformation of Variables

In probability theory, the transformation of variables is a technique used to find the probability density function (PDF) of a new random variable that is a function of an existing random variable with a known PDF.

#### Monotonic Functions

If $X$ is a continuous random variable with PDF $p_X(x)$ and $Y = g(X)$ is a new random variable, where $g$ is a monotonic function (either strictly increasing or strictly decreasing), then the PDF of $Y$ is given by:

$$
p_Y(y) = p_X(g^{-1}(y)) \left| \frac{d}{dy} (g^{-1}(y)) \right|
$$

where $g^{-1}$ is the inverse function of $g$.

#### Non-Invertible Functions

If the function $g$ is not invertible (i.e., it is not monotonic), we can still find the PDF of $Y$ using the following formula:

$$
p_Y(y) = \sum_{i} \frac{p_X(x_i)}{\left| g'(x_i) \right|}
$$

where $x_i$ are the roots of the equation $y = g(x)$, and $g'(x_i)$ is the derivative of $g$ evaluated at $x_i$.



#### Question 5

Let $X$ be a continuous random variable with probability density function (PDF) given by:

$$
p_X(x) = \begin{cases}
2x, & 0 \leq x \leq 1 \\
0, & \text{otherwise}
\end{cases}
$$


If $Y = 3X + 1$, find the PDF of $Y$.

#### Solution to Question 5

To find the PDF of $Y$, we first find the inverse function of $g$:

$y = 3x + 1 \implies x = g^{-1}(y) = \frac{y - 1}{3}$

Since $g$ is monotonic (strictly increasing) on the domain of $X$, we can use the formula for monotonic functions:

$p_Y(y) = p_X(g^{-1}(y)) \left| \frac{d}{dy} (g^{-1}(y)) \right|$

$p_Y(y) = 2 \cdot \frac{y - 1}{3} \cdot \left| \frac{1}{3} \right| = \frac{2(y - 1)}{9}$, for $1 \leq y \leq 4$

Therefore, the PDF of $Y$ is:

$$
p_Y(y) = \begin{cases}
\frac{2(y - 1)}{9}, & 1 \leq y \leq 4 \\
0, & \text{otherwise}
\end{cases}
$$
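
A Monte Carlo check of this result, as a sketch: since $F_X(x) = x^2$ on $[0, 1]$, samples of $X$ can be drawn as $\sqrt{U}$ with $U$ uniform on $[0, 1]$, and the empirical CDF of $Y = 3X + 1$ can be compared with the integral of the derived PDF. The sample size, seed, and the helper name `cdf_y` are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, chosen arbitrarily

# Inverse-CDF sampling: F_X(x) = x^2 on [0, 1], so X = sqrt(U).
x = np.sqrt(rng.uniform(size=200_000))
y = 3 * x + 1


def cdf_y(t: float) -> float:
    """CDF obtained by integrating p_Y(y) = 2(y - 1)/9 from 1 to t."""
    return (t - 1) ** 2 / 9


for t in (2.0, 3.0, 3.5):
    print(f"P(Y <= {t}): empirical {(y <= t).mean():.4f}, formula {cdf_y(t):.4f}")
```

Agreement between the two columns, up to sampling noise, supports the derived density on $1 \leq y \leq 4$.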

#### Question 6

Let $X$ be a continuous random variable with PDF given by:

$$
p_X(x) = \begin{cases}
\frac{1}{2}, & -1 \leq x \leq 1 \\
0, & \text{otherwise}
\end{cases}
$$

If $Y = X^2$, find the PDF of $Y$.

#### Solution to Question 6

Given:
- $X$ has PDF $p_X(x) = \frac{1}{2}$ for $-1 \leq x \leq 1$, and 0 otherwise.
- $Y = g(X) = X^2$

In this case, $g$ is not monotonic on the domain of $X$. We need to find the roots of the equation $y = x^2$ and use the formula for non-monotonic functions.

The roots are $x_1 = -\sqrt{y}$ and $x_2 = \sqrt{y}$, for $0 < y \leq 1$. At $y = 0$ the two roots coincide and $g'(0) = 0$, so that single point is excluded.

$p_Y(y) = \sum_{i=1}^{2} \frac{p_X(x_i)}{\left| g'(x_i) \right|} = \frac{p_X(-\sqrt{y})}{2\sqrt{y}} + \frac{p_X(\sqrt{y})}{2\sqrt{y}} = \frac{1}{2} \cdot \frac{1}{2\sqrt{y}} + \frac{1}{2} \cdot \frac{1}{2\sqrt{y}} = \frac{1}{2\sqrt{y}}$, for $0 < y \leq 1$

Therefore, the PDF of $Y$ is:

$$
p_Y(y) = \begin{cases}
\frac{1}{2\sqrt{y}}, & 0 < y \leq 1 \\
0, & \text{otherwise}
\end{cases}
$$
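
The same kind of simulation check works here, as a sketch: draw $X$ uniformly on $[-1, 1]$, square it, and compare the empirical CDF of $Y$ with $F_Y(t) = \sqrt{t}$, the integral of the derived PDF. The sample size, seed, and the helper name `cdf_y` are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, chosen arbitrarily

x = rng.uniform(-1.0, 1.0, size=200_000)
y = x**2


def cdf_y(t: float) -> float:
    """CDF obtained by integrating p_Y(y) = 1/(2*sqrt(y)) from 0 to t."""
    return float(np.sqrt(t))


for t in (0.04, 0.25, 0.81):
    print(f"P(Y <= {t}): empirical {(y <= t).mean():.4f}, formula {cdf_y(t):.4f}")
```

Both roots $\pm\sqrt{y}$ contribute to each value of $Y$, which is why the density is twice what a single branch would give.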
