In [2]:
import numpy as np
import pandas as pd

# Discrete Uniform Distribution

The Discrete Uniform Distribution is a probability distribution for discrete random variables where all possible values have the same probability of occurring. If a variable \( Y \) follows a discrete uniform distribution over a finite set of values \( \{a, a+1, ..., b\} \), we write \( Y \leadsto Uniform(a, b) \) or \( Y \leadsto U(a, b) \).

The formula for the probability function of the Discrete Uniform Distribution is:

$$ P(Y = y) = \frac{1}{b-a+1}, \quad \text{for } y \in \{a, a+1, ..., b\} $$

where \( a \) and \( b \) are the minimum and maximum values, respectively, that the random variable can take.

## Properties of the Discrete Uniform Distribution

1. **Expectation (Mean)**:
   
   The expectation \( E(Y) \) of a random variable \( Y \) that follows a discrete uniform distribution \( U(a, b) \) is given by:
   
   $$ E(Y) = \frac{a + b}{2} $$
   
2. **Variance**:
   
   The variance \( Var(Y) \) of a random variable \( Y \) that follows a discrete uniform distribution \( U(a, b) \) is given by:
   
   $$ Var(Y) = \frac{(b - a + 1)^2 - 1}{12} $$
   
3. **Moment Generating Function (MGF)**:
   
   The moment generating function \( M_Y(t) \) for a random variable \( Y \) that follows a discrete uniform distribution \( U(a, b) \) is given by:
   
   $$ M_Y(t) = \frac{e^{at} - e^{(b+1)t}}{(b - a + 1)(1 - e^t)}, \quad t \neq 0 $$
   
   with \( M_Y(0) = 1 \), since the closed form is undefined at \( t = 0 \).

## Example

Suppose a random variable \( Y \) follows a discrete uniform distribution \( U(1, 6) \), representing the roll of a fair six-sided die. The probability of any value \( y \) between 1 and 6 is equal to:

$$ P(Y = y) = \frac{1}{6}, \quad \text{for } y \in \{1, 2, 3, 4, 5, 6\} $$

The expectation (mean) \( E(Y) \) and variance \( Var(Y) \) for this example are:

$$ E(Y) = \frac{1 + 6}{2} = 3.5 $$

$$ Var(Y) = \frac{(6 - 1 + 1)^2 - 1}{12} = \frac{36 - 1}{12} = \frac{35}{12} \approx 2.92 $$

The moment generating function \( M_Y(t) \) for \( Y \leadsto U(1, 6) \) is:

$$ M_Y(t) = \frac{e^t - e^{7t}}{6(1 - e^t)} $$

Differentiating \( M_Y(t) \) \( k \) times and taking the limit \( t \to 0 \) yields the moment \( E(Y^k) \) of any order \( k \).
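The die example can be checked numerically. The sketch below is a minimal illustration, not part of the original notebook: the helper names `discrete_uniform_moments`, `mgf_by_sum` and `mgf_closed_form` are hypothetical. It computes the mean and variance by direct enumeration with exact fractions and compares the closed-form MGF with the defining sum \( E(e^{tY}) \).

```python
import math
from fractions import Fraction

def discrete_uniform_moments(a, b):
    """Return (mean, variance) of U(a, b) by enumerating the support."""
    support = range(a, b + 1)
    p = Fraction(1, b - a + 1)                 # same probability for every value
    mean = sum(p * y for y in support)
    second_moment = sum(p * y * y for y in support)
    return mean, second_moment - mean ** 2

def mgf_by_sum(a, b, t):
    """MGF from its definition: E[exp(tY)] as a finite sum."""
    return sum(math.exp(t * y) for y in range(a, b + 1)) / (b - a + 1)

def mgf_closed_form(a, b, t):
    """Closed-form MGF of U(a, b); only valid for t != 0."""
    return (math.exp(a * t) - math.exp((b + 1) * t)) / ((b - a + 1) * (1 - math.exp(t)))

mean, var = discrete_uniform_moments(1, 6)
print(mean, var)  # 7/2 35/12
print(math.isclose(mgf_by_sum(1, 6, 0.3), mgf_closed_form(1, 6, 0.3)))  # True
```

Using `Fraction` keeps the arithmetic exact, so the result matches \( 35/12 \) without rounding.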

In [None]:
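def Uniform(a, b, x):
    """PMF of the discrete uniform distribution U(a, b) evaluated at x."""
    # Only the integers a, a+1, ..., b carry probability mass.
    if a <= x <= b and x == int(x):
        return 1 / (b - a + 1)
    return 0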

In [None]:
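import math

def Uniform_CDF(a, b, x):
    """CDF of the discrete uniform distribution U(a, b): P(Y <= x)."""
    if x < a:
        return 0
    elif x >= b:
        return 1
    else:
        # The CDF is a step function, so round x down to the nearest integer.
        return (math.floor(x) - a + 1) / (b - a + 1)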
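As a sanity check on the probability function, simulated draws should land on each value with relative frequency close to \( 1/(b - a + 1) \). The sketch below is only an illustration under stated assumptions: the helper `empirical_pmf`, the seed and the number of draws are arbitrary choices, and it uses the standard-library `random` module rather than anything from the notebook.

```python
import random
from collections import Counter

def empirical_pmf(a, b, n_draws, seed=0):
    """Estimate P(Y = y) for Y ~ U(a, b) from n_draws simulated values."""
    rng = random.Random(seed)
    # randint includes both endpoints, matching the support {a, ..., b}.
    counts = Counter(rng.randint(a, b) for _ in range(n_draws))
    return {y: counts[y] / n_draws for y in range(a, b + 1)}

pmf_hat = empirical_pmf(1, 6, n_draws=60_000)
for y, p in pmf_hat.items():
    print(y, round(p, 3))  # each frequency should be near 1/6
```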

### Some Uniform Distribution Estimators

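Consider a random variable \( X \) that follows a continuous uniform distribution over the interval \([a, b]\), with density \( \frac{1}{b-a} \) on that interval, denoted as \( X \sim U(a, b) \), and an iid sample \( X_1, \ldots, X_n \) drawn from it.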

#### Method of Moments Estimator

Using the method of moments, we equate the sample moments to the population moments.

1. The mean of \( X \) is given by:

$$ E[X] = \frac{a + b}{2} $$

   The sample mean \( \overline{X_n} \) is:

$$ \overline{X_n} = \frac{\sum_{i=1}^{n} X_i}{n} $$

   Equating the two, we get:

$$ \frac{a + b}{2} = \overline{X_n} $$

   Solving for \( b \), we get:

$$ b = 2\overline{X_n} - a $$

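   This single equation cannot identify both parameters, so \( a \) is estimated by the sample minimum and plugged into the relation above. The resulting estimators are: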

$$ \tilde{a} = \min(X_i) $$
$$ \tilde{b} = 2\overline{X_n} - \tilde{a} $$

#### Maximum Likelihood Estimator (MLE)

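For an iid sample from the continuous uniform distribution, the likelihood function is:

$$ L(a, b \mid X_1, X_2, \ldots, X_n) = \prod_{i=1}^{n} \frac{1}{b-a} \mathbf{1}\{a \le X_i \le b\} = \begin{cases} (b-a)^{-n} & \text{if } a \le \min(X_i) \text{ and } \max(X_i) \le b \\ 0 & \text{otherwise} \end{cases} $$

   On the region where the likelihood is positive, the log-likelihood function is:

$$ \ln L(a, b \mid X_1, X_2, \ldots, X_n) = -n \ln(b-a) $$

   The log-likelihood decreases as \( b - a \) grows, so it is maximized by the shortest interval \([a, b]\) that still covers all observed data points. Thus: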

$$ \hat{a} = \min(X_i) $$
$$ \hat{b} = \max(X_i) $$

#### Estimators List:

1. \( \tilde{a} = \min(X_i) \) (MM)
2. \( \tilde{b} = 2\overline{X_n} - \tilde{a} \) (MM)
3. \( \hat{a} = \min(X_i) \) (MLE)
4. \( \hat{b} = \max(X_i) \) (MLE)
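The four estimators in the list can be computed directly from a sample. The sketch below is illustrative only: the function name `uniform_estimators`, the dictionary keys, the true parameters \( a = 2 \), \( b = 5 \) and the seed are all assumptions made for the example.

```python
import random

def uniform_estimators(sample):
    """Return the four estimators listed above for a sample from U(a, b)."""
    sample_mean = sum(sample) / len(sample)
    a_mm = min(sample)                 # a-tilde: sample minimum
    b_mm = 2 * sample_mean - a_mm      # b-tilde: from (a + b) / 2 = sample mean
    a_mle = min(sample)                # a-hat
    b_mle = max(sample)                # b-hat
    return {"a_mm": a_mm, "b_mm": b_mm, "a_mle": a_mle, "b_mle": b_mle}

rng = random.Random(42)
true_a, true_b = 2.0, 5.0              # hypothetical true parameters
sample = [rng.uniform(true_a, true_b) for _ in range(1000)]
estimates = uniform_estimators(sample)
print(estimates)
```

Note that \( \hat{b} \) can never exceed the true \( b \), whereas \( \tilde{b} \) can fall on either side of it.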

#### Bias of an Estimator

Considering the MLE estimators:

$$ b_{a}[\hat{a}] = E[\hat{a}] - a $$
$$ b_{b}[\hat{b}] = E[\hat{b}] - b $$

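For an iid sample of size \( n \) from the continuous uniform distribution, the expectations of the minimum and maximum are:

$$ E[\hat{a}] = a + \frac{b - a}{n + 1} $$
$$ E[\hat{b}] = b - \frac{b - a}{n + 1} $$

Both biases have magnitude \( \frac{b - a}{n + 1} \) and vanish as \( n \to \infty \). Thus, the MLE estimators are biased for finite \( n \) but asymptotically unbiased.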

#### Risk of an Estimator

The risk (mean squared error) of the estimators:

$$ R_{a}[\hat{a}] = (b_{a}[\hat{a}])^2 + Var[\hat{a}] $$
$$ R_{b}[\hat{b}] = (b_{b}[\hat{b}])^2 + Var[\hat{b}] $$

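For the continuous uniform distribution, each MLE estimator has bias of magnitude \( \frac{b - a}{n + 1} \) and variance \( \frac{n (b - a)^2}{(n + 1)^2 (n + 2)} \). The squared bias and the variance are both of order \( 1/n^2 \), so neither term can be neglected:

$$ R_{a}[\hat{a}] = R_{b}[\hat{b}] = \frac{(b - a)^2}{(n + 1)^2} + \frac{n (b - a)^2}{(n + 1)^2 (n + 2)} = \frac{2 (b - a)^2}{(n + 1)(n + 2)} $$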
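A small Monte Carlo experiment lets the reader compare the squared bias with the variance of \( \hat{b} = \max(X_i) \) and confirm the decomposition of the risk. This is a sketch under stated assumptions: the function name `simulate_max_estimator`, the parameters \( a = 0 \), \( b = 1 \), \( n = 20 \), the number of repetitions and the seed are arbitrary choices for illustration.

```python
import random

def simulate_max_estimator(a, b, n, reps, seed=0):
    """Monte Carlo estimate of bias, variance and MSE of max(X_1..X_n)."""
    rng = random.Random(seed)
    estimates = [max(rng.uniform(a, b) for _ in range(n)) for _ in range(reps)]
    mean_est = sum(estimates) / reps
    bias = mean_est - b
    variance = sum((e - mean_est) ** 2 for e in estimates) / reps
    mse = sum((e - b) ** 2 for e in estimates) / reps
    return bias, variance, mse

bias, variance, mse = simulate_max_estimator(a=0.0, b=1.0, n=20, reps=20_000)
print(f"bias = {bias:.4f}, variance = {variance:.5f}, MSE = {mse:.5f}")
print(f"bias^2 + variance = {bias ** 2 + variance:.5f}")
```

The simulated bias should be negative and close to \( -(b - a)/(n + 1) \), since the maximum can only underestimate \( b \).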

#### Consistency of an Estimator

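For the MLE estimators, let \( (X_n) \) be a collection of iid samples from a continuous uniform distribution \( U(a, b) \). For any \( 0 < \varepsilon < b - a \):

$$ P(b - \hat{b} > \varepsilon) = P(\max(X_i) \le b - \varepsilon) = \left(1 - \frac{\varepsilon}{b - a}\right)^n \longrightarrow 0 $$

and, by symmetry, \( P(\hat{a} - a > \varepsilon) \longrightarrow 0 \). The law of large numbers is not needed here, because the tail probabilities are available in closed form. Since the running minimum is non-increasing and the running maximum is non-decreasing in \( n \), convergence in probability strengthens to almost sure convergence:

$$ \min(X_i) \overset{a.s.}{\longrightarrow} a $$
$$ \max(X_i) \overset{a.s.}{\longrightarrow} b $$

Thus, \( \hat{a} \) and \( \hat{b} \) are consistent estimators.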
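To see the consistency of \( \hat{b} \) concretely, note that for iid draws from a continuous \( U(a, b) \) the probability that the maximum misses \( b \) by more than \( \varepsilon \) is \( \left(1 - \frac{\varepsilon}{b - a}\right)^n \). The sketch below evaluates this expression for growing \( n \); the function name `prob_max_misses` and the chosen values are assumptions for illustration.

```python
def prob_max_misses(a, b, eps, n):
    """P(b - max(X_1..X_n) > eps) for iid continuous U(a, b), 0 <= eps <= b - a."""
    return (1 - eps / (b - a)) ** n

for n in (10, 100, 1000):
    # The probability shrinks geometrically as the sample size grows.
    print(n, prob_max_misses(0.0, 1.0, eps=0.01, n=n))
```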

#### Convergence Speed

The central limit theorem applies to sample means, not to sample extremes, so the rate here is not \( \sqrt{n} \). For the continuous uniform distribution and any \( t > 0 \):

$$ P\left(n(b - \hat{b}) > t\right) = \left(1 - \frac{t}{n(b - a)}\right)^n \longrightarrow e^{-t/(b - a)} $$

Thus, the convergence speed is:

$$ n(\hat{a} - a) \mathrel{\overset{d}{\longrightarrow}} \mathrm{Exp}\left(\tfrac{1}{b - a}\right) $$
$$ n(b - \hat{b}) \mathrel{\overset{d}{\longrightarrow}} \mathrm{Exp}\left(\tfrac{1}{b - a}\right) $$

Where \( \mathrm{Exp}\left(\tfrac{1}{b - a}\right) \) is the exponential distribution with rate \( \tfrac{1}{b - a} \), that is, with mean \( b - a \). The MLE estimators therefore converge at rate \( n \), faster than the usual \( \sqrt{n} \), and the limiting distribution is exponential rather than normal.
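A quick simulation can be used to check empirically which scaling of the estimation error stabilizes. The sketch below compares the average of \( \sqrt{n}(b - \hat{b}) \) with the average of \( n(b - \hat{b}) \) for increasing \( n \). For the continuous uniform distribution, \( E[b - \hat{b}] = \frac{b - a}{n + 1} \), so the second quantity should settle near \( b - a \) while the first shrinks toward zero. The function name `scaled_gaps`, the parameters and the seed are assumptions made for this illustration.

```python
import math
import random

def scaled_gaps(a, b, n, reps, seed=0):
    """Average of sqrt(n) * (b - max) and n * (b - max) over reps simulated samples."""
    rng = random.Random(seed)
    gaps = [b - max(rng.uniform(a, b) for _ in range(n)) for _ in range(reps)]
    mean_gap = sum(gaps) / reps
    return math.sqrt(n) * mean_gap, n * mean_gap

for n in (10, 100, 1000):
    root_n_scaled, n_scaled = scaled_gaps(0.0, 1.0, n, reps=1000)
    print(n, round(root_n_scaled, 3), round(n_scaled, 3))
```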