# Mod3/L1 Cramer-Rao Lower Bound and Fisher Information

## Introduction
In this module, we discuss the concepts of the Cramer-Rao Lower Bound (CRLB) and Fisher Information. These concepts help evaluate the quality of an estimator, particularly the large sample properties of maximum likelihood estimators (MLEs).

## Cramer-Rao Lower Bound (CRLB)
The CRLB provides a lower bound on the variance of any unbiased estimator of a function $\tau(\theta)$. It is useful for determining the efficiency of an estimator.

### Definition
For a random sample $X_1, X_2, \ldots, X_n$ from a distribution with parameter $\theta$, and under standard regularity conditions, the variance of any unbiased estimator $\hat{\tau}$ of $\tau(\theta)$ is bounded by:
$$ \text{Var}(\hat{\tau}) \geq \frac{(\tau'(\theta))^2}{I_n(\theta)} $$

## Fisher Information
The denominator $I_n(\theta)$ is known as the Fisher Information, which measures the amount of information that the sample provides about the parameter $\theta$.
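
As a worked illustration of where a concrete Fisher Information value can come from, the sketch below derives it for the mean of a normal distribution. It assumes $X_1, \ldots, X_n$ are i.i.d. $N(\mu, \sigma^2)$ with $\sigma^2$ known (an assumption made here for the illustration) and uses the expected-squared-score definition of $I_n$ given in the proof section of this lesson.

$$ \log L(\mu) = -\frac{n}{2} \log(2\pi\sigma^2) - \frac{1}{2\sigma^2} \sum_{i=1}^n (x_i - \mu)^2 $$

$$ \frac{\partial}{\partial \mu} \log L(\mu) = \frac{1}{\sigma^2} \sum_{i=1}^n (x_i - \mu) $$

$$ I_n(\mu) = E\left[\left(\frac{1}{\sigma^2} \sum_{i=1}^n (X_i - \mu)\right)^2\right] = \frac{n \sigma^2}{\sigma^4} = \frac{n}{\sigma^2} $$

The middle equality uses independence and $E[(X_i - \mu)^2] = \sigma^2$. This is the quantity `n / sigma2` computed in the R example that follows.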

### Example in R
```r
# Fisher Information about the mean of a normal distribution with known variance sigma2
fisher_information <- function(n, sigma2) {
  return(n / sigma2)
}

# Parameters
n <- 100
sigma2 <- 4

# Calculate Fisher Information
I_n <- fisher_information(n, sigma2)
cat(sprintf("Fisher Information: %.2f\n", I_n))

# Calculate Cramer-Rao Lower Bound for the mean (tau(theta) = theta)
tau_prime <- 1  # derivative of tau(theta) with respect to theta
crlb <- tau_prime^2 / I_n
cat(sprintf("Cramer-Rao Lower Bound: %.4f\n", crlb))
```

```
Fisher Information: 25.00
Cramer-Rao Lower Bound: 0.0400
```
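
One way to make the bound tangible is a small Monte Carlo check. The sketch below assumes normal data with a true mean of 0 and uses the sample mean as the unbiased estimator. The seed, the number of replicates `reps`, and the variable names are illustrative choices, not part of the lesson. If the sample mean is efficient, its simulated variance should sit close to `sigma2 / n`.

```r
# Monte Carlo sketch: compare the variance of the sample mean with the CRLB
set.seed(1)            # arbitrary seed, for reproducibility only
n <- 100
sigma2 <- 4
mu <- 0                # assumed true mean; the bound does not depend on it
reps <- 20000          # number of simulated samples (illustrative choice)

# Each replicate draws one sample of size n and records its sample mean
sample_means <- replicate(reps, mean(rnorm(n, mean = mu, sd = sqrt(sigma2))))

empirical_var <- var(sample_means)
crlb <- sigma2 / n     # tau'(theta)^2 / I_n(theta) with tau'(theta) = 1

cat(sprintf("Empirical variance of the sample mean: %.4f\n", empirical_var))
cat(sprintf("Cramer-Rao Lower Bound:                %.4f\n", crlb))
```

The two printed values should agree up to simulation noise, which is consistent with the sample mean attaining the bound in this model.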


## Proof of the Cramer-Rao Lower Bound
The proof of the CRLB is based on the Cauchy-Schwarz inequality. In the special case $\tau(\theta) = \theta$, the ***CRLB states that the variance of any unbiased estimator of $\theta$ is at least as large as the inverse of the Fisher Information***.

### Steps in the Proof
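**Cauchy-Schwarz Inequality**: $$ \left( \int f(x) h(x) \, dx \right)^2 \leq \left( \int f(x)^2 \, dx \right) \left( \int h(x)^2 \, dx \right) $$

**Expectation Form**: Let $T = \hat{\tau}$ be an unbiased estimator of $\tau(\theta)$, and let $f(X; \theta)$ denote the joint density of the sample. Under regularity conditions the score $\frac{\partial}{\partial \theta} \log f(X; \theta)$ has mean zero, so applying Cauchy-Schwarz to $T - \tau(\theta)$ and the score gives $$ \left( E\left[T \cdot \frac{\partial}{\partial \theta} \log f(X; \theta)\right] \right)^2 \leq \text{Var}(T) \cdot E\left[\left(\frac{\partial}{\partial \theta} \log f(X; \theta)\right)^2\right] $$

**Derivative of the Mean**: Differentiating $E[T] = \tau(\theta)$ with respect to $\theta$ under the integral sign gives $$ E\left[T \cdot \frac{\partial}{\partial \theta} \log f(X; \theta)\right] = \tau'(\theta) $$

**Fisher Information**: $$ I_n(\theta) = E\left[\left(\frac{\partial}{\partial \theta} \log f(X; \theta)\right)^2\right] $$

**CRLB**: Substituting the last two results into the expectation form and dividing by $I_n(\theta)$ gives $$ \text{Var}(\hat{\tau}) \geq \frac{(\tau'(\theta))^2}{I_n(\theta)} $$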
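
The ingredients of the proof can be checked numerically. The sketch below is a simulation under assumed settings (normal data with known variance, the sample mean as the estimator, and arbitrary values for `n`, `mu`, `sigma2`, `reps`, and the seed). It estimates the mean of the score, the covariance between the estimator and the score, and both sides of the Cauchy-Schwarz step.

```r
# Simulation sketch of the proof ingredients for N(mu, sigma2), sigma2 known
set.seed(2)            # arbitrary seed, for reproducibility only
n <- 50
mu <- 1                # assumed true mean
sigma2 <- 4
reps <- 20000          # number of simulated samples (illustrative choice)

# For each simulated sample, record the estimator and the score at the true mu
sims <- replicate(reps, {
  x <- rnorm(n, mean = mu, sd = sqrt(sigma2))
  c(estimate = mean(x), score = sum(x - mu) / sigma2)
})
estimate <- sims["estimate", ]
score <- sims["score", ]

mean_score <- mean(score)           # should be near 0
cov_ts <- cov(estimate, score)      # should be near tau'(mu) = 1
fisher_mc <- mean(score^2)          # Monte Carlo estimate of I_n(mu) = n / sigma2
lhs <- cov_ts^2                     # left side of the Cauchy-Schwarz step
rhs <- var(estimate) * var(score)   # right side of the Cauchy-Schwarz step

cat(sprintf("Mean of score: %.3f\n", mean_score))
cat(sprintf("Cov(T, score): %.3f\n", cov_ts))
cat(sprintf("Estimated I_n: %.2f (theory %.2f)\n", fisher_mc, n / sigma2))
cat(sprintf("Cauchy-Schwarz: %.4f <= %.4f\n", lhs, rhs))
```

In this particular model the two sides of the inequality nearly coincide, because the sample mean is a linear function of the score. That is the situation in which the bound is attained.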

## Example: Bernoulli Distribution
Suppose we have a random sample $X_1, X_2, \ldots, X_n$ from a Bernoulli distribution with parameter $p$. We want to find the CRLB for unbiased estimators of $p$.

### Steps
**PMF of Bernoulli Distribution**: $$ f(x; p) = p^x (1-p)^{1-x}, \quad x \in \{0, 1\} $$

**Log-Likelihood**: $$ \log L(p) = \sum_{i=1}^n \left[ x_i \log p + (1 - x_i) \log (1 - p) \right] $$

**Derivative of Log-Likelihood**: $$ \frac{\partial}{\partial p} \log L(p) = \sum_{i=1}^n \left[ \frac{x_i}{p} - \frac{1 - x_i}{1 - p} \right] $$

**Fisher Information**: $$ I_n(p) = \frac{n}{p(1-p)} $$

**CRLB**: $$ \text{Var}(\hat{p}) \geq \frac{p(1-p)}{n} $$
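
The step from the derivative of the log-likelihood to the Fisher Information can be filled in as follows. This is a sketch using only $E[X_i] = p$, $\text{Var}(X_i) = p(1-p)$, and independence of the $X_i$. First combine the two fractions in the score:

$$ \frac{\partial}{\partial p} \log L(p) = \sum_{i=1}^n \frac{x_i - p}{p(1-p)} $$

The score has mean zero, so its expected square equals its variance:

$$ I_n(p) = \text{Var}\left( \sum_{i=1}^n \frac{X_i - p}{p(1-p)} \right) = \frac{n \, p(1-p)}{p^2 (1-p)^2} = \frac{n}{p(1-p)} $$

With $\tau(p) = p$ we have $\tau'(p) = 1$, so the bound is $p(1-p)/n$. The sample proportion $\bar{X}$ is unbiased with $\text{Var}(\bar{X}) = p(1-p)/n$, so it attains the bound.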

#### Example in R

```r
# Parameters
n <- 100
p <- 0.5

# Fisher Information for Bernoulli distribution
fisher_information_bernoulli <- function(n, p) {
  return(n / (p * (1 - p)))
}

# Calculate Fisher Information
I_n <- fisher_information_bernoulli(n, p)
cat(sprintf("Fisher Information: %.2f\n", I_n))

# Calculate Cramer-Rao Lower Bound for p
tau_prime <- 1  # derivative of tau(p) with respect to p
crlb <- tau_prime^2 / I_n
cat(sprintf("Cramer-Rao Lower Bound: %.4f\n", crlb))
```

```
Fisher Information: 400.00
Cramer-Rao Lower Bound: 0.0025
```
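
A simulation can show how close the sample proportion comes to this bound. The sketch below assumes the same `n` and `p` as the example above. The seed, the number of replicates `reps`, and the variable names are illustrative choices. It relies on the fact that the sum of `n` Bernoulli draws is binomial, so `rbinom()` can generate each sample's success count directly.

```r
# Monte Carlo sketch: variance of the sample proportion versus the CRLB
set.seed(3)            # arbitrary seed, for reproducibility only
n <- 100
p <- 0.5
reps <- 20000          # number of simulated samples (illustrative choice)

# The number of successes in n Bernoulli(p) trials is Binomial(n, p)
p_hat <- rbinom(reps, size = n, prob = p) / n

empirical_var <- var(p_hat)
crlb <- p * (1 - p) / n

cat(sprintf("Empirical variance of p-hat: %.5f\n", empirical_var))
cat(sprintf("Cramer-Rao Lower Bound:      %.5f\n", crlb))
```

The two values should agree up to simulation noise, which matches the theoretical variance $p(1-p)/n$ of the sample proportion.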


## Conclusion
The Cramer-Rao Lower Bound (CRLB) and Fisher Information are essential tools for evaluating the efficiency of estimators. The CRLB provides a lower bound on the variance of unbiased estimators, while Fisher Information quantifies the amount of information in the sample about the parameter.

This concludes the lesson on the Cramer-Rao Lower Bound and Fisher Information. In the next lesson (see [mod3_summarytranscript_L2_computationalSimplificationsFisher.ipynb](mod3_summarytranscript_L2_computationalSimplificationsFisher.ipynb)), we will continue to explore more advanced topics and applications in statistical inference.