In [23]:
# the star import already provides symbols, Integral, Sum, log, exp, ...
from sympy import *
from sympy.stats import Expectation, Normal, Probability, Poisson

In [24]:
# establish the symbols that we will use
e, n, c, t, x, y, z, L, X = symbols("e, n, c, t, x, y, z, L, X")
mu = symbols("mu")
theta = symbols("theta")
sigma = symbols("sigma", positive=True)
lamda = symbols("lamda")


### Score & Information
For simplicity, take $\theta$ to be one-dimensional. (For a vector $\theta$, apply the argument below to each component of $\theta$.)

Use the shorthand notation

$$dX = dX_1 \, dX_2 \cdots dX_n$$

and a single integral sign to denote integration over the region in $\mathbb{R}^n$. Then

$$
\begin{aligned}
E \bigg[ \dfrac{\partial}{\partial \theta} \log L(X, \theta) \bigg] &= \int \dfrac{\dfrac{\partial}{\partial \theta} L(X, \theta)}{L(X, \theta)} L(X, \theta) dX \\
&= \dfrac{\partial}{\partial\theta} \int L(X, \theta)dX \\
&= \dfrac{\partial}{\partial\theta}1 \\
&= 0
\end{aligned}
$$

(Here we assumed that differentiation and integration can be interchanged, and used the fact that the integral of any density over its support is equal to one.)
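As a quick sanity check of this identity, the sketch below verifies it symbolically for one concrete density. The Exponential$(\theta)$ density and the variable names are illustrative assumptions, not part of the general argument.

```python
from sympy import symbols, exp, log, diff, integrate, oo, simplify, expand_log

x, theta = symbols("x theta", positive=True)

# Assumed example density: Exponential with mean theta, supported on x > 0
f = exp(-x / theta) / theta

# Score: derivative of the log-density with respect to theta
score = diff(expand_log(log(f), force=True), theta)

# Expected score: integrate score * density over the support
expected_score = simplify(integrate(score * f, (x, 0, oo)))
print(expected_score)
```

For this density the expected score should come out as zero, in line with the derivation.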

The notion of Information defined above is fundamental in statistics. It helped establish R. A. Fisher (1890-1962), a renowned applied statistician, as one of the greatest statisticians of all time.

Among his pivotal contributions to the field, the introduction of the Maximum Likelihood Estimation method, the analysis of variance (ANOVA) and the notion of *Expected Fisher Information* (as defined above) are the most outstanding.

In [25]:
# expr = ((L) * (x, theta))
# expr

## Cramer Rao Lower bound and UMVUE
Calculate the Cramer-Rao lower bound for the variance of an unbiased estimator of $\theta$, and then find a statistic whose variance equals the bound, when $X_1, X_2,...,X_n$ are independent random variables, each with a distribution from the exponential family of distributions.

### Question 1
Exponential $(\theta): f(x, \theta) = \dfrac{1}{\theta}e^{-x/\theta}, x>0$

**Solution**

CRLB: $\dfrac{\theta^2}{n}$

UMVUE: $\bar{X}$

# step 1: define the expression using the pdf

In [26]:
# use sympy's exp; the symbol e declared above is a plain symbol, not Euler's number
expr = (1/theta)*exp(-x/theta)
expr

exp(-x/theta)/theta

# step 2: set up the integral

In [27]:
Integral(expr, x)

Integral(exp(-x/theta)/theta, x)
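One possible way to carry the calculation through to the stated bound is sketched below. It assumes the Fisher information is computed as minus the expected second derivative of the log-density; the names `info` and `crlb` are illustrative.

```python
from sympy import symbols, exp, log, diff, integrate, oo, simplify, expand_log

x, theta, n = symbols("x theta n", positive=True)

# Exponential(theta) density from Question 1
f = exp(-x / theta) / theta
log_f = expand_log(log(f), force=True)

# Fisher information of one observation: -E[d^2/dtheta^2 log f(X, theta)]
second = diff(log_f, theta, 2)
info = simplify(-integrate(second * f, (x, 0, oo)))

# Cramer-Rao lower bound for unbiased estimators of theta from n observations
crlb = simplify(1 / (n * info))
print(info, crlb)
```

The bound obtained this way should agree with the CRLB $\theta^2/n$ quoted in the solution, which is also the variance of $\bar{X}$.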

# step 3:

### Question 2
Bernoulli $(\theta): f(x,\theta) = \theta^x (1-\theta)^{1-x}, x \in \{0,1\}, \theta \in (0,1)$

**Solution**

CRLB: $\dfrac{\theta (1-\theta)}{n}$

UMVUE: $\bar{X}$

Define the expression using the pmf.

In [28]:
expr = theta ** x * (1 - theta) ** (1 - x)
expr

theta**x*(1 - theta)**(1 - x)
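For a discrete distribution the expectation is a sum over the support rather than an integral. The following sketch, with illustrative names `info` and `crlb`, sums over $x \in \{0, 1\}$ to recover the Fisher information and the bound for the Bernoulli case.

```python
from sympy import symbols, log, diff, simplify

theta, n = symbols("theta n", positive=True)
x = symbols("x")

# Bernoulli(theta) pmf and its log, written out term by term
pmf = theta**x * (1 - theta)**(1 - x)
log_pmf = x * log(theta) + (1 - x) * log(1 - theta)

# Fisher information: -E[second derivative], with E taken as a sum over {0, 1}
second = diff(log_pmf, theta, 2)
info = simplify(-sum(second.subs(x, k) * pmf.subs(x, k) for k in (0, 1)))

crlb = simplify(1 / (n * info))
print(info, crlb)
```

The result should match the CRLB $\theta(1-\theta)/n$ stated in the solution.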

### Question 3
$N(\theta, 1): f(x, \theta) = \dfrac{1}{\sqrt{2\pi}}e^{- \dfrac{1}{2}(x - \theta)^2}, x \in R,\theta \in R$

**Solution**

CRLB: $\dfrac{1}{n}$

UMVUE: $\bar{X}$
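A short symbolic check of this solution is sketched below. Because the second derivative of the log-density turns out to be a constant here, its expectation is that same constant and no integration is needed; the variable names are illustrative.

```python
from sympy import symbols, exp, sqrt, pi, log, diff, simplify, expand_log

x, theta = symbols("x theta", real=True)
n = symbols("n", positive=True)

# N(theta, 1) density from Question 3
f = exp(-(x - theta)**2 / 2) / sqrt(2 * pi)
log_f = expand_log(log(f), force=True)

# The second derivative in theta does not depend on x,
# so its expectation equals the constant itself
second = simplify(diff(log_f, theta, 2))
info = -second

crlb = 1 / (n * info)
print(info, crlb)
```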

### Question 4
$N(0,\theta): f(x,\theta) = \dfrac{1}{\sqrt{2 \pi \theta}} e^{-\dfrac{x^2}{2\theta}}, x \in R, \theta > 0$

**Hint**: Note that Var$(X_i^2)=2\theta^2$

**Solution**

CRLB: $\dfrac{2 \theta^2}{n}$

UMVUE: $\dfrac{1}{n} \sum_{i=1}^{n} X_i^2$

**Explanation**

CRLB: Since

$$f(x, \theta) = \dfrac{1}{\sqrt{2 \pi \theta}} e^{-\dfrac{x^2}{2\theta}}$$

we have

$$\log f(x, \theta) = -\dfrac{1}{2} \log 2\pi - \dfrac{1}{2} \log \theta - \dfrac{x^2}{2\theta}$$

with

$$\dfrac{\partial}{\partial \theta} \log f(x;\theta) = - \dfrac{1}{2\theta} + \dfrac{x^2}{2\theta^2}$$

and

$$\dfrac{\partial^2}{\partial \theta^2}\log f(x; \theta) = \dfrac{1}{2 \theta^2} - \dfrac{x^2}{\theta^3}$$

and finally

$$I_{X_1} (\theta) = -E \bigg[ \dfrac{1}{2\theta^2} - \dfrac{X^2}{\theta^3} \bigg] = - \dfrac{1}{2\theta^2} + \dfrac{\theta}{\theta^3} = \dfrac{1}{2\theta^2}$$

since

$$E(X^2) = \text{Var}(X) + (E(X))^2 = \theta + 0 = \theta$$

and thus, since we are estimating $\tau(\theta) = \theta$ with $\tau'(\theta) = 1$, the Cramer-Rao lower bound is

$$\dfrac{(\tau'(\theta))^2}{nI_{X_1}(\theta)} = \dfrac{2\theta^2}{n}$$
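The explanation above, together with the hint $\text{Var}(X_i^2) = 2\theta^2$, can be checked symbolically. This is a minimal sketch; the names `info`, `crlb` and `var_x2` are illustrative.

```python
from sympy import symbols, exp, sqrt, pi, log, diff, integrate, oo, simplify, expand_log

x = symbols("x", real=True)
theta, n = symbols("theta n", positive=True)

# N(0, theta) density, with theta the variance
f = exp(-x**2 / (2 * theta)) / sqrt(2 * pi * theta)
log_f = expand_log(log(f), force=True)

# Fisher information of one observation and the resulting bound
second = diff(log_f, theta, 2)
info = simplify(-integrate(second * f, (x, -oo, oo)))
crlb = simplify(1 / (n * info))

# Var(X^2) = E[X^4] - (E[X^2])^2, the quantity quoted in the hint
ex2 = integrate(x**2 * f, (x, -oo, oo))
ex4 = integrate(x**4 * f, (x, -oo, oo))
var_x2 = simplify(ex4 - ex2**2)
print(info, crlb, var_x2)
```

Since the $X_i^2$ are independent, the variance of $\frac{1}{n}\sum X_i^2$ is `var_x2 / n`, which should coincide with the bound.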

# UMVUE

### Question 1
Find the UMVUE of $\theta^2$ when $X_1, X_2,...,X_n$ are independent Bernoulli $(\theta)$ random variables. Check that your estimator does have mean $\theta^2$.

In [29]:
# 1) find a complete and sufficient statistic for theta

In [30]:
# 2) find an unbiased estimator for tau of theta

In [31]:
# 3) apply the Lehmann-Scheffe theorem to find the UMVUE
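The three steps can be sketched as follows, under these assumptions: $T = \sum X_i$ is taken as the complete sufficient statistic, $X_1 X_2$ as the unbiased estimator of $\theta^2$, and conditioning on $T$ gives the candidate $T(T-1)/(n(n-1))$ for $n \ge 2$. The helper `umvue_mean` is hypothetical and checks the mean for a fixed sample size by summing over the Binomial$(n, \theta)$ distribution of $T$.

```python
from sympy import symbols, binomial, expand, Rational

theta = symbols("theta", positive=True)

def umvue_mean(n):
    """E[T(T-1)/(n(n-1))] for T ~ Binomial(n, theta), with n >= 2 a Python int."""
    return expand(sum(
        Rational(t * (t - 1), n * (n - 1))          # candidate estimator at T = t
        * binomial(n, t) * theta**t * (1 - theta)**(n - t)  # P(T = t)
        for t in range(n + 1)
    ))

for n in range(2, 7):
    print(n, umvue_mean(n))
```

Each printed mean should reduce to `theta**2`, which is the unbiasedness check the question asks for.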

### Question 2
Find the UMVUE of $\theta^2$ when $X_1, X_2,...,X_n$ are independent random variables each with density

$$f(x;\theta) = \dfrac{1}{\theta} e^{- \dfrac{x}{\theta}},  x > 0, \theta > 0.$$

**Hint**: consider $\bar{X}$
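Following the hint, one candidate is a multiple of $\bar{X}^2$. The sketch below assumes that $T = n\bar{X}$ has a Gamma distribution with shape $n$ and scale $\theta$, and checks for a few fixed $n$ that $T^2/(n(n+1)) = n\bar{X}^2/(n+1)$ has mean $\theta^2$. The helper `mean_of_candidate` is hypothetical.

```python
from sympy import symbols, exp, factorial, integrate, oo, simplify

t, theta = symbols("t theta", positive=True)

def mean_of_candidate(n):
    """E[T**2 / (n*(n+1))] for T ~ Gamma(shape=n, scale=theta), n a Python int."""
    gamma_pdf = t**(n - 1) * exp(-t / theta) / (factorial(n - 1) * theta**n)
    candidate = t**2 / (n * (n + 1))
    return simplify(integrate(candidate * gamma_pdf, (t, 0, oo)))

for n in (1, 2, 3, 5):
    print(n, mean_of_candidate(n))
```

Because the candidate is a function of the complete sufficient statistic $\sum X_i$ and is unbiased, the Lehmann-Scheffe theorem would then identify it as the UMVUE.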

Suppose $X_1, X_2,...,X_n$ are independent Uniform $(0,\theta)$ random variables.

### Question 3
Find the UMVUE of $\theta^2$ and calculate its variance.
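A possible check for this question is sketched below. It assumes the sample maximum $M = \max X_i$, with density $n m^{n-1}/\theta^n$ on $(0, \theta)$, is the complete sufficient statistic, and that the candidate estimator is $\frac{n+2}{n} M^2$. The helper names are hypothetical, and $n$ is fixed to a concrete integer in each call.

```python
from sympy import symbols, integrate, simplify, Rational

m, theta = symbols("m theta", positive=True)

def moment_of_max(n, k):
    """E[M**k] for M = max of n Uniform(0, theta) draws."""
    return integrate(m**k * n * m**(n - 1) / theta**n, (m, 0, theta))

def mean_and_variance(n):
    """Mean and variance of the candidate (n + 2)/n * M**2."""
    c = Rational(n + 2, n)
    mean = simplify(c * moment_of_max(n, 2))
    var = simplify(c**2 * moment_of_max(n, 4) - mean**2)
    return mean, var

for n in (1, 2, 5):
    print(n, mean_and_variance(n))
```

Working the same integrals by hand gives mean $\theta^2$ and variance $\dfrac{4\theta^4}{n(n+4)}$, which the printed values should reproduce.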

### Question 4
Find the UMVUE of $\dfrac{1}{\theta}$.
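Assuming this question refers to the same Uniform $(0, \theta)$ sample, one candidate is $\dfrac{n-1}{n M}$ with $M = \max X_i$, which requires $n \ge 2$. The sketch below checks its mean for a few fixed $n$; the helper `mean_of_candidate` is hypothetical.

```python
from sympy import symbols, integrate, simplify, Rational

m, theta = symbols("m theta", positive=True)

def mean_of_candidate(n):
    """E[(n - 1) / (n * M)] for M = max of n Uniform(0, theta) draws, n >= 2."""
    max_pdf = n * m**(n - 1) / theta**n     # density of the sample maximum
    candidate = Rational(n - 1, n) / m
    return simplify(integrate(candidate * max_pdf, (m, 0, theta)))

for n in (2, 3, 6):
    print(n, mean_of_candidate(n))
```

For $n = 1$ the expectation of $1/M$ diverges, so the candidate is only meaningful for samples of size two or more.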