In [1]:
from sympy import *

# Quadratic and Absolute Value Loss minimisation

**Question 1**: Show that for a given random variable $Y$ with a finite second moment, the function $q_1(a) = E[(Y-a)^2]$ is minimised at $a^* = E(Y)$.

**Solution**
Setting the derivative with respect to $a$ to zero, we get

$$
\begin{aligned}
\dfrac{\partial}{\partial a} E(Y - a)^2 &= \dfrac{\partial}{\partial a} \bigg[E(Y^2) - 2E(Y)a+a^2 \bigg] \\
&= -2E(Y) + 2a \\
&= 0 \\
\end{aligned}
$$

from which we deduce that the stationary point is $a^* = E(Y)$. Since the second derivative with respect to $a$ is $2 > 0$, this stationary point is a minimum.
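The first-order condition above can be checked symbolically. The following is a minimal sketch, assuming only that the first two moments of $Y$ exist; the symbol names `m1` and `m2` (standing for $E(Y)$ and $E(Y^2)$) are illustrative and not part of the original exercise.

```python
import sympy as sp

# a is the candidate point; m1 and m2 are placeholder symbols
# for E(Y) and E(Y^2) (illustrative names, assumed finite)
a, m1, m2 = sp.symbols("a m1 m2", real=True)

# E[(Y - a)^2] expanded by linearity of expectation
q1 = m2 - 2 * a * m1 + a**2

# First-order condition: dq1/da = 0
stationary = sp.solve(sp.diff(q1, a), a)

# Second derivative with respect to a; a positive constant means a minimum
curvature = sp.diff(q1, a, 2)

print(stationary, curvature)
```

The solver returns `m1`, that is $E(Y)$, and the curvature is a positive constant, which is consistent with the stationary point being a minimum.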

**Question 2**: Show that for a given random variable $Y$ with $E|Y| < \infty$, the function $q_2(b) = E|Y - b|$ is minimised at $b^* = \text{median}(Y)$.

**Solution**
We treat the continuous case for simplicity. Denote the density of $Y$ by $f(y)$ and the cdf by $F(y)$. Splitting the integral according to the sign of $Y - b$ (the definition of absolute value), we have:

$$
\begin{aligned}
\dfrac{\partial}{\partial b}E(|Y-b|) &= \dfrac{\partial}{\partial b} \bigg[ \int_{-\infty}^{b} (b-y)f(y)dy + \int_b^{\infty} (y - b)f(y)dy \bigg] \\
&= \dfrac{\partial}{\partial b} \bigg[bF(b) - \int_{-\infty}^{b} yf(y)dy + \int_{b}^{\infty} yf(y)dy - b(1 - F(b)) \bigg] \\
&= F(b) - (1 - F(b)) \\
&= 2F(b) - 1 \\
&= 0
\end{aligned}
$$

from which we can deduce that the stationary point $b^*$ satisfies $F(b^*)=0.5$, i.e. $b^*$ is the median. Since the second derivative $2f(b)$ is non-negative, this stationary point is a minimum.
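As a concrete check of the median result, the sketch below works through one example distribution. The choice of an Exponential(1) density is an assumption made purely for illustration; its median is $\ln 2$, so the minimiser of $E|Y-b|$ should come out as $\ln 2$.

```python
import sympy as sp

y, b = sp.symbols("y b", positive=True)

# Assumed example density: Exponential(1) on (0, oo), with cdf 1 - exp(-y)
f = sp.exp(-y)

# E|Y - b| split at b according to the sign of (y - b)
q2 = (sp.integrate((b - y) * f, (y, 0, b))
      + sp.integrate((y - b) * f, (y, b, sp.oo)))

# First-order condition; dq2 should equal 2*F(b) - 1 = 1 - 2*exp(-b)
dq2 = sp.simplify(sp.diff(q2, b))
b_star = sp.solve(dq2, b)

print(dq2, b_star)
```

For this density the derivative reduces to $2F(b) - 1$, matching the general derivation, and the stationary point agrees with the known median of the exponential distribution.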

# Bayes Estimator

Suppose a *single* observation $x$ is available from the uniform distribution with density

$$f(x|\theta) = \dfrac{1}{\theta}I_{(x,\infty)}(\theta), \quad \theta > 0$$

The prior on $\theta$ has density:

$$\tau(\theta) = \theta \exp(-\theta), \quad \theta > 0$$

**Question 1**: Find the Bayes estimator of $\theta$ with respect to quadratic loss.

**Solution**
Note that we have a *single* observation $X$ only. Now $f(x|\theta) = \dfrac{1}{\theta}I_{(x,\infty)}(\theta)$ implies that

$$g(x) = \int_{0}^{\infty} f(x|\theta) \tau (\theta) d\theta = \int_{x}^{\infty} \dfrac{1}{\theta}\theta e^{-\theta} d\theta = e^{-x}, x > 0 $$

hence

$$h(\theta | x) = \dfrac{f(x|\theta) \tau(\theta)}{g(x)} =
\begin{cases} e^{x-\theta} & \text{if } \theta > x \\ 0 & \text{if } 0 < \theta \le x \end{cases} $$

With respect to quadratic loss, the Bayes estimator $\delta_{\tau}(x)$ is the posterior mean:

$$
\begin{aligned}
\delta_{\tau}(x) &= \int_x^{\infty} \theta h(\theta|x)d\theta \\
&= \int_{x}^{\infty} \theta e^{x-\theta}d\theta \\
&= e^x \int_{x}^{\infty} \theta e^{-\theta} d\theta \\
&= e^x(xe^{-x} + e^{-x}) \\
&= x+1 
\end{aligned}
$$
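The marginal, posterior and posterior mean derived above can be reproduced symbolically. This is a minimal sketch using SymPy; the variable names are illustrative, and the indicator $I_{(x,\infty)}(\theta)$ is handled by restricting the integration range to $\theta > x$ rather than by an explicit piecewise function.

```python
import sympy as sp

x, theta = sp.symbols("x theta", positive=True)

# Likelihood 1/theta, valid on theta > x; the indicator is enforced
# through the integration limits below
likelihood = 1 / theta
prior = theta * sp.exp(-theta)

# Marginal density g(x): integrate the joint density over theta in (x, oo)
g = sp.integrate(likelihood * prior, (theta, x, sp.oo))

# Posterior density on theta > x
posterior = sp.simplify(likelihood * prior / g)

# Bayes estimator under quadratic loss: the posterior mean
bayes_quadratic = sp.simplify(
    sp.integrate(theta * posterior, (theta, x, sp.oo))
)

print(g, posterior, bayes_quadratic)
```

The posterior here is a unit exponential shifted to start at $x$, which is why its mean is $x$ plus the unit-exponential mean of 1.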

**Question 2**: Find the Bayes estimator of $\theta$ with respect to absolute value loss $L(\theta, a) = |\theta - a|$

**Solution**

With respect to absolute value loss, the Bayes estimator is the posterior median $m$, which solves the equation

$$\int_m^{\infty} e^{x - \theta} d \theta = \dfrac{1}{2}$$

and we get:

$$e^{x-m} = \dfrac{1}{2} \Rightarrow m - x = \ln 2 \Rightarrow m = x + \ln 2$$
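The median equation can also be solved symbolically. The sketch below assumes the posterior density $e^{x-\theta}$ on $\theta > x$ from the previous question; the symbol names are illustrative.

```python
import sympy as sp

x, theta, m = sp.symbols("x theta m", positive=True)

# Posterior density on theta > x, taken from the quadratic-loss solution
posterior = sp.exp(x - theta)

# Posterior probability that theta exceeds m (for m > x)
upper_tail = sp.integrate(posterior, (theta, m, sp.oo))

# The posterior median puts probability 1/2 in the upper tail
m_star = sp.solve(sp.Eq(upper_tail, sp.Rational(1, 2)), m)

print(m_star)
```

Since $\ln 2 < 1$, the posterior median $x + \ln 2$ lies below the posterior mean $x + 1$, as expected for a right-skewed posterior.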

**Question**
Let $X_1, X_2,..., X_n$ be a random sample from the normal density with mean $\mu$ and variance 1. Consider estimating $\mu$ with a squared-error loss. Assume that the prior $\tau(\mu)$ is a normal density with mean $\mu_0$ and variance 1.

Show that the Bayes Estimator of $\mu$ is 

$$\dfrac{\mu_0 + \sum_{i=1}^{n} X_i}{n+1}$$

**Solution**

Let $X = (X_1,...,X_n)$ be the random sample. Setting $x_0 = \mu_0$ for notational convenience, so that the prior enters the sum as the term $i = 0$, we can write:

$$
\begin{aligned}
h(\mu|X=x) &\propto \exp \bigg\{ -\dfrac{1}{2} \sum_{i=0}^{n} (x_i - \mu)^2 \bigg\} \\
&\propto \exp \bigg\{ - \dfrac{n+1}{2} \bigg[\mu^2 - 2\mu \dfrac{\sum_{i=0}^{n}x_i}{n+1} \bigg] \bigg\}
\end{aligned}
$$

This also means (completing the square by multiplying by a factor that does not depend on $\mu$)

$$h(\mu|X=x) \propto \exp \bigg\{ - \dfrac{n+1}{2} \bigg[ \mu - \dfrac{\sum_{i=0}^{n}x_i}{n + 1} \bigg]^2 \bigg\}$$

which implies that $h(\mu|X=x)$, being a density, must be the density of

$$N \bigg( \dfrac{\sum_{i=0}^{n}x_i}{n+1}, \dfrac{1}{n+1} \bigg)$$

hence the Bayes estimator (being the posterior mean) would be

$$\dfrac{\sum_{i=0}^{n} x_i}{n+1} = \dfrac{\mu_0 + \sum_{i=1}^{n} X_i}{n + 1} = \dfrac{1}{n+1} \mu_0 + \dfrac{n}{n+1} \bar{X}$$

That is, the Bayes estimator is a convex combination of the prior mean and $\bar{X}$. In this combination, the weight on the prior mean is $1/(n+1)$, which shrinks to zero as the sample size increases. Because the normal posterior is symmetric about its mean, the posterior median equals the posterior mean, so the same estimator is obtained with respect to absolute value loss.
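The posterior mean and variance can be checked symbolically for a small sample. The sketch below fixes $n = 3$ with hypothetical observation symbols `x1, x2, x3` (an assumption for illustration only). It relies on the fact that the log-posterior is quadratic in $\mu$, so the posterior is normal, its mean is the stationary point of the log-posterior, and its precision is minus the second derivative.

```python
import sympy as sp

mu, mu0 = sp.symbols("mu mu0", real=True)

# Hypothetical sample of size n = 3 (symbols x1, x2, x3)
xs = sp.symbols("x1:4", real=True)
n = len(xs)

# Log-posterior up to an additive constant:
# N(mu, 1) likelihood terms plus the N(mu0, 1) prior term
log_post = (-sp.Rational(1, 2) * sum((xi - mu) ** 2 for xi in xs)
            - sp.Rational(1, 2) * (mu - mu0) ** 2)

# For a normal posterior, the stationary point of the log-density is the mean
post_mean = sp.solve(sp.diff(log_post, mu), mu)[0]

# Precision (inverse variance) is minus the second derivative
post_precision = sp.simplify(-sp.diff(log_post, mu, 2))

print(post_mean, post_precision)
```

With $n = 3$ the posterior mean is $(\mu_0 + x_1 + x_2 + x_3)/4$ and the precision is $4$, in line with the general formulas $(\mu_0 + \sum_{i=1}^{n} x_i)/(n+1)$ and variance $1/(n+1)$.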