## Bayes Risk with Absolute Loss

### Question 0 (M)

A suitable function is $f(x)=x^3$, whose derivative is $f^{'}(x)=3x^2$.  
At $x_0=0$ we have $f^{'}(x_0)=0$, yet $f(x_0)$ is not a local extremum because $f$ is strictly increasing on $\R$.
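
As a quick numerical illustration (a minimal sketch; the helper names `f` and `f_prime` and the step `h` are chosen here for illustration only), we can check that the derivative vanishes at $x_0 = 0$ while $f$ takes smaller values just to the left and larger values just to the right:

```python
def f(x):
    return x ** 3


def f_prime(x):
    # Analytic derivative of x^3
    return 3 * x ** 2


x0 = 0.0
h = 1e-3  # small step, arbitrary choice for the illustration

# The derivative vanishes at x0 ...
stationary = f_prime(x0) == 0.0
# ... yet f is smaller just left of x0 and larger just right of it,
# so x0 is neither a local minimum nor a local maximum.
not_extremum = f(x0 - h) < f(x0) < f(x0 + h)

print(stationary, not_extremum)
```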

### Question 1 (M + C)

<!-- To simplify the problem we propose a case where $Y$ is independant.
 -->
Let $X \leadsto U([0,1])$ and model $Y$ by $Y | X = x \leadsto Exp(x + 1)$, an exponential distribution with rate $x+1$.

Then, for $y \ge 0$: $p_{Y|X}(y|x) = (x+1)e^{-(x+1)y}$

For a constant prediction $c \ge 0$, the conditional risk given $X = x$ under the absolute loss is:  
$R_{l_{absolute}}(c)=\int_0^{\infty}{|y-c|(x+1)e^{-(x+1)y}\;dy}=\int_0^c (c-y)(x+1)e^{-(x+1)y}\;dy+\int_c^{\infty}(y-c)(x+1)e^{-(x+1)y}\;dy= c + \frac{2}{x+1}e^{-(x+1)c} - \frac{1}{x+1}$
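
The closed form can be checked against a direct numerical integration. The sketch below is only a sanity check under stated assumptions: the function names, the truncation point `y_max`, the grid size and the test values `x = 0.5`, `c = 0.3` are all illustrative choices, and the integral is approximated with a hand-written trapezoidal rule.

```python
import numpy as np


def closed_form_risk(c, x):
    """Closed-form conditional risk of the constant prediction c >= 0."""
    lam = x + 1.0  # rate of the exponential distribution
    return c + 2.0 / lam * np.exp(-lam * c) - 1.0 / lam


def numerical_risk(c, x, y_max=60.0, n_points=600_001):
    """Trapezoidal approximation of the integral of |y - c| p(y | x) on [0, y_max]."""
    lam = x + 1.0
    y = np.linspace(0.0, y_max, n_points)
    integrand = np.abs(y - c) * lam * np.exp(-lam * y)
    return float(np.sum((integrand[1:] + integrand[:-1]) * np.diff(y)) / 2.0)


x, c = 0.5, 0.3  # arbitrary test point
risk_exact = closed_form_risk(c, x)
risk_numeric = numerical_risk(c, x)
print(f"closed form: {risk_exact:.6f}, numerical: {risk_numeric:.6f}")
```

The tail beyond `y_max` is negligible here because the density decays at least as fast as $e^{-y}$.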

For the squared loss, the Bayes predictor is the conditional mean: $f^{*}_{l_{squared}}(x)=E[Y|X=x]=\frac{1}{x+1}$

For the absolute loss, the Bayes predictor is the conditional median of $Y | X = x$: $f^{*}_{l_{absolute}}(x)=\frac{\ln(2)}{x+1}$

The Bayes predictor for the absolute loss is the value $m$ solving $P(Y \le m | X = x) = \frac{1}{2}$ (result proven in Question 2).  
Then, $P(Y \le m | X = x) = \frac{1}{2} \Leftrightarrow 1 - e^{-(x+1)m} = \frac{1}{2} \Leftrightarrow -(x+1)m = \ln(\frac{1}{2}) \Leftrightarrow m = \frac{\ln(2)}{x+1}$
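
The median formula can be verified by plugging it back into the conditional CDF. This is a minimal sketch; `conditional_cdf` and `conditional_median` are hypothetical helper names, and the values of `x` are arbitrary points in $[0,1]$.

```python
import math


def conditional_cdf(y, x):
    """CDF of Y | X = x, an exponential with rate x + 1, for y >= 0."""
    return 1.0 - math.exp(-(x + 1.0) * y)


def conditional_median(x):
    """Candidate median ln(2) / (x + 1) derived above."""
    return math.log(2.0) / (x + 1.0)


for x in (0.0, 0.25, 0.5, 1.0):
    m = conditional_median(x)
    # F(m) should equal 1/2 up to floating-point error
    print(f"x = {x:.2f}  m = {m:.4f}  F(m) = {conditional_cdf(m, x):.4f}")
```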

Evaluating the conditional risk under the absolute loss at both predictors:

$R_{l_{absolute}}(f^{*}_{l_{absolute}}(x))= \frac{\ln(2)}{x+1} < R_{l_{absolute}}(f^{*}_{l_{squared}}(x))= \frac{2e^{-1}}{x+1}$
$\Leftrightarrow \ln(2) < 2e^{-1}$
$\Leftrightarrow 0.6931 < 0.7358$

The last inequality is true, so the first one holds for every $x \in [0,1]$.

Hence, we have $f^{*}_{l_{squared}} \ne f^{*}_{l_{absolute}}$

```python
import numpy as np

n_samples = 100000

# Draw X ~ U([0, 1])
X = np.random.uniform(0, 1, size=n_samples)

# Draw Y | X = x ~ Exp(rate = x + 1); NumPy is parametrized by scale = 1 / rate
rates = X + 1
Y = np.random.exponential(scale=1/rates)

# Bayes predictors: conditional mean (squared loss) and conditional median (absolute loss)
f_squared = 1 / (X + 1)
f_absolute = np.log(2) / (X + 1)

# Empirical risk of both predictors under the absolute loss
mae_squared = np.mean(np.abs(Y - f_squared))
mae_absolute = np.mean(np.abs(Y - f_absolute))

print(f"Empirical MAE of absolute-loss Bayes predictor: {mae_absolute:.4f}")
print(f"Empirical MAE of squared-loss Bayes predictor: {mae_squared:.4f}")
print(mae_absolute < mae_squared)
```

```
Empirical MAE of absolute-loss Bayes predictor: 0.4817
Empirical MAE of squared-loss Bayes predictor: 0.5104
True
```
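
The simulated values can be compared with the exact risks. Averaging the conditional risks over $X \leadsto U([0,1])$ only requires $E[\frac{1}{X+1}] = \int_0^1 \frac{dx}{x+1} = \ln(2)$, so the two risks are $\ln(2)^2 \approx 0.4805$ and $2e^{-1}\ln(2) \approx 0.5100$. The short sketch below just evaluates these expressions; the variable names are illustrative.

```python
import math

ln2 = math.log(2.0)

# E[1 / (X + 1)] for X ~ U([0, 1]) is the integral of 1 / (x + 1) over [0, 1], i.e. ln(2)
mean_inverse_rate = ln2

# Conditional risks ln(2) / (x + 1) and 2 e^{-1} / (x + 1), averaged over X
risk_absolute = ln2 * mean_inverse_rate
risk_squared = 2.0 * math.exp(-1.0) * mean_inverse_rate

print(f"Exact risk of absolute-loss Bayes predictor: {risk_absolute:.4f}")
print(f"Exact risk of squared-loss Bayes predictor: {risk_squared:.4f}")
```

The empirical values 0.4817 and 0.5104 differ from these by roughly $10^{-3}$, which is the order of the Monte Carlo error expected with 100000 samples.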


### Question 2 (M)

We want to find the Bayes predictor for the absolute loss, $f^{*}_{absolute}(x)=\underset{z\in\R}{\arg\min}\; g(z)$, where $g(z)$ is the conditional risk of predicting $z$ given $X = x$.

By definition:

$g(z) = \int_{\R}{|y-z|p_{Y|X=x}(y)}\;dy$

$g(z) = \int_{-\infty}^{z}{(z-y)p_{Y|X=x}(y)}\;dy + \int_{z}^{\infty}{(y-z)p_{Y|X=x}(y)}\;dy$

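Then, differentiating each integral with respect to $z$ (the boundary terms vanish because both integrands are zero at $y = z$):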

$\frac{d\;g(z)}{dz} = \frac{d}{dz} \int_{\R}{|y-z|p_{Y|X=x}(y)}\;dy = \int_{-\infty}^{z}{(1)p_{Y|X=x}(y)}\;dy + \int_{z}^{\infty}{(-1)p_{Y|X=x}(y)}\;dy = 2F_{Y|X=x}(z) - 1$

We have:
$g'(z) = 2F_{Y|X=x}(z) - 1$
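
As a sanity check on this derivative, here is a minimal sketch that reuses the exponential model of Question 1 with $x = 0.5$ (an arbitrary choice). It compares a central finite difference of the closed-form risk with $2F_{Y|X=x}(z) - 1$; the names `g`, `cdf`, `h` and `max_gap` are illustrative, and the closed form is only valid for $z \ge 0$.

```python
import math

lam = 1.5  # rate x + 1 for x = 0.5


def g(z):
    """Closed-form conditional risk from Question 1, valid for z >= 0."""
    return z + 2.0 / lam * math.exp(-lam * z) - 1.0 / lam


def cdf(z):
    """Conditional CDF of the exponential distribution with rate lam."""
    return 1.0 - math.exp(-lam * z)


h = 1e-6  # finite-difference step
max_gap = 0.0
for z in (0.1, 0.5, 1.0, 2.0):
    finite_diff = (g(z + h) - g(z - h)) / (2 * h)
    max_gap = max(max_gap, abs(finite_diff - (2 * cdf(z) - 1)))

print(f"largest gap between finite difference and 2F(z) - 1: {max_gap:.2e}")
```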

We can compute the sign of the second derivative (wherever $p_{Y|X=x}$ is continuous):

$g''(z) = \frac{d}{dz}[2F_{Y|X=x}(z) - 1]$

$g''(z) = 2\frac{d}{dz}F_{Y|X=x}(z) = 2\frac{d}{dz}\int_{-\infty}^{z} p_{Y|X=x}(y)\;dy = 2p_{Y|X=x}(z)$

Therefore:
$g''(z) = 2p_{Y|X=x}(z)$


Since $p_{Y|X=x}(z)$ is a probability density function, we have $p_{Y|X=x}(z) \geq 0$ for all $z \in \R$

Therefore, $g''(z) = 2p_{Y|X=x}(z) \geq 0$ for all $z$

This confirms that $g$ is convex, so any point where $g'(z) = 0$ is a global minimum.


Then,

$g'(z)=0 \Leftrightarrow \int_{-\infty}^{z}{p_{Y|X=x}(y)}\;dy = \int_{z}^{\infty}{p_{Y|X=x}(y)}\;dy$  
&emsp;&emsp;&emsp;&emsp;&emsp;$\Leftrightarrow F_{Y|X=x}(z) = 1 - F_{Y|X=x}(z)$  
&emsp;&emsp;&emsp;&emsp;&emsp;$\Leftrightarrow F_{Y|X=x}(z) = \frac{1}{2}$

Finally,  
$f^{*}_{absolute}(x) \in \{ z \in \R : P(Y \le z | X = x) = \frac{1}{2}\} $ (this set is non-empty since $F_{Y|X=x}$ is continuous)  

We can then conclude that $f^{*}_{absolute}(x)$ is a conditional median of $Y|X=x$.
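
To illustrate the result empirically, the sketch below minimizes the empirical absolute loss over a grid of candidate values and compares the minimizer with the sample median. It is only an illustration under stated assumptions: it reuses the exponential model of Question 1 at the arbitrary point $x = 0.5$, and the seed, sample size and grid are hypothetical choices.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, arbitrary
x = 0.5
samples = rng.exponential(scale=1.0 / (x + 1.0), size=20_000)

# Candidate predictions z and their empirical absolute risk
grid = np.linspace(0.0, 2.0, 401)  # step 0.005
empirical_risk = np.array([np.mean(np.abs(samples - z)) for z in grid])

z_best = grid[np.argmin(empirical_risk)]
sample_median = np.median(samples)
theoretical_median = np.log(2) / (x + 1.0)

print(f"grid minimizer:     {z_best:.4f}")
print(f"sample median:      {sample_median:.4f}")
print(f"theoretical median: {theoretical_median:.4f}")
```

Because the empirical risk is convex in $z$, the grid minimizer lies within one grid step of the sample median, which itself approaches $\frac{\ln(2)}{x+1}$ as the sample size grows.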