# Honor 2 Quantile Regression and Extreme Values

## Quantile

### Order Statistics

Given a series of observations $(X_1,\dotsc,X_n)$, we can rearrange them in order:
$$X_{1,n}\leqslant X_{2,n}\leqslant \dotsc\leqslant X_{n,n}.$$
Then, $(X_{1,n},\dotsc,X_{n,n})$ is called the order statistics of $(X_1,\dots,X_n)$.

**Examples**

1. Median: the sample median of $(X_1,\dotsc,X_n)$ is defined by 
$$m = \left\{\begin{array}{ll}X_{\frac{n+1}{2}, n}, \quad & n {\rm \ is\ odd},\\
\frac 12\left(X_{\frac n2 , n}+X_{\frac n2+1,n}\right),\quad & n{\rm \ is\ even}.\end{array}\right.$$

The median is often more robust to outliers than the mean.


2. For $1\leqslant k\leqslant n$, 
$$X_{k,n} = \min_{1\leqslant i_1< \dotsc< i_k\leqslant n}\max \left\{X_{i_1},\dotsc,X_{i_k}\right\}
 = \max_{1\leqslant i_1< \dotsc< i_{n-k+1}\leqslant n}\min \left\{X_{i_1},\dotsc,X_{i_{n-k+1}}\right\}.$$
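The two examples above can be checked by brute force on a small sample. The following is a minimal sketch, not a definitive implementation; the function names and the sample `xs` are hypothetical and chosen only for illustration.

```python
from itertools import combinations

def order_statistics(xs):
    """Return (X_{1,n}, ..., X_{n,n}): the sample sorted in ascending order."""
    return sorted(xs)

def sample_median(xs):
    """Sample median via order statistics (indices shifted to 0-based)."""
    s = sorted(xs)
    n = len(s)
    if n % 2 == 1:
        return s[(n + 1) // 2 - 1]            # X_{(n+1)/2, n}
    return 0.5 * (s[n // 2 - 1] + s[n // 2])  # mean of X_{n/2, n} and X_{n/2+1, n}

def kth_by_minmax(xs, k):
    """X_{k,n} as the smallest maximum over all subsets of size k."""
    return min(max(c) for c in combinations(xs, k))

def kth_by_maxmin(xs, k):
    """X_{k,n} as the largest minimum over all subsets of size n-k+1."""
    return max(min(c) for c in combinations(xs, len(xs) - k + 1))

xs = [3.0, -1.0, 7.0, 2.0, 5.0]  # hypothetical sample
ordered = order_statistics(xs)
for k in range(1, len(xs) + 1):
    # Both formulas should agree with the k-th smallest value.
    print(k, ordered[k - 1], kth_by_minmax(xs, k), kth_by_maxmin(xs, k))
```

The brute-force versions enumerate all subsets, so they are only meant for checking the identity on tiny samples; sorting is the practical way to obtain order statistics.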

### Population Quantile

Let $F$ be a cumulative distribution function. Its (generalized) inverse $F^{-1}:[0,1]\rightarrow \bar {\mathbb R}$ is given by
$$F^{-1}(p) = \inf\{x|\ F(x)\geqslant p\},\quad p\in [0,1].$$

We call $F^{-1}(p)$ the $p$-th quantile of $F$. Warning: $F(F^{-1}(p))=p$ **DOES NOT ALWAYS HOLD**, for example when $F$ jumps over the level $p$.
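To make the warning concrete, here is a small sketch of the generalized inverse for a discrete distribution. The distribution ($\mathbb P(X=0)=0.2$, $\mathbb P(X=1)=0.5$, $\mathbb P(X=2)=0.3$) and the names `cdf` and `quantile` are assumptions made up for this illustration.

```python
import bisect

# Hypothetical discrete distribution: P(X=0)=0.2, P(X=1)=0.5, P(X=2)=0.3
support = [0, 1, 2]
cum_probs = [0.2, 0.7, 1.0]  # F evaluated at each support point

def cdf(x):
    """F(x) = P(X <= x), a right-continuous step function."""
    i = bisect.bisect_right(support, x)
    return cum_probs[i - 1] if i > 0 else 0.0

def quantile(p):
    """F^{-1}(p) = inf{x : F(x) >= p}, for 0 < p <= 1."""
    i = bisect.bisect_left(cum_probs, p)  # first index with F >= p
    return support[i]

p = 0.5
q = quantile(p)         # F jumps from 0.2 to 0.7 at x = 1, skipping p = 0.5
print(q, cdf(q))        # F(F^{-1}(p)) is 0.7 here, not 0.5
```

Equality $F(F^{-1}(p))=p$ does hold at levels that $F$ actually attains, such as $p=0.7$ in this example.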


**Properties**

1. $F^{-1}(p)\leqslant x\Leftrightarrow p\leqslant F(x)$.

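Proof: $\Rightarrow$: if $F^{-1}(p) \leqslant x$ (with $p>0$; the case $p=0$ is immediate), then the infimum is attained because $F$ is right-continuous, so $t=F^{-1}(p)\leqslant x$ satisfies $F(t)\geqslant p$. By monotonicity $F(x)\geqslant F(t)\geqslant p$, so $\Rightarrow$ holds. $\Leftarrow$: if $p\leqslant F(x)$, then $x$ belongs to the set $\{y|\ F(y)\geqslant p\}$, so $F^{-1}(p)\leqslant x$ by the definition of $\inf$.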


2. $x<F^{-1}(p)\Leftrightarrow F(x)<p$.

Proof: This is the contrapositive of property 1.


3. $\lim_{x\rightarrow F^{-1}(p)^-}F(x)\leqslant p \leqslant F(F^{-1}(p))$.

Proof: Left: for $\epsilon>0$ we have $F^{-1}(p) - \epsilon <F^{-1}(p)$, so property 2 gives $F(F^{-1}(p) - \epsilon )<p$. Letting $\epsilon\rightarrow 0^+$, so that $F^{-1}(p) - \epsilon\rightarrow F^{-1}(p)^-$, yields $\lim_{x\rightarrow F^{-1}(p)^-}F(x)\leqslant p$.  
Right: take $x = F^{-1}(p)$ in property 1.

4. $F^{-1}(F(x))\leqslant x\leqslant \lim_{p\rightarrow F(x)^+}F^{-1}(p)$.

Proof: Left: take $p = F(x)$ in property 1. Right: for $\epsilon > 0$, let $p = F(x)+\epsilon > F(x)$; property 2 gives $x < F^{-1}(p)$. Letting $\epsilon\rightarrow 0^+$, that is $p\rightarrow F(x)^+$, gives $x\leqslant \lim_{p\rightarrow F(x)^+}F^{-1}(p)$.

5. $F^{-1}(p)$ is nondecreasing and left-continuous.

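Proof: Monotonicity: let $p_1<p_2$, then $p_1<p_2\leqslant F(F^{-1}(p_2))$ by property 3. Taking $x = F^{-1}(p_2)$ in property 1 yields $F^{-1}(p_1)\leqslant F^{-1}(p_2)$. Left-continuity (for finite $F^{-1}(p_2)$): as $p\rightarrow p_2^-$, $F^{-1}(p)$ is nondecreasing and bounded above by $F^{-1}(p_2)$, so the limit exists and $\lim_{p\rightarrow p_2^-}F^{-1}(p)\leqslant F^{-1}(p_2)$. Suppose the inequality were strict. Then there is $\epsilon >0$ such that $F^{-1}(p)\leqslant F^{-1}(p_2) - \epsilon$ for every $p<p_2$, and by property 3, monotonicity of $F$ and property 2,
$$p \leqslant F(F^{-1}(p))\leqslant F(F^{-1}(p_2) - \epsilon) < p_2.$$
Letting $p\rightarrow p_2^-$ gives $p_2\leqslant F(F^{-1}(p_2) - \epsilon)<p_2$, a contradiction. Hence $\lim_{p\rightarrow p_2^-}F^{-1}(p) = F^{-1}(p_2)$.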


6.


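7. If $F$ is continuous at $F^{-1}(t)$, then $F(F^{-1}(t))  =t$. Indeed, by property 3 the left limit of $F$ at $F^{-1}(t)$ and the value $F(F^{-1}(t))$ sandwich $t$, and continuity makes them equal.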
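Properties 1 to 4 can be sanity-checked on a grid for a distribution with atoms. The sketch below uses exact rational arithmetic; the distribution ($\mathbb P(X=1)=\mathbb P(X=2)=1/4$, $\mathbb P(X=4)=1/2$) and the names `F`, `F_inv` are hypothetical choices for illustration, and a grid check is of course not a proof.

```python
from fractions import Fraction as Fr

# Hypothetical distribution with atoms: P(X=1)=1/4, P(X=2)=1/4, P(X=4)=1/2
atoms = [(1, Fr(1, 4)), (2, Fr(1, 4)), (4, Fr(1, 2))]

def F(x):
    """C.D.F.: total mass of the atoms located at or below x."""
    return sum((w for a, w in atoms if a <= x), Fr(0))

def F_inv(p):
    """Generalized inverse for 0 < p <= 1: smallest atom a with F(a) >= p."""
    return min(a for a, _ in atoms if F(a) >= p)

ps = [Fr(k, 8) for k in range(1, 9)]   # levels 1/8, 2/8, ..., 1
xs = [Fr(k, 2) for k in range(0, 11)]  # points 0, 1/2, ..., 5

# Property 1: F^{-1}(p) <= x  <=>  p <= F(x)
prop1 = all((F_inv(p) <= x) == (p <= F(x)) for p in ps for x in xs)
# Property 2: x < F^{-1}(p)  <=>  F(x) < p
prop2 = all((x < F_inv(p)) == (F(x) < p) for p in ps for x in xs)
# Property 3 (right half): p <= F(F^{-1}(p))
prop3_right = all(p <= F(F_inv(p)) for p in ps)
# Property 4 (left half): F^{-1}(F(x)) <= x, wherever F(x) > 0
prop4_left = all(F_inv(F(x)) <= x for x in xs if F(x) > 0)

print(prop1, prop2, prop3_right, prop4_left)
```

Because $F$ is a step function here, the infimum in the definition of $F^{-1}$ is always one of the atoms, which is why a minimum over atoms suffices.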

### Random Number Generator

Suppose $X$ has C.D.F. $F$ and $U$ is a standard uniform random variable. Then 
$F^{-1}(U)\sim  X$. Further, if $F$ is continuous, then $F(X)\sim U$.

Proof: Fix $t$. Since $F^{-1}(U)\leqslant t\Leftrightarrow U\leqslant F(t)$ by property 1, we have $\mathbb P(F^{-1}(U)\leqslant t) = \mathbb P(U\leqslant F(t)) =F(t) =\mathbb P(X\leqslant t)$. When $F$ is continuous, first note that for any $t\in(0,1)$
$$\mathbb P(F(X) < t) =1- \mathbb P(F(X)\geqslant t) = 1 - \mathbb P(X\geqslant F^{-1}(t)) = 1 - (1 -F(F^{-1}(t))) = F(F^{-1}(t))=t.
$$

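Here the second equality uses property 1, the third uses continuity of $F$ (so that $\mathbb P(X<F^{-1}(t)) = F(F^{-1}(t))$), and the last uses property 7. Finally, for $\epsilon>0$, $\mathbb P(F(X)<t-\epsilon) \leqslant \mathbb P(F(X)\leqslant t) \leqslant \mathbb P(F(X)<t+\epsilon)$, and letting $\epsilon\rightarrow 0^+$ gives $\mathbb P(F(X)\leqslant t)=t$, that is, $F(X)\sim U$.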
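Both claims can be illustrated with inverse transform sampling. The sketch below assumes an exponential distribution with a hypothetical rate of 2, for which $F(x)=1-e^{-2x}$ and $F^{-1}(p)=-\log(1-p)/2$; the helper names, seed and sample sizes are arbitrary choices for illustration.

```python
import math
import random

RATE = 2.0  # hypothetical rate parameter, chosen only for illustration

def exp_cdf(x, rate=RATE):
    """F(x) = 1 - exp(-rate * x) for x >= 0."""
    return -math.expm1(-rate * x)

def exp_quantile(p, rate=RATE):
    """F^{-1}(p) = -log(1 - p) / rate for 0 <= p < 1."""
    return -math.log1p(-p) / rate

rng = random.Random(0)  # fixed seed so the run is reproducible

# First claim: F^{-1}(U) is distributed as X.
us = [rng.random() for _ in range(100_000)]        # U ~ Uniform[0, 1)
xs = [exp_quantile(u) for u in us]                 # samples of F^{-1}(U)
sample_mean = sum(xs) / len(xs)                    # should be near 1 / RATE
frac_below = sum(x <= 0.3 for x in xs) / len(xs)   # should be near F(0.3)

# Second claim: F(X) is standard uniform when F is continuous.
ys = [rng.expovariate(RATE) for _ in range(100_000)]  # X drawn independently
vs = [exp_cdf(y) for y in ys]                          # samples of F(X)
mean_v = sum(vs) / len(vs)                             # should be near 1/2
frac_v = sum(v <= 0.25 for v in vs) / len(vs)          # should be near 0.25

print(sample_mean, frac_below, exp_cdf(0.3), mean_v, frac_v)
```

The same recipe applies to any distribution whose quantile function can be evaluated, including discrete ones, since the first claim does not require continuity.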


#### Order Statistics

Let $X_i\ (i=1,\dotsc,n)$ be i.i.d. from $F$ and $U_i\ (i=1,\dotsc,n)$ be i.i.d. standard uniform. Then the order statistics of the $X_i$ have the same joint distribution as the images under $F^{-1}$ of the uniform order statistics,
$$(X_{1,n},\dotsc,X_{n,n})\sim (F^{-1}(U_{1,n}),\dotsc,F^{-1}(U_{n,n})).$$
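The reason behind this identity is that $F^{-1}$ is nondecreasing, so applying it before or after sorting gives the same vector. A minimal sketch, assuming a standard exponential distribution and a hypothetical helper `exp_quantile`:

```python
import math
import random

def exp_quantile(p, rate=1.0):
    """F^{-1}(p) = -log(1 - p) / rate for the exponential distribution."""
    return -math.log1p(-p) / rate

rng = random.Random(1)                   # fixed seed for reproducibility
u = [rng.random() for _ in range(10)]    # U_1, ..., U_n

# Sort the uniforms first, then map through F^{-1} ...
via_uniform = [exp_quantile(p) for p in sorted(u)]
# ... or map each U_i to X_i = F^{-1}(U_i) first, then sort.
direct = sorted(exp_quantile(p) for p in u)

print(via_uniform == direct)
```

Combined with $F^{-1}(U_i)\sim X_i$, this pathwise equality gives the distributional identity for the order statistics.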

## Convergence


### Inverse

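**Theorem** Let $F_n\ (n=1,2,\dotsc)$ and $F$ be C.D.F.s. Then $F_n^{-1}\rightarrow_d F^{-1}$ if and only if $F_n\rightarrow_d F$, where $\rightarrow_d$ for functions means pointwise convergence at every continuity point of the limit.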

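**Proof** Let $U$ be a standard uniform random variable. $\Rightarrow$: the monotone function $F^{-1}$ has at most countably many discontinuities, so $F_n^{-1}(U)\rightarrow F^{-1}(U)$ almost surely by the definition of convergence in distribution. Almost sure convergence implies convergence in distribution, and since $F_n^{-1}(U)\sim F_n$ and $F^{-1}(U)\sim F$, we get $F_n\rightarrow_d F$. $\Leftarrow$: let $N$ be a random variable with strictly increasing and continuous C.D.F. $\Phi$ (for example, a standard normal). Since $F$ has at most countably many discontinuities, $F_n(N)\rightarrow F(N)$ almost surely, and by property 2 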
$$\Phi(F_n^{-1}(t)) = \mathbb P(N\leqslant F_n^{-1}(t)) = \mathbb P(N<F_n^{-1}(t)) = \mathbb P(F_n(N)<t) \rightarrow \mathbb P(F(N)<t) = \mathbb P(N<F^{-1}(t)) = \Phi(F^{-1}(t)) .$$

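The convergence holds at every $t$ at which the limit is continuous, in particular at every continuity point of $F^{-1}$. Since $\Phi^{-1}$ is continuous, $F_n^{-1}(t) = \Phi^{-1}(\Phi(F_n^{-1}(t)))\rightarrow \Phi^{-1}(\Phi(F^{-1}(t))) =  F^{-1}(t)$.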
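As a simple illustration of the theorem (an example chosen here, not taken from the proof), let $F_n$ be the C.D.F. of the uniform distribution on $\{1/n, 2/n,\dotsc,1\}$ and $F$ the standard uniform C.D.F. Then $F_n(x)=\lfloor nx\rfloor/n$ on $[0,1]$ and $F_n^{-1}(t)=\lceil nt\rceil/n$, and both differ from their limits by at most $1/n$. The function names below are hypothetical.

```python
import math

def F_n(x, n):
    """C.D.F. of the uniform distribution on {1/n, 2/n, ..., 1}."""
    return min(max(math.floor(n * x), 0), n) / n

def F_n_inv(t, n):
    """Generalized inverse inf{x : F_n(x) >= t} for 0 < t <= 1."""
    return math.ceil(n * t) / n

for n in (10, 100, 1000):
    # Largest deviation from the uniform limit F(x) = x and F^{-1}(t) = t
    # over a few test points; both should be at most about 1/n.
    pts = (0.1, 0.37, 0.5, 0.9)
    err_cdf = max(abs(F_n(x, n) - x) for x in pts)
    err_inv = max(abs(F_n_inv(t, n) - t) for t in pts)
    print(n, err_cdf, err_inv)
```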

## Quantile Regression

