# 1 Convex

In this course we will (in most cases) only work in $\mathbb R^n$: no complex numbers, infinite-dimensional vector spaces, non-Euclidean metrics, etc.

## Affine Set

Definition. $S$ is an affine set iff $\alpha x+ (1-\alpha)y\in S$ for all $x,y\in S$ and $\alpha \in \mathbb R$. 

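Properties: Every nonempty affine set $S\subseteq \mathbb R^n$ can be represented either parametrically, $S = \{A_0z+x_0:\ z\in \mathbb R^k\}$, or as the solution set of a linear system, $S = \{x:\ A_1x = b\}$.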
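As a small worked example of the two representations (the specific line below is chosen here purely for illustration), take the line $x_1+x_2=1$ in $\mathbb R^2$. In equation form and in parametric form it reads

$$\{x\in \mathbb R^2:\ \begin{pmatrix}1 & 1\end{pmatrix}x = 1\} = \left\{\begin{pmatrix}-1\\ 1\end{pmatrix}s+\begin{pmatrix}1\\ 0\end{pmatrix}:\ s\in \mathbb R\right\}.$$

An affine combination of two of its points stays on the line for every $\alpha\in \mathbb R$:

$$\alpha \begin{pmatrix}1-s\\ s\end{pmatrix}+(1-\alpha)\begin{pmatrix}1-s'\\ s'\end{pmatrix} = \begin{pmatrix}1-s''\\ s''\end{pmatrix},\quad s'' = \alpha s+(1-\alpha)s'.$$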

## Convex Set

Definition. $S$ is a convex set iff $\alpha x+ (1-\alpha)y \in S$ for all $x,y\in S$ and $\alpha \in [0,1]$. 

### Convex Hull

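Definition. The convex hull of $S$, denoted by ${\rm conv}(S)$, is the set of all convex combinations $\sum_{i=1}^k \alpha_i x_i$ of points $x_i\in S$, where $\alpha_i\geqslant 0$ and $\sum_{i=1}^k \alpha_i = 1$. It is convex, and it is the smallest convex set that contains $S$.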



### Examples

#### Hyperplane

A hyperplane is an $(n-1)$-dimensional affine subspace of $\mathbb R^n$ $(a\neq 0)$,
$$S=\{x:\quad a^Tx= b\}.$$

#### Halfspace

A halfspace is one side of a hyperplane $(a\neq 0)$,
$$S=\{x:\quad a^Tx\leqslant b\}.$$

#### Polyhedron

A polyhedron is the solution set of finitely many linear equalities and inequalities,
$$S= \{x:\quad Ax\leqslant b,\quad Cx=d\}.$$
When a polyhedron $S$ is bounded, we also call it a polytope.
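The definition translates directly into a membership test. The following is a minimal sketch in plain Python; the helper names `dot` and `in_polyhedron`, the tolerance `tol`, and the unit-square data are assumptions made for illustration, not part of the course material. It checks that sampled convex combinations of two points of a polytope stay inside it.

```python
def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def in_polyhedron(x, A, b, C=(), d=(), tol=1e-9):
    """Return True if A x <= b and C x = d hold up to the tolerance tol."""
    inequalities_ok = all(dot(row, x) <= bi + tol for row, bi in zip(A, b))
    equalities_ok = all(abs(dot(row, x) - di) <= tol for row, di in zip(C, d))
    return inequalities_ok and equalities_ok


# Illustrative data: the unit square [0, 1]^2 written as A x <= b.
A = [[1, 0], [-1, 0], [0, 1], [0, -1]]
b = [1, 0, 1, 0]

p, q = (0.0, 0.2), (1.0, 0.9)

# Convex combinations alpha*p + (1 - alpha)*q for alpha = 0, 0.1, ..., 1.
segment = [
    tuple(k / 10 * pi + (1 - k / 10) * qi for pi, qi in zip(p, q))
    for k in range(11)
]
segment_inside = all(in_polyhedron(x, A, b) for x in segment)
```

A sampled check like this can only refute convexity, never prove it; here the proof is that each constraint $a^Tx\leqslant b$ is preserved under convex combinations.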

#### Convex Cone

A set $S$ is a cone if for any $x\in S$, we have $tx\in S$ for all $t\in [0,+\infty)$. If a cone is convex, it is a convex cone. 

## Convexity Preserved Mapping

If the image $f(A) = \{f(x):\ x\in A\}$ is a convex set for every convex set $A$ in the domain of $f$, then we say that the mapping $f$ preserves convexity. Some common examples are affine mappings, perspective mappings and linear-fractional mappings.

The composition of convexity-preserving mappings also preserves convexity.

### Affine Mapping

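Definition. A mapping $f: \mathbb R^{n}\rightarrow \mathbb R^m$ of the form
$$f(x) = Ax + b,\quad A\in \mathbb R^{m\times n},\ b\in \mathbb R^m.$$

Proof. For $x,y$ in a convex set $S$ and $\alpha\in [0,1]$ we have $\alpha f(x)+(1-\alpha)f(y) = f(\alpha x+(1-\alpha)y)$, and $\alpha x+(1-\alpha)y\in S$, so the combination lies in the image $f(S)$.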

### Perspective Mapping

Definition. A mapping $f: \mathbb R^{n+1}\cap \{(x,t): t>0\}\rightarrow \mathbb R^n$ given by

$$f(x,t) = \frac{x}{t}.$$

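Proof. Let $(x_1,t_1),(x_2,t_2)$ belong to a convex set $S$ in the domain of $f$ and let $\alpha\in [0,1]$. Then

$$\alpha f(x_1,t_1) + (1-\alpha)f(x_2,t_2) = \frac{\alpha x_1}{t_1}+\frac{(1-\alpha)x_2}{t_2}=ux_1+vx_2=\frac{ux_1+vx_2}{(u+v)t_3},$$

where $u = \frac{\alpha}{t_1}$, $v = \frac{1-\alpha}{t_2}$ and $t_3 = \frac{1}{u+v}$. Note that 
$$\left(\frac{1}{t_3}-\frac{1}{t_1}\right)\left(\frac{1}{t_3} - \frac{1}{t_2}\right) = (1-\alpha)\left(\frac{1}{t_2} - \frac{1}{t_1}\right)
\cdot \alpha \left(\frac{1}{t_1}-\frac{1}{t_2}\right)\leqslant 0.$$

This means that $t_3$ lies between $t_1$ and $t_2$; in particular $t_3>0$. More precisely, set $\theta = \frac{u}{u+v}\in [0,1]$. Since $ut_1+vt_2 = \alpha + (1-\alpha) = 1$, we get $\theta t_1+(1-\theta)t_2 = \frac{1}{u+v} = t_3$ and $\theta x_1+(1-\theta)x_2 = \frac{ux_1+vx_2}{u+v}$, so the point $\left(\frac{ux_1+vx_2}{u+v},t_3\right) = \theta (x_1,t_1)+(1-\theta)(x_2,t_2)$ belongs to $S$. Thus
$$\alpha f(x_1,t_1) + (1-\alpha)f(x_2,t_2)=f\left(\frac{ux_1+vx_2}{u+v},t_3\right)$$
lies in the image $f(S)$.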
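The identity in the proof can be checked numerically. The sketch below is illustrative only: the function names, the weight name `theta` (standing for $\frac{u}{u+v}$ in the notation of the proof) and the sample points are assumptions introduced here. It confirms that a convex combination of two images equals the image of a convex combination of the two preimages, with a different weight.

```python
def perspective(x, t):
    """Perspective map f(x, t) = x / t, defined for t > 0."""
    if t <= 0:
        raise ValueError("t must be positive")
    return [xi / t for xi in x]


def preimage_weight(alpha, t1, t2):
    """Weight theta = u / (u + v) with u = alpha / t1 and v = (1 - alpha) / t2."""
    u = alpha / t1
    v = (1 - alpha) / t2
    return u / (u + v)


# Illustrative sample points (x1, t1), (x2, t2) and weight alpha.
x1, t1 = [1.0, 2.0], 2.0
x2, t2 = [3.0, -1.0], 0.5
alpha = 0.3

# Left-hand side: convex combination of the two images.
lhs = [
    alpha * a + (1 - alpha) * b
    for a, b in zip(perspective(x1, t1), perspective(x2, t2))
]

# Right-hand side: image of a convex combination of the two preimages.
theta = preimage_weight(alpha, t1, t2)
x3 = [theta * a + (1 - theta) * b for a, b in zip(x1, x2)]
t3 = theta * t1 + (1 - theta) * t2
rhs = perspective(x3, t3)

max_gap = max(abs(l - r) for l, r in zip(lhs, rhs))
```

Note that `theta` generally differs from `alpha`: the perspective mapping sends segments to segments but does not preserve the parametrization.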

### Linear-fractional Mapping

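Definition. A mapping $f: \mathbb R^n\cap \{x: c^Tx+d>0\}\rightarrow \mathbb R^m$ given by
$$f(x) = \frac{Qx+u}{c^Tx + d},\quad Q\in \mathbb R^{m\times n},\ u\in \mathbb R^m,\ c\in \mathbb R^n,\ d\in \mathbb R.$$

A linear-fractional mapping maps a convex set to a convex set. 

Proof. The mapping factors as $x\mapsto (x,c^Tx+d)\mapsto (Qx+u,c^Tx+d)\mapsto f(x)$. The first two steps are affine and the last one is the perspective mapping, so $f$ is a composition of convexity-preserving mappings.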
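The factorization in the proof can be mirrored in code. This is a minimal sketch under assumed data: the function name `linear_fractional` and the particular $Q,u,c,d$ and points are chosen here for illustration only. It evaluates $f$ as an affine step followed by a perspective step and checks that the image of a segment's midpoint lies on the segment between the images of its endpoints.

```python
def linear_fractional(Q, u, c, d, x):
    """Evaluate f(x) = (Qx + u) / (c^T x + d) where c^T x + d > 0."""
    # Affine step: x -> (Qx + u, c^T x + d).
    numerator = [
        sum(qij * xj for qij, xj in zip(row, x)) + ui
        for row, ui in zip(Q, u)
    ]
    denominator = sum(ci * xi for ci, xi in zip(c, x)) + d
    if denominator <= 0:
        raise ValueError("x is outside the domain: c^T x + d must be positive")
    # Perspective step: (y, t) -> y / t.
    return [yi / denominator for yi in numerator]


# Illustrative data: f(x) = x / (x_1 + 1) on the halfplane x_1 > -1.
Q = [[1.0, 0.0], [0.0, 1.0]]
u = [0.0, 0.0]
c = [1.0, 0.0]
d = 1.0

x1, x2 = [0.0, 0.0], [1.0, 2.0]
midpoint = [0.5, 1.0]

y1 = linear_fractional(Q, u, c, d, x1)
y2 = linear_fractional(Q, u, c, d, x2)
ym = linear_fractional(Q, u, c, d, midpoint)

# Solve ym = y1 + lam * (y2 - y1) in the first coordinate,
# then measure how well the second coordinate agrees.
lam = (ym[0] - y1[0]) / (y2[0] - y1[0])
residual = abs(ym[1] - (y1[1] + lam * (y2[1] - y1[1])))
```

In this example `lam` comes out as $2/3$ rather than $1/2$: the image of the midpoint is on the image segment, but it is not the midpoint of that segment.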

### Separation Hyperplane Theorem

Let $A,B$ be two nonempty disjoint convex sets, $A\cap B=\emptyset$. Then there exists a hyperplane $a^Tx + b=0\ (a\neq 0)$ that 
separates the two sets:

$$\begin{aligned}a^Tx + b&\geqslant 0,\quad \forall x\in A,\\ a^Ty + b &\leqslant 0, \quad \forall y\in B.\end{aligned}$$

Proof: Consider the difference set $C = \{x-y:\ x\in A,y\in B\}$, which is convex and, since $A\cap B=\emptyset$, does not contain the origin. If $C$ is closed, then we can select a minimizer $x\in C$ such that $\Vert x\Vert = \inf_{w\in C} \Vert w\Vert$, and $x\neq 0$. Since $x+\alpha (w-x)\in C$ for all $w\in C$ and $\alpha \in [0,1]$, this implies
$$\Vert x + \alpha (w-x)\Vert^2 \geqslant \Vert x\Vert^2.$$

Expanding the inequality, we obtain
$$f(\alpha) := 2\alpha x^T(w-x) + \alpha^2 (w-x)^T(w-x)\geqslant 0.$$

Since $f(0)=0$, this requires $f'(0) = 2x^T(w-x)\geqslant 0$, i.e. $x^Tw\geqslant x^Tx> 0$ for all $w\in C$.

Hence for arbitrary $u\in A$ and $v\in B$ we have $x^T(u-v)\geqslant 0$. Therefore $\inf_{u\in A}x^Tu \geqslant 
\sup_{v\in B} x^Tv$ and the hyperplane $\{w:\ x^Tw-\sup_{v\in B} x^Tv= 0\}$, i.e. $a=x$ and $b=-\sup_{v\in B}x^Tv$, meets our demand.

If $C$ is not closed, the proof can be completed by a limiting argument: apply the above to an increasing sequence of closed convex subsets of $C$ whose union is dense in $C$, normalize the resulting normal vectors, and pass to a convergent subsequence.
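For two disjoint closed balls the construction in the proof can be carried out in closed form, which makes a convenient sanity check. The sketch below is an illustration under that assumption (the function name `separate_balls` and the sample data are hypothetical): the difference set of the balls $B(c_1,r_1)$ and $B(c_2,r_2)$ is the ball centered at $c_1-c_2$ with radius $r_1+r_2$, so its minimum-norm point and the supremum over the second ball are explicit.

```python
import math


def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def separate_balls(c1, r1, c2, r2):
    """Return (a, b) with a^T x + b >= 0 on B(c1, r1) and <= 0 on B(c2, r2)."""
    diff = [p - q for p, q in zip(c1, c2)]
    dist = math.sqrt(dot(diff, diff))
    if dist <= r1 + r2:
        raise ValueError("the balls are not disjoint")
    # Minimum-norm point of the difference set, a ball around c1 - c2.
    scale = 1 - (r1 + r2) / dist
    a = [scale * z for z in diff]
    norm_a = scale * dist
    # b = -sup over the second ball of a^T v = -(a^T c2 + r2 * ||a||).
    b = -(dot(a, c2) + r2 * norm_a)
    return a, b


def circle_points(center, radius, count=36):
    """Sample boundary points of a disk in the plane."""
    return [
        (center[0] + radius * math.cos(2 * math.pi * k / count),
         center[1] + radius * math.sin(2 * math.pi * k / count))
        for k in range(count)
    ]


# Illustrative data: two unit disks centered at (0, 0) and (4, 0).
a, b = separate_balls((0.0, 0.0), 1.0, (4.0, 0.0), 1.0)

tol = 1e-9
side_A_ok = all(dot(a, x) + b >= -tol for x in circle_points((0.0, 0.0), 1.0))
side_B_ok = all(dot(a, y) + b <= tol for y in circle_points((4.0, 0.0), 1.0))
```

For this data the hyperplane is the vertical line $x_1 = 3$, which touches the second disk; the theorem only promises non-strict separation.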

### Supporting Hyperplane Theorem

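A supporting hyperplane to a convex set $C$ at a boundary point $x_0$ is a hyperplane

$$\{x: a^T(x - x_0) = 0\}$$

such that $a\neq 0$ and $a^T(x - x_0) \leqslant 0$ for all $x\in C$. 

Proof of existence: If $C$ has nonempty interior, apply the separation hyperplane theorem to $\{x_0\}$ and the interior of $C$, which are disjoint convex sets; the inequality extends from the interior to all of $C$ by continuity. If the interior of $C$ is empty, then $C$ lies in a hyperplane through $x_0$, and that hyperplane is itself supporting.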
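As a worked example (the unit ball is chosen here for illustration), let $C=\{x:\ \Vert x\Vert_2\leqslant 1\}$ and let $x_0$ be a boundary point, so $\Vert x_0\Vert_2=1$. Taking $a=x_0$, the Cauchy-Schwarz inequality gives for every $x\in C$

$$a^T(x-x_0) = x_0^Tx-\Vert x_0\Vert_2^2\leqslant \Vert x_0\Vert_2\Vert x\Vert_2-1\leqslant 0,$$

so $\{x:\ x_0^T(x-x_0)=0\}$ is a supporting hyperplane to $C$ at $x_0$, namely the tangent hyperplane of the sphere at that point.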

## Convex Function

The effective domain of function $f$ is defined by

$${\rm dom}(f) = \{x\in \mathbb R^n: f(x) < +\infty\}.$$

A function $f: \mathbb R^n \rightarrow \mathbb R$ is called a convex function if ${\rm dom}(f)$ is convex and 
for all $x_1,x_2\in {\rm dom}(f)$ and all $\alpha \in [0,1]$ we have
$$f(\alpha x_1 + (1-\alpha)x_2) \leqslant \alpha f(x_1) +(1-\alpha)f(x_2).$$

When for all $x_1\neq x_2$ in ${\rm dom}(f)$ and all $\alpha \in (0,1)$ we have the strict inequality 
$$f(\alpha x_1 + (1-\alpha)x_2) < \alpha f(x_1) +(1-\alpha)f(x_2)$$

then we call $f$ strictly convex. 


<br>

Additionally, we call $f$ concave when $-f$ is convex. 
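The defining inequality can be probed numerically along a segment. The following is a rough sketch for scalar functions; the helper name `first_violation`, the grid size and the tolerance are assumptions made here. It searches a grid of $\alpha$ values for a point where the function rises above its chord. Finding one disproves convexity, while finding none proves nothing.

```python
def first_violation(f, x1, x2, steps=100, tol=1e-12):
    """Return the first grid alpha where
    f(alpha*x1 + (1-alpha)*x2) > alpha*f(x1) + (1-alpha)*f(x2), else None."""
    for k in range(steps + 1):
        alpha = k / steps
        point = alpha * x1 + (1 - alpha) * x2
        chord = alpha * f(x1) + (1 - alpha) * f(x2)
        if f(point) > chord + tol:
            return alpha
    return None


def square(x):
    """Convex on the whole real line."""
    return x * x


def cube(x):
    """Not convex on the real line (it is concave for x <= 0)."""
    return x ** 3


square_result = first_violation(square, -2.0, 3.0)
cube_result = first_violation(cube, -2.0, 0.0)
```

Here `square_result` is `None`, while `cube_result` is a value of $\alpha$ at which $x^3$ lies strictly above the chord between $-2$ and $0$.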

### Norm

All norms are convex functions, because by the triangle inequality and absolute homogeneity, for $\alpha \in [0,1]$,
$$\Vert \alpha x +(1-\alpha)y\Vert \leqslant \alpha \Vert x\Vert+(1-\alpha)\Vert y \Vert.$$

Conversely, if a convex function is positively homogeneous, i.e. $f(\lambda x) = \lambda f(x)$ for all $\lambda > 0$, then it satisfies the triangle inequality, 
$$f(x+y) = f\left(\tfrac12 (2x)+\tfrac12 (2y)\right) \leqslant \frac12 \left(f(2x)+f(2y)\right) = f(x) + f(y).$$

### Extended-value Function

For a function $f$, define its extended-value function $\widetilde f$ by
$$\widetilde f(x) = \begin{cases} f(x), & x\in {\rm dom}(f),\\
+\infty, & x\notin {\rm dom}(f).\end{cases}$$

### Epigraph

Definition. ${\rm epi}(f)  = \{(x,t)\in \mathbb R^n\times \mathbb R: x\in {\rm dom}(f), t\geqslant f(x)\}$.

Properties. $f$ is convex $\Leftrightarrow$ ${\rm epi}(f)$ is a convex set. 

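Proof. If $f$ is convex and $(x_1,t_1), (x_2,t_2)\in {\rm epi}(f)$ with $\alpha \in [0,1]$, then $\alpha x_1+(1-\alpha)x_2\in {\rm dom}(f)$ and
$$\alpha t_1+(1-\alpha)t_2 \geqslant \alpha f(x_1)+(1-\alpha)f(x_2)\geqslant f(\alpha x_1+(1-\alpha)x_2),$$
so $\alpha (x_1,t_1)+(1-\alpha)(x_2,t_2)\in {\rm epi}(f)$.

Conversely, if the epigraph is convex, then for arbitrary $x_1,x_2\in {\rm dom}(f)$ and arbitrary $\alpha \in [0,1]$ the points $(x_1,f(x_1)),(x_2,f(x_2))$ lie in ${\rm epi}(f)$, hence so does their convex combination. This means that $\alpha x_1+(1-\alpha)x_2\in {\rm dom}(f)$ and
$$\alpha f(x_1) + (1-\alpha)f(x_2) \geqslant f(\alpha x_1 + (1-\alpha) x_2).$$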

### Sublevel Set

Definition. The $t$-sublevel set of a function $f:\mathbb R^n\rightarrow \mathbb R$ is 

$$L_t(f) = \{x\in {\rm dom}(f): t\geqslant f(x)\}.$$

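Properties. $f$ is convex $\Rightarrow L_t(f)$ is convex for every $t$: if $x_1,x_2\in L_t(f)$ and $\alpha \in [0,1]$, then $f(\alpha x_1+(1-\alpha)x_2)\leqslant \alpha f(x_1)+(1-\alpha)f(x_2)\leqslant t$. The converse does not hold in general.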
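A quick experiment on a one-dimensional grid illustrates both directions of this property. This is only a sketch: the helper names, the grid and the two test functions are assumptions chosen here. On the real line a convex set is an interval, so a sublevel set with a gap certifies that the function is not convex, whereas $\sqrt{|x|}$ shows that interval-shaped sublevel sets do not imply convexity.

```python
import math


def sublevel_indices(f, t, grid):
    """Indices of the grid points x with f(x) <= t."""
    return [i for i, x in enumerate(grid) if f(x) <= t]


def is_interval(indices):
    """True if the indices are consecutive, i.e. form a discrete interval."""
    return all(j - i == 1 for i, j in zip(indices, indices[1:]))


def root_abs(x):
    """sqrt(|x|): every sublevel set is an interval, yet it is not convex."""
    return math.sqrt(abs(x))


def neg_square(x):
    """-x^2: concave, with sublevel sets that split into two rays."""
    return -x * x


# Grid on [-3, 3] with step 0.1.
grid = [k / 10 for k in range(-30, 31)]

root_abs_is_interval = is_interval(sublevel_indices(root_abs, 1.0, grid))
neg_square_is_interval = is_interval(sublevel_indices(neg_square, -1.0, grid))

# root_abs still violates the convexity inequality between 0 and 1.
root_abs_above_chord = root_abs(0.5) > 0.5 * root_abs(0.0) + 0.5 * root_abs(1.0)
```

Functions whose sublevel sets are all convex, such as `root_abs`, form a strictly larger class than the convex functions.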