# PS2-4 Constructing Kernels

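By Mercer's theorem, since $K_1$, $K_2$ and $K_3$ are valid kernels, their kernel matrices on any finite set of points are symmetric and PSD. Below, $K$ denotes both the kernel function and its kernel matrix on $\{x^{(1)}, x^{(2)}, \dots, x^{(m)}\}$.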

### (a) $K(x,z)=K_1(x,z)+K_2(x,z)$

$K(x,z)$ is a valid kernel.

$K$ is symmetric since
   $$K_{ij}=K(x^{(i)},x^{(j)}) = K_1(x^{(i)},x^{(j)}) + K_2(x^{(i)},x^{(j)}) = K_1(x^{(j)},x^{(i)}) + K_2(x^{(j)},x^{(i)}) = K_{ji}$$

$\forall u\in\mathbb{R}^m$, since $K=K_1+K_2$,
    $$u^TKu = u^TK_1u + u^TK_2u \geq 0$$

Therefore, $K$ is PSD for any $\{x^{(1)}, x^{(2)}, \dots, x^{(m)}\}$ and thus a valid kernel.

### (b) $K(x,z)=K_1(x,z) - K_2(x,z)$

$K(x,z)$ is not necessarily a valid kernel: $K=K_1-K_2$ is symmetric but need not be PSD.

Counter-example: take $K_2=2K_1$, which is a valid kernel by (c), so that $K=K_1-K_2=-K_1$. For any $u\in\mathbb R^m$ with $u^TK_1u>0$, we get $u^TKu=-u^TK_1u<0$, so $K$ is not PSD.
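As a numerical sanity check, the sketch below builds one failing instance of $K_1-K_2$. The data `X`, the choice of a linear kernel for $K_1$, and the choice $K_2=2K_1$ are illustrative assumptions, not part of the problem statement.

```python
import numpy as np

rng = np.random.default_rng(0)   # fixed seed, illustrative only
X = rng.normal(size=(5, 3))      # 5 hypothetical points in R^3

K1 = X @ X.T                     # Gram matrix of the linear kernel (PSD)
K2 = 2.0 * K1                    # positive multiple of a PSD matrix, also PSD
K = K1 - K2                      # equals -K1

u = np.ones(5)
quad = u @ K @ u                 # u^T K u = -||X^T u||^2
min_eig = np.linalg.eigvalsh(K).min()
print(quad, min_eig)             # both negative for this data
```

A single negative value of $u^TKu$ is enough to rule out PSD, so the eigenvalue computation is only a second confirmation.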

### (c) $K(x,z)=aK_1(x,z)$

$K(x,z)$ is a valid kernel.

$K=aK_1$ is symmetric since

$$K_{ij}=aK_{1,ij}=aK_{1,ji}=K_{ji}$$

$\forall u\in\mathbb R^m$, since $a$ is a positive real number and $K_1$ is PSD,

$$u^TKu=u^T(aK_1)u=a(u^TK_1u)\ge0$$

Therefore, $K$ is PSD and thus $K(x,z)$ is a valid kernel.

### (d) $K(x,z)=-aK_1(x,z)$

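$K(x,z)$ is not necessarily a valid kernel because, for $a>0$, $K=-aK_1$ is negative semidefinite (NSD) rather than PSD.

Counter-example: for any $K_1$ and $u\in\mathbb R^m$ with $u^TK_1u>0$, we get $u^TKu=-a\,u^TK_1u<0$.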

### (e) $K(x,z)=K_1(x,z)K_2(x,z)$

$K(x,z)$ is a valid kernel.

$K=K_1\odot K_2$ ($\odot$ denotes Hadamard product) is symmetric:

$$K_{ij}=K_1(x^{(i)},x^{(j)})K_2(x^{(i)},x^{(j)})=K_1(x^{(j)},x^{(i)})K_2(x^{(j)},x^{(i)})=K_{ji}$$

For any $u=[u_1, u_2, \dots, u_m]^T\in \mathbb{R}^m$,

\begin{align*}
u^TKu &= \begin{bmatrix}\sum_{i=1}^m u_iK_{i1},\sum_{i=1}^m u_iK_{i2},\dots,
\sum_{i=1}^m u_iK_{im}
\end{bmatrix}\begin{bmatrix}u_1\\ u_2\\ \vdots\\ u_m\end{bmatrix}\\
&=\sum_{j=1}^m\sum_{i=1}^m u_iK_{ij}u_j\\
&=\sum_{i=1}^m\sum_{j=1}^m u_iK_{1,ij}K_{2,ij}u_j\\
&=\sum_{i=1}^m\sum_{j=1}^m u_iK_1(x^{(i)},x^{(j)})K_2(x^{(i)},x^{(j)})u_j
\end{align*}

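Since $K_1$ and $K_2$ are valid kernels, there exist feature maps $\phi_1$ and $\phi_2$ such that $K_1(x^{(i)},x^{(j)})=\phi_1(x^{(i)})^T\phi_1(x^{(j)})$ and $K_2(x^{(i)},x^{(j)})=\phi_2(x^{(i)})^T\phi_2(x^{(j)})$. Taking both feature maps to be $n$-dimensional (padding the shorter one with zeros if necessary), we get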

\begin{align*}
u^TKu &= \sum_{i=1}^m\sum_{j=1}^m u_i\phi_1(x^{(i)})^T\phi_1(x^{(j)})\phi_2(x^{(i)})^T\phi_2(x^{(j)})u_j\\
&= \sum_{i=1}^m\sum_{j=1}^m u_i \sum_{k=1}^{n} (\phi_1(x^{(i)}))_k(\phi_1(x^{(j)}))_k\sum_{l=1}^{n} (\phi_2(x^{(i)}))_l(\phi_2(x^{(j)}))_l u_j\\
&= \sum_{i=1}^m\sum_{j=1}^m\sum_{k=1}^{n}\sum_{l=1}^{n} \left[u_i(\phi_1(x^{(i)}))_k(\phi_2(x^{(i)}))_l\right]\left[u_j(\phi_1(x^{(j)}))_k(\phi_2(x^{(j)}))_l\right]\\
&= \sum_{k=1}^{n}\sum_{l=1}^{n} \left[\sum_{i=1}^mu_i(\phi_1(x^{(i)}))_k(\phi_2(x^{(i)}))_l\right]\left[\sum_{j=1}^mu_j(\phi_1(x^{(j)}))_k(\phi_2(x^{(j)}))_l\right]\\
&= \sum_{k=1}^{n}\sum_{l=1}^{n}\left[\sum_{i=1}^mu_i(\phi_1(x^{(i)}))_k(\phi_2(x^{(i)}))_l\right]^2 \ge0
\end{align*}

Therefore, $K$ is PSD and thus a valid kernel.
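The derivation amounts to saying that the Hadamard product has the explicit feature map with entries $(\phi_1(x))_k(\phi_2(x))_l$. The sketch below checks this numerically under assumed, hypothetical feature maps `Phi1` and `Phi2` (here of different dimensions, 2 and 3, which the argument also covers).

```python
import numpy as np

rng = np.random.default_rng(1)                 # fixed seed, illustrative only
X = rng.normal(size=(6, 2))                    # 6 hypothetical points in R^2

Phi1 = X                                       # assumed phi_1(x) = x
Phi2 = np.hstack([np.ones((6, 1)), X ** 2])    # assumed phi_2(x) = [1, x_1^2, x_2^2]

K1 = Phi1 @ Phi1.T
K2 = Phi2 @ Phi2.T
K = K1 * K2                                    # Hadamard (elementwise) product

# Product feature map: one coordinate per pair (k, l),
# with value phi_1(x)_k * phi_2(x)_l.
Phi = np.einsum("ik,il->ikl", Phi1, Phi2).reshape(6, -1)
K_from_features = Phi @ Phi.T

u = rng.normal(size=6)
quad = u @ K @ u                               # u^T K u
sum_sq = np.sum((Phi.T @ u) ** 2)              # sum over (k, l) of squared inner sums
print(np.allclose(K, K_from_features), quad, sum_sq)
```

Here `sum_sq` is the last line of the derivation evaluated directly, so agreement with `quad` confirms the index manipulation for this data.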

### (f) $K(x,z) = f(x)f(z)$

$K(x,z)$ is a valid kernel.

$K$ is symmetric since

$$K_{ij}=f(x^{(i)})f(x^{(j)})=f(x^{(j)})f(x^{(i)})=K_{ji}$$

For any $u=[u_1, u_2, \dots, u_m]^T\in \mathbb{R}^m$,

\begin{align*}
u^TKu 
&=\sum_{j=1}^m\sum_{i=1}^m u_iK_{ij}u_j\\
&=\sum_{j=1}^m\sum_{i=1}^m u_if(x^{(i)})f(x^{(j)})u_j\\
&=\left[\sum_{i=1}^mu_if(x^{(i)})\right]^2 \ge0
\end{align*}

Therefore, $K$ is PSD and thus a valid kernel.
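In matrix terms, $K$ is the rank-one outer product of the vector of function values with itself. A minimal sketch, assuming a hypothetical real-valued function `f` and random data:

```python
import numpy as np

def f(x):
    """Hypothetical real-valued function; any f: R^n -> R would do."""
    return np.sin(x[0]) - x[1] ** 2

rng = np.random.default_rng(4)          # fixed seed, illustrative only
X = rng.normal(size=(5, 2))             # 5 hypothetical points in R^2

fvals = np.array([f(x) for x in X])     # vector [f(x^(1)), ..., f(x^(m))]
K = np.outer(fvals, fvals)              # K_ij = f(x^(i)) f(x^(j))

u = rng.normal(size=5)
quad = u @ K @ u                        # u^T K u
square = (u @ fvals) ** 2               # (sum_i u_i f(x^(i)))^2
print(quad, square)
```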

### (g) $K(x,z)=K_3(\phi(x),\phi(z))$

$K(x,z)$ is a valid kernel.

$K$ is symmetric since

$$K_{ij}=K_3(\phi(x^{(i)}), \phi(x^{(j)}))=K_3(\phi(x^{(j)}), \phi(x^{(i)}))=K_{ji}$$

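Since $K_3$ is a valid kernel on $\mathbb{R}^d$, there exists a feature map $\psi$ such that $K_3(a,b)=\psi(a)^T\psi(b)$ for all $a,b\in\mathbb{R}^d$. Note that $K_3(\phi(x),\phi(z))$ is in general not equal to $\phi(x)^T\phi(z)$. For any $u=[u_1, u_2, \dots, u_m]^T\in \mathbb{R}^m$, writing $n$ for the dimension of $\psi$,

\begin{align*}
u^TKu 
&=\sum_{j=1}^m\sum_{i=1}^m u_iK_{ij}u_j\\
&=\sum_{j=1}^m\sum_{i=1}^m u_iK_3(\phi(x^{(i)}), \phi(x^{(j)}))u_j\\
&=\sum_{j=1}^m\sum_{i=1}^m u_i\psi(\phi(x^{(i)}))^T\psi(\phi(x^{(j)}))u_j\\
&=\sum_{i=1}^m\sum_{j=1}^m u_i\left[\sum_{k=1}^n\psi(\phi(x^{(i)}))_k\psi(\phi(x^{(j)}))_k\right]u_j\\
&=\sum_{i=1}^m\sum_{j=1}^m\sum_{k=1}^n u_i\psi(\phi(x^{(i)}))_k\psi(\phi(x^{(j)}))_ku_j\\
&=\sum_{k=1}^n\left [\sum_{i=1}^mu_i\psi(\phi(x^{(i)}))_k\right]^2 \ge 0
\end{align*}

Equivalently, $K$ is the kernel matrix of $K_3$ on the points $\{\phi(x^{(1)}),\dots,\phi(x^{(m)})\}\subset\mathbb{R}^d$, which is PSD because $K_3$ is a valid kernel.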

Therefore, $K$ is PSD and thus a valid kernel.
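The kernel matrix of $K$ on the points $x^{(i)}$ is simply the kernel matrix of $K_3$ on the mapped points $\phi(x^{(i)})$. The sketch below illustrates this with an assumed Gaussian kernel for $K_3$ and a hypothetical nonlinear map `phi` from $\mathbb{R}^3$ to $\mathbb{R}^2$. Neither choice comes from the problem statement.

```python
import numpy as np

def k3(a, b):
    """Gaussian kernel on R^2 (an assumed choice of K_3)."""
    return np.exp(-0.5 * np.sum((a - b) ** 2))

def phi(x):
    """Hypothetical map R^3 -> R^2; it need not be linear or injective."""
    return np.array([np.sin(x[0]), x[1] * x[2]])

rng = np.random.default_rng(2)                  # fixed seed, illustrative only
X = rng.normal(size=(7, 3))                     # 7 hypothetical points in R^3

Z = np.array([phi(x) for x in X])               # mapped points in R^2
K = np.array([[k3(zi, zj) for zj in Z] for zi in Z])

min_eig = np.linalg.eigvalsh(K).min()           # smallest eigenvalue of K
print(min_eig)
```

No property of `phi` is used beyond its output lying in the domain of `k3`, which mirrors the proof.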

### (h) $K(x,z)=p(K_1(x,z))$

$K(x,z)$ is a valid kernel.

$K$ is symmetric since

$$K_{ij}=p(K_1(x^{(i)}, x^{(j)}))=p(K_1(x^{(j)}, x^{(i)}))=K_{ji}$$

For any $u=[u_1, u_2, \dots, u_m]^T\in \mathbb{R}^m$,

\begin{align*}
u^TKu 
&=\sum_{j=1}^m\sum_{i=1}^m u_iK_{ij}u_j\\
&=\sum_{i=1}^m\sum_{j=1}^m u_ip(K_1(x^{(i)}, x^{(j)}))u_j
\end{align*}

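Write $p(t)=\sum_{k=0}^r c_kt^k$ with $c_k\ge0$ and $c_r > 0$. For $k\ge1$, applying (e) repeatedly shows that $K_1(x, z)^k$ is a valid kernel; for $k=0$, $K_1(x,z)^0=1$ is a valid kernel by (f) with $f\equiv1$. Each of these is therefore PSD, and so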

\begin{align*}
u^TKu 
&=\sum_{i=1}^m\sum_{j=1}^m u_i\sum_{k=0}^rc_kK_1(x^{(i)}, x^{(j)})^ku_j\\
&=\sum_{k=0}^r c_k\sum_{i=1}^m\sum_{j=1}^mu_iK_1(x^{(i)}, x^{(j)})^ku_j\ge 0
\end{align*}

Therefore, $K$ is PSD and thus a valid kernel.
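A quick numerical check of this construction, assuming a linear kernel for $K_1$ and the hypothetical polynomial $p(t)=1+2t+3t^3$ (all coefficients nonnegative). Powers are taken elementwise, matching $K_1(x,z)^k$.

```python
import numpy as np

rng = np.random.default_rng(3)          # fixed seed, illustrative only
X = rng.normal(size=(6, 2))             # 6 hypothetical points in R^2
K1 = X @ X.T                            # Gram matrix of the linear kernel

coeffs = [1.0, 2.0, 0.0, 3.0]           # c_0..c_3 for p(t) = 1 + 2t + 3t^3

# Elementwise powers: (K1 ** k)_ij = K_1(x^(i), x^(j))^k.
# K1 ** 0 is the all-ones matrix, i.e. the constant kernel.
K = sum(c * K1 ** k for k, c in enumerate(coeffs))

min_eig = np.linalg.eigvalsh(K).min()   # smallest eigenvalue of p(K1)
print(min_eig)
```

The smallest eigenvalue should be nonnegative up to floating-point rounding, so a small tolerance is appropriate when asserting it.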