## 1.3 Kernel ridge regression

**Kernel ridge regression** (KRR) combines *Ridge regression and classification* with the *kernel trick*.

It thus learns a linear function in the space induced by the respective kernel and the data; for non-linear kernels this corresponds to a non-linear function in the original space.
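To make the "linear in kernel space, non-linear in the original space" point concrete, the following is a minimal sketch that reproduces a `KernelRidge` fit by hand. The toy sine data, the RBF kernel, and the values of `alpha` and `gamma` are illustrative assumptions, not part of the text above. It relies on the standard dual form of kernel ridge regression, in which the coefficients solve `(K + alpha * I) c = y`.

```python
import numpy as np
from sklearn.kernel_ridge import KernelRidge
from sklearn.metrics.pairwise import rbf_kernel

# Toy 1-D regression problem (illustrative data, assumed for this sketch).
rng = np.random.RandomState(0)
X = np.sort(5 * rng.rand(40, 1), axis=0)
y = np.sin(X).ravel() + 0.1 * rng.randn(40)

alpha, gamma = 1.0, 0.5  # assumed regularization strength and RBF width

# Library fit with a non-linear (RBF) kernel.
krr = KernelRidge(kernel="rbf", alpha=alpha, gamma=gamma).fit(X, y)

# Manual dual solution: solve (K + alpha * I) c = y on the training kernel matrix.
K = rbf_kernel(X, X, gamma=gamma)
dual_coef = np.linalg.solve(K + alpha * np.eye(len(X)), y)

# A prediction is a weighted sum of kernel values k(x, x_i): linear in those
# kernel features, but a non-linear function of the original input x.
X_new = np.linspace(0, 5, 7).reshape(-1, 1)
manual_pred = rbf_kernel(X_new, X, gamma=gamma) @ dual_coef
library_pred = krr.predict(X_new)

print(np.allclose(manual_pred, library_pred))
```

The manual coefficients should agree with `krr.dual_coef_` up to numerical precision, since `KernelRidge` fits no intercept term.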

The form of the model learned by `KernelRidge` is identical to support vector regression (`SVR`), but the two use different loss functions: KRR uses squared error loss, while SVR uses epsilon-insensitive loss. Fitting KRR can be done in closed form and is typically faster for medium-sized datasets. However, the learned model is not sparse, so KRR is slower than SVR at prediction time; SVR learns a sparse model when epsilon > 0.
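The sparsity difference can be inspected directly. The sketch below fits both estimators on the same toy data and counts how many training points each model needs at prediction time. The data and the hyperparameters (`alpha`, `C`, `gamma`, `epsilon`) are assumptions chosen only for illustration.

```python
import numpy as np
from sklearn.kernel_ridge import KernelRidge
from sklearn.svm import SVR

# Toy 1-D regression problem (illustrative data, assumed for this sketch).
rng = np.random.RandomState(0)
n_samples = 100
X = np.sort(5 * rng.rand(n_samples, 1), axis=0)
y = np.sin(X).ravel() + 0.1 * rng.randn(n_samples)

# Same RBF kernel for both models; the hyperparameter values are assumptions.
krr = KernelRidge(kernel="rbf", alpha=1.0, gamma=0.5).fit(X, y)
svr = SVR(kernel="rbf", C=1.0, gamma=0.5, epsilon=0.3).fit(X, y)

# KRR keeps one dual coefficient per training sample (dense model).
n_krr_terms = krr.dual_coef_.shape[0]

# SVR keeps only its support vectors; training points that lie strictly
# inside the epsilon-tube do not contribute to predictions.
n_svr_terms = svr.support_.shape[0]

print("KRR terms per prediction:", n_krr_terms)
print("SVR terms per prediction:", n_svr_terms)
```

How much sparser the SVR model is depends on `epsilon` and on the noise level; a wider tube generally leaves fewer support vectors.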

In [1]:
from sklearn.kernel_ridge import KernelRidge
import numpy as np

In [2]:
n_samples, n_features = 10, 5
rng = np.random.RandomState(0)

y = rng.randn(n_samples)
X = rng.randn(n_samples, n_features)

In [3]:
krr = KernelRidge(alpha=1.0)
krr.fit(X, y)

Out[3]:
KernelRidge(alpha=1.0)