This notebook provides basic examples of how to use gp3. Use NBViewer to view the notebook [here](https://nbviewer.jupyter.org/github/as4529/gp3/blob/master/examples/basic.ipynb?flush_cache=true).

In [1]:
from gp3.inference import MFSVI, FullSVI, Laplace
from gp3.likelihoods import Poisson
from gp3.utils import data as sim
from gp3.utils.transforms import softplus, inv_softplus
from gp3.kernels import RBF
from plotly.offline import download_plotlyjs, init_notebook_mode, plot, iplot
import plotly.graph_objs as go
from plotly import tools
from IPython.display import display
init_notebook_mode(connected=True)
import warnings
warnings.filterwarnings('ignore')
from tqdm import trange
from ipywidgets import IntProgress
import numpy as np

## Data Simulation

First, let's simulate some data on an equispaced grid. We will do this in 2D for the sake of visualization. We simulate from the following model:

$$ f \sim \mathcal{GP}(\mu(\cdot), K(\cdot, \cdot))$$
$$y_i \sim \text{Poisson}(f(x_i) + \epsilon) $$

where $\epsilon \sim \mathcal{N}(0, 1)$. You can ignore the `inv_softplus` link below. It is for kernel learning, which is still in progress.
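To make the generative model concrete, here is a minimal NumPy-only sketch that does not use gp3. It draws a GP sample on a small 2D grid and then draws Poisson counts from it. The helper `rbf_kernel`, the `exp` link (used here only to keep the Poisson rate positive), the noise scale `noise_sd`, and the jitter value are all assumptions made for illustration; they are not taken from `gp3.utils.data`.

```python
import numpy as np

def rbf_kernel(X1, X2, lengthscale=40.0, variance=1.0):
    """Squared-exponential kernel between two sets of points (hypothetical helper)."""
    sq_dist = (
        np.sum(X1 ** 2, axis=1)[:, None]
        + np.sum(X2 ** 2, axis=1)[None, :]
        - 2.0 * X1 @ X2.T
    )
    return variance * np.exp(-0.5 * np.maximum(sq_dist, 0.0) / lengthscale ** 2)

rng = np.random.default_rng(0)

# Equispaced 10 x 10 grid on [0, 100]^2 (smaller than the notebook's grid, to stay cheap).
grid_1d = np.linspace(0.0, 100.0, 10)
X_demo = np.stack(np.meshgrid(grid_1d, grid_1d, indexing="ij"), axis=-1).reshape(-1, 2)

# Draw f ~ GP(mu, K) through a Cholesky factor; jitter keeps the factorization stable.
K_demo = rbf_kernel(X_demo, X_demo) + 1e-6 * np.eye(len(X_demo))
chol = np.linalg.cholesky(K_demo)
f_demo = 5.0 + chol @ rng.standard_normal(len(X_demo))

# Poisson draws; the exp link and noise_sd are assumptions of this sketch.
noise_sd = 0.5
eps = noise_sd * rng.standard_normal(len(X_demo))
y_demo = rng.poisson(np.exp(f_demo + eps))
```

The dense Cholesky factorization costs cubic time in the number of grid points, so this sketch is only meant for small grids.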


In [2]:
X = sim.sim_X_equispaced(D = 2, N_dim = 30, lower=0, upper=100)
f = sim.sim_f(X, RBF(40., 1., 0.5), mu = 5.)
y = sim.poisson_draw(f, .5) 

In [3]:
trace_func = go.Scatter3d(x = X[:,0], y = X[:,1], z=f, mode = 'markers', marker=dict(size = 2,))
trace_draws = go.Scatter3d(x = X[:,0], y = X[:,1], z=y, mode = 'markers', marker=dict(size = 2,))
fig = tools.make_subplots(rows=1, cols=2, specs=[[{'is_3d': True}, {'is_3d': True}]])
fig.append_trace(trace_func, 1, 1)
fig.append_trace(trace_draws, 1, 2)
iplot(fig)




## Inference

Now, we run inference using both the SVI method and the Laplace method.

In [11]:
inf_svi = MFSVI(X, y, RBF(40., 1.), Poisson())
inf_lp = Laplace(X, y, RBF(40., 1.), Poisson())
inf_lp.run(20)
inf_svi.run(2000)

Objective: -3549090.02 | Step Size: 0.00:  45%|████▌     | 9/20 [00:00<00:00, 67.92it/s]


converged at 1963 iterations
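For intuition about what a Laplace approximation computes, the following is a minimal dense sketch of Newton iterations for the posterior mode under a Poisson likelihood. It assumes an `exp` link and uses plain `np.linalg.solve`; the function name `laplace_poisson_mode` is hypothetical, and this is not a description of how `gp3.inference.Laplace` is implemented.

```python
import numpy as np

def laplace_poisson_mode(K, y, mu=0.0, n_iter=50):
    """Newton iterations for the mode of p(f | y) with y_i ~ Poisson(exp(f_i)), f ~ N(mu, K).

    Returns the mode and the diagonal W of the negative log-likelihood Hessian at the mode.
    """
    n = len(y)
    g = np.zeros(n)  # g = f - mu, the deviation from the prior mean
    for _ in range(n_iter):
        rate = np.exp(mu + g)
        grad = y - rate  # gradient of the Poisson log-likelihood w.r.t. f
        W = rate         # negative Hessian diagonal (exp link assumption)
        # Newton update: g <- K (I + W K)^{-1} (W g + grad)
        g = K @ np.linalg.solve(np.eye(n) + W[:, None] * K, W * g + grad)
    return mu + g, np.exp(mu + g)

# Small 1D demo with synthetic data.
rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 20)
K = np.exp(-0.5 * (x[:, None] - x[None, :]) ** 2 / 0.3 ** 2)
y = rng.poisson(np.exp(2.0 + np.sin(2.0 * np.pi * x)))
mu = np.log(y.mean())
f_hat, W_hat = laplace_poisson_mode(K, y, mu=mu)
```

At the mode, the deviation from the prior mean satisfies `f_hat - mu = K @ (y - exp(f_hat))`, which gives a simple convergence check.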


Here, we make predictions and plot the inferred functions.

In [12]:
pred_svi = inf_svi.predict()
pred_lp = inf_lp.f_pred

trace_svi = go.Scatter3d(x = X[:,0], y = X[:,1], z=pred_svi, mode = 'markers', marker=dict(size = 2,), name = 'SVI posterior mean')
trace_lp = go.Scatter3d(x = X[:,0], y = X[:,1], z=pred_lp, mode = 'markers', marker=dict(size = 2,), name = 'Laplace posterior mean')
fig = tools.make_subplots(rows=1, cols=2, specs=[[{'is_3d': True}, {'is_3d': True}]])
fig.append_trace(trace_svi, 1, 1)
fig.append_trace(trace_lp, 1, 2)
iplot(fig)




To get the predictive variances, we do the following. For the SVI method, they are already calculated. For the Laplace method, we estimate the covariance using Gaussian perturbations, where the argument is the number of samples used for the estimate. See "Massively Scalable GPs" for more on this method.
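As a rough illustration of variance estimation by Gaussian perturbations, the sketch below draws samples whose covariance equals the Laplace posterior covariance $(K^{-1} + W)^{-1}$ and takes their empirical variance. It is a dense NumPy version under stated assumptions (a diagonal `W`, a small jitter, and the hypothetical name `laplace_variance_mc`); it is not the routine behind `inf_lp.variance`.

```python
import numpy as np

def laplace_variance_mc(K, W, n_samples, rng):
    """Monte Carlo estimate of diag((K^{-1} + W)^{-1}) from perturbation samples.

    K is the prior covariance matrix, W the diagonal of the negative
    log-likelihood Hessian, stored as a 1D array.
    """
    n = K.shape[0]
    chol = np.linalg.cholesky(K + 1e-6 * np.eye(n))  # jitter is an assumption
    e1 = rng.standard_normal((n, n_samples))
    e2 = rng.standard_normal((n, n_samples))
    # The right-hand side has covariance K + K W K.
    rhs = chol @ e1 + K @ (np.sqrt(W)[:, None] * e2)
    # z = (I + K W)^{-1} rhs then has covariance (K^{-1} + W)^{-1}.
    z = np.linalg.solve(np.eye(n) + K * W[None, :], rhs)
    return z.var(axis=1)

# Small 1D demo with synthetic inputs.
rng = np.random.default_rng(1)
x = np.linspace(0.0, 1.0, 15)
K_demo = np.exp(-0.5 * (x[:, None] - x[None, :]) ** 2 / 0.2 ** 2)
W_demo = np.exp(2.0 + np.sin(2.0 * np.pi * x))
est_var = laplace_variance_mc(K_demo, W_demo, n_samples=20000, rng=rng)
```

The estimate gets noisier as the number of samples shrinks, so a small sample count trades accuracy for speed.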

In [21]:
svi_variances = np.exp(inf_svi.q_S)
lp_variances = inf_lp.variance(20)

var_svi = go.Scatter3d(x = X[:,0], y = X[:,1], z=svi_variances, mode = 'markers', marker=dict(size = 2,), name = 'SVI posterior variances')
var_lp = go.Scatter3d(x = X[:,0], y = X[:,1], z=lp_variances, mode = 'markers', marker=dict(size = 2,), name = 'Laplace posterior variances')
fig = tools.make_subplots(rows=1, cols=2, specs=[[{'is_3d': True}, {'is_3d': True}]])
fig.append_trace(var_svi, 1, 1)
fig.append_trace(var_lp, 1, 2)
iplot(fig)




In [7]:
inf_lp.W

array([ 118.8829879 ,  141.53488848,  168.45334156,  200.14171352,
        236.80602759,  279.14042703,  325.88438505,  376.64128817,
        428.11458284,  479.92269062,  526.93328283,  566.87845158,
        601.47791725,  622.57525754,  637.71930155,  647.76928563,
        655.9980774 ,  666.18585804,  682.71982828,  710.92436245,
        755.4594081 ,  814.33733215,  896.0054411 , 1004.76076677,
       1131.87487599, 1263.45139073, 1385.47944096, 1481.39134504,
       1509.85284612, 1427.00452766,  136.8064426 ,  164.55682319,
        197.58599625,  236.50377093,  281.49894856,  333.40563467,
        390.5952989 ,  452.42527025,  514.62492873,  576.53317227,
        631.52998951,  676.55898195,  713.71567897,  732.79515344,
        743.38922561,  746.60231957,  746.69377366,  748.37360474,
        757.02051662,  778.90117101,  819.47498361,  876.7496108 ,
        961.32513684, 1079.29270948, 1223.67500269, 1381.99778192,
       1541.9552091 , 1686.57949947, 1766.5788344 , 1721.02955

For SVI, we can look at the values of the variational objective, likelihood, and KL terms as a function of iteration. The cell below plots the variational objective stored in `inf_svi.elbos`.

In [8]:
iplot([go.Scatter(x = np.array(range(len(inf_svi.elbos))), y = inf_svi.elbos)])
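To show how the objective splits into a likelihood term and a KL term, here is a minimal sketch of a mean-field ELBO for a Poisson likelihood. It assumes an `exp` link (which gives the expected log-likelihood in closed form), a diagonal Gaussian variational posterior parameterized by log variances, and a dense prior covariance; the name `mean_field_elbo` is hypothetical and gp3 may compute these terms differently, for example by sampling.

```python
import numpy as np
from math import lgamma

def mean_field_elbo(y, q_mu, q_log_var, K, prior_mean=0.0):
    """Return (elbo, expected_log_lik, kl) for q(f) = N(q_mu, diag(exp(q_log_var)))."""
    s = np.exp(q_log_var)
    n = len(y)
    # E_q[log Poisson(y | exp(f))] = y * m - exp(m + s / 2) - log(y!)
    log_fact = np.array([lgamma(v + 1.0) for v in y])
    exp_ll = np.sum(y * q_mu - np.exp(q_mu + 0.5 * s) - log_fact)
    # KL(N(q_mu, diag(s)) || N(prior_mean, K))
    K_inv = np.linalg.inv(K)
    diff = q_mu - prior_mean
    _, logdet_K = np.linalg.slogdet(K)
    kl = 0.5 * (
        np.sum(np.diag(K_inv) * s)
        + diff @ K_inv @ diff
        - n
        + logdet_K
        - np.sum(q_log_var)
    )
    return exp_ll - kl, exp_ll, kl

# Demo: when q equals the prior, the KL term vanishes.
y_demo = np.array([0.0, 1.0, 2.0])
K_demo = 2.0 * np.eye(3)
elbo, exp_ll, kl = mean_field_elbo(
    y_demo, q_mu=np.zeros(3), q_log_var=np.log(2.0) * np.ones(3), K=K_demo
)
```

Tracking the two terms separately can help tell whether the objective is changing because of data fit or because of the pull toward the prior.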

## Partial Grids

Here, we take a random sample of 30 percent of the above grid to "observe" (the fraction passed to `rand_partial_grid` below is 0.3).

In [9]:
X_part, y_part = sim.rand_partial_grid(X, y, 0.3)
X_full, y_full, obs_idx, imag_idx = sim.fill_grid(X_part, y_part)

color = np.zeros(X_full.shape[0])
color[obs_idx] = 1.0
trace_partial_obs = go.Scatter3d(x = X_full[obs_idx, 0], y = X_full[obs_idx, 1],
                                 z= y[obs_idx], mode = 'markers', marker=dict(size = 2))
iplot([trace_partial_obs])

We can run inference on partial grids by passing in the locations of the full grid, the indices of the observed points, and the values of y at the observed locations.
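The idea of keeping the full grid while restricting the likelihood to observed points can be sketched in plain NumPy as follows. The index names mirror `obs_idx` and `imag_idx` above, but the sampling scheme, the `exp` link, and the helper `poisson_loglik_observed` are assumptions made for illustration, not gp3 internals.

```python
import numpy as np

rng = np.random.default_rng(0)

n_grid = 30 * 30   # number of points on the full grid
frac_obs = 0.3     # fraction of grid points treated as observed

# Observed indices are sampled without replacement; the rest are unobserved.
n_obs = int(round(frac_obs * n_grid))
obs_idx = np.sort(rng.choice(n_grid, size=n_obs, replace=False))
imag_idx = np.setdiff1d(np.arange(n_grid), obs_idx)

def poisson_loglik_observed(f_full, y_obs, obs_idx):
    """Poisson log-likelihood (exp link, dropping log(y!)) over observed points only."""
    f_obs = f_full[obs_idx]
    return np.sum(y_obs * f_obs - np.exp(f_obs))

# The latent function lives on the full grid; only observed entries enter the likelihood.
f_full = np.full(n_grid, np.log(5.0))
y_obs = rng.poisson(5.0, size=n_obs)
ll = poisson_loglik_observed(f_full, y_obs, obs_idx)
```

Because the prior is still defined over every grid point, values at the unobserved locations are informed only through the kernel.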

In [10]:
inf_svi = MFSVI(X, y_part, RBF(40., 1., 1.), Poisson(), obs_idx = obs_idx)
inf_lp = Laplace(X, y_part, RBF(40., 1., 0.1), Poisson(), obs_idx = obs_idx)
inf_svi.run(5000, n_samples = 1)
inf_lp.run(10)


We can make predictions of the entire function below.

In [None]:
pred_svi = inf_svi.predict()
pred_lp = inf_lp.f_pred

trace_svi = go.Scatter3d(x = X[:,0], y = X[:,1], z=pred_svi, mode = 'markers', marker=dict(size = 2, color = color), name = "SVI partial grid posterior mean")
trace_lp = go.Scatter3d(x = X[:,0], y = X[:,1], z=pred_lp, mode = 'markers', marker=dict(size = 2, color = color), name = "Laplace partial grid posterior mean")
fig = tools.make_subplots(rows=1, cols=2, specs=[[{'is_3d': True}, {'is_3d': True}]])
fig.append_trace(trace_svi, 1, 1)
fig.append_trace(trace_lp, 1, 2)
iplot(fig)