SUSYUSTC/KAGPR

Gaussian Process Regression (GPR) and Kernel-Addition GPR (KAGPR)

Installation

python setup.py install

Dependencies

  • cupy: pip install cupy

How to use

Create data and kernel

import numpy as np
import GPR
X = np.random.random((100, 10))
Y = np.sum(np.sin(X), axis=1)[:, None]
noise = 1e-4
k = GPR.kern.RBF()
k.set_lengthscale(1.0)
k.set_variance(10.0)
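For reference, the RBF (squared-exponential) kernel configured above is, under one common convention, $k(x, x') = v \exp(-\lVert x - x'\rVert^2 / (2 l^2))$ with lengthscale $l$ and variance $v$. A plain-NumPy sketch of that convention (the exact parametrization inside GPR.kern.RBF may differ):

```python
import numpy as np

def rbf_kernel(X1, X2, lengthscale=1.0, variance=10.0):
    """Squared-exponential kernel under a common convention
    (may differ from GPR.kern.RBF):
    k(x, x') = variance * exp(-|x - x'|^2 / (2 * lengthscale^2))."""
    # Pairwise squared distances via the expansion |a-b|^2 = |a|^2 + |b|^2 - 2 a.b
    sq_dists = (np.sum(X1**2, axis=1)[:, None]
                + np.sum(X2**2, axis=1)[None, :]
                - 2.0 * X1 @ X2.T)
    return variance * np.exp(-sq_dists / (2.0 * lengthscale**2))

X = np.random.random((5, 10))
K = rbf_kernel(X, X)
# The diagonal of K equals the variance, since k(x, x) = variance.
```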

Train a full GP model, then save it

gp = GPR.GP(X, Y, k, noise, GPU=False)
gp.optimize(messages=False)
# predict
Y_pred = gp.predict(X)
# save
gp.save("model.npz")
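Under the hood, an exact GP like the one above solves a regularized linear system. A minimal NumPy sketch of the textbook zero-mean posterior-mean equation, $\mu_* = K_* (K + \sigma^2 I)^{-1} Y$, computed with a Cholesky factorization (for illustration only; the library's internals may differ):

```python
import numpy as np

def gp_predict(K_train, K_cross, Y, noise):
    """Textbook GP posterior mean: K_cross @ (K_train + noise*I)^{-1} @ Y,
    solved via Cholesky for numerical stability."""
    n = K_train.shape[0]
    L = np.linalg.cholesky(K_train + noise * np.eye(n))
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, Y))
    return K_cross @ alpha

# Toy check: with small noise, predicting at the training points
# should roughly reproduce the training targets.
rng = np.random.default_rng(0)
X = rng.random((50, 3))
Y = np.sin(X).sum(axis=1)[:, None]
sq = np.sum(X**2, 1)[:, None] + np.sum(X**2, 1)[None, :] - 2 * X @ X.T
K = np.exp(-sq / 2.0)           # unit-lengthscale, unit-variance RBF
Y_pred = gp_predict(K, K, Y, 1e-6)
```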

Train a BBMM model, then save it

bbmm = GPR.BBMM(k, nGPU=1)
bbmm.initialize(X, noise)
bbmm.set_preconditioner(50, nGPU=0)
bbmm.solve_iter(Y)
# predict
Y_pred = bbmm.predict(X)
# save
bbmm.save("model.npz")
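BBMM avoids an explicit Cholesky factorization by solving $(K + \sigma^2 I)\,\alpha = Y$ iteratively with (preconditioned) conjugate gradients, touching the kernel matrix only through matrix-vector products (see refs. 1 and 2). A minimal, unpreconditioned CG sketch in NumPy, for illustration only:

```python
import numpy as np

def conjugate_gradient(matvec, b, tol=1e-10, max_iter=1000):
    """Solve A x = b for symmetric positive-definite A, where A is
    available only through matrix-vector products (as in BBMM)."""
    x = np.zeros_like(b)
    r = b - matvec(x)        # initial residual
    p = r.copy()             # initial search direction
    rs = r @ r
    for _ in range(max_iter):
        Ap = matvec(p)
        step = rs / (p @ Ap)
        x += step * p
        r -= step * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x

# Toy SPD system standing in for K + noise*I.
rng = np.random.default_rng(1)
M = rng.random((30, 30))
A = M @ M.T + np.eye(30)
b = rng.random(30)
x = conjugate_gradient(lambda v: A @ v, b)
```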

A KA-GPR example: learn $$y = \sum^{N}_{i=1} \sin \left(\sum^{d}_{j=1} x_{ij}\right)$$ with varying $N$ and fixed $d=5$. Training on data with $5\leq N < 10$, the model can extrapolate to data with $10\leq N < 20$.

import numpy as np
import GPR


def func(x):
    return np.sum(np.sin(np.sum(x, axis=1)))


d = 5
# 20 test data with N from 10-20
sizes_test = np.random.randint(10, 20, size=(20, ))
X_test = [np.random.random((n, d)) for n in sizes_test]
Y_test = np.array([func(x) for x in X_test])[:, None]
rel_errs = []
for N in [5, 10, 20, 30, 50]:
    # training data with N from 5-10
    sizes = np.random.randint(5, 10, size=(N, ))
    X_train = [np.random.random((n, d)) for n in sizes]
    Y_train = np.array([func(x) for x in X_train])[:, None]
    kernel = GPR.kern.RBF()
    kernel_summation = GPR.kern.Summation(kernel)
    gp = GPR.GP(X_train, Y_train, kernel_summation, 1e-4)
    gp.optimize(messages=False)
    pred_test = gp.predict(X_test)
    rel_err = np.mean(np.abs((pred_test - Y_test) / Y_test))
    rel_errs.append(rel_err)
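The key ingredient above is the summation kernel: the covariance between two variable-size inputs $A$ and $B$ is the base kernel summed over all element pairs, $K_{\mathrm{sum}}(A, B) = \sum_{a\in A}\sum_{b\in B} k(a, b)$. A NumPy sketch of that construction (the actual implementation and caching inside GPR.kern.Summation may differ):

```python
import numpy as np

def base_rbf(x1, x2, lengthscale=1.0, variance=1.0):
    # Plain RBF between two points (one common convention).
    d2 = np.sum((x1 - x2) ** 2)
    return variance * np.exp(-d2 / (2.0 * lengthscale ** 2))

def summation_kernel(A, B):
    """K_sum(A, B): the base kernel summed over all element pairs.
    A and B are arrays of shape (n_A, d) and (n_B, d); n_A and n_B
    may differ, which is what lets KA-GPR handle variable-size inputs."""
    return sum(base_rbf(a, b) for a in A for b in B)

# Two samples with different numbers of rows but the same d = 5.
rng = np.random.default_rng(2)
A = rng.random((7, 5))
B = rng.random((9, 5))
K_AB = summation_kernel(A, B)
```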

Plot the relation between N and rel_err:
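A sketch of how such a figure could be produced, using arbitrary placeholder error values (purely hypothetical, to illustrate the plot layout only, not actual results):

```python
import matplotlib
matplotlib.use("Agg")  # headless backend, no display needed
import matplotlib.pyplot as plt

Ns = [5, 10, 20, 30, 50]
rel_errs = [0.2, 0.1, 0.05, 0.03, 0.02]  # hypothetical placeholder values

fig, ax = plt.subplots()
ax.loglog(Ns, rel_errs, "o-")
ax.set_xlabel("N (number of training samples)")
ax.set_ylabel("mean relative error")
fig.savefig("rel_err_vs_N.png")
```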

The kernel calculations are cached by default. If you want to experiment with a kernel yourself, you may want to call YOUR_KERNEL.clear_cache() or disable caching with YOUR_KERNEL.set_cache_state(False).

References

  1. Wang, K. A., Pleiss, G., Gardner, J. R., Tyree, S., Weinberger, K. Q., & Wilson, A. G. (2019). Exact Gaussian processes on a million data points. arXiv preprint arXiv:1903.08114. Accepted by NeurIPS 2019.
  2. Gardner, J. R., Pleiss, G., Bindel, D., Weinberger, K. Q., & Wilson, A. G. (2018). GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. arXiv preprint arXiv:1809.11165. Accepted by NeurIPS 2018.
  3. Sun, J., Cheng, L., & Miller III, T. F. (2021). Molecular energy learning using alternative blackbox matrix-matrix multiplication algorithm for exact Gaussian process. arXiv preprint arXiv:2109.09817.
  4. Sun, J., Cheng, L., & Miller III, T. F. (2022). Molecular dipole moment learning via rotationally equivariant Gaussian process regression with derivatives in molecular-orbital-based machine learning. arXiv preprint arXiv:2205.15510.
