
Add KernelLinearOperator, deprecate KeOpsLinearOperator #62

Merged: 25 commits merged into main from linops_keops on Jun 2, 2023

Conversation

@gpleiss (Member) commented May 5, 2023

KeOpsLinearOperator does not correctly backpropagate gradients if the covar_func closes over parameters.

KernelLinearOperator corrects for this, and is set up to replace LazyEvaluatedKernelTensor in GPyTorch down the line.

[Addresses issues in #2296]
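A minimal sketch of the new pattern (assuming the API introduced in this PR; exact hyperparameter shape conventions may differ): hyperparameters are passed to KernelLinearOperator as kwargs instead of being closed over by covar_func, so they enter the operator's representation and receive gradients.

```python
import torch
from linear_operator.operators import KernelLinearOperator

def covar_func(x1, x2, lengthscale):
    # RBF kernel written as a function of its inputs AND its hyperparameters;
    # nothing is closed over from an enclosing scope.
    x1_ = x1 / lengthscale
    x2_ = x2 / lengthscale
    sq_dist = (x1_.unsqueeze(-2) - x2_.unsqueeze(-3)).pow(2).sum(dim=-1)
    return sq_dist.mul(-0.5).exp()

x1, x2 = torch.randn(100, 3), torch.randn(50, 3)
lengthscale = torch.nn.Parameter(torch.ones(1, 3))

# lengthscale is part of the operator's representation, so it receives
# gradients through _bilinear_derivative:
covar = KernelLinearOperator(x1, x2, lengthscale=lengthscale, covar_func=covar_func)
```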

@gpleiss gpleiss requested a review from Balandat May 5, 2023 21:12
@gpleiss (Member, Author) commented May 10, 2023

@jacobrgardner @Balandat can one of you take a look at this?

@m-julian commented

This implementation works perfectly when using one KeOps kernel, but there are a few problems when combining kernels. This is not specifically a problem with KernelLinearOperator, but with the way things are currently implemented in gpytorch/linear_operator. Hopefully some of these comments are useful for structuring the code.

I tried to multiply two KeOps kernels (which makes a ProductKernel). These are the problems I found:

  1. ProductKernel calls to_dense if x1 and x2 are not identical and returns a DenseLinearOperator, since MulLinearOperator is currently only used for square symmetric matrices. KeOps can be used for large matrices where x1 is not equal to x2, so calling to_dense could give memory errors. Instead, this should return some object that represents the multiplication of the two symbolic matrices without actually computing them.
  2. If x1 is equal to x2, then you end up with a MulLinearOperator, which calls root_decomposition on both KernelLinearOperators. Then there are two issues:
    2.1. You might run out of memory when doing the root decomposition if the matrix is very large (because of KeOps).
    2.2. If the matrix fits into memory, it will be approximated, since Lanczos is the default algorithm. This is fixed by setting gpytorch.settings.fast_computations(covar_root_decomposition=False) (see the sketch below), but it took me a while to figure out why I was getting different results when using KeOps kernels. If using a very large KeOps symbolic matrix, I don't think root_decomposition should be called at all; instead, the symbolic matrix should be used until some reduction operation is performed. (So should MulLinearOperator be used with KernelLinearOperator instances in the first place?)
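For reference, the workaround from 2.2 as a minimal sketch (model and train_x are hypothetical placeholders for whatever GP model and inputs you are using):

```python
import gpytorch

# Disable the (approximate) Lanczos-based fast root decomposition so that
# root decompositions are computed exactly instead:
with gpytorch.settings.fast_computations(covar_root_decomposition=False):
    output = model(train_x)  # hypothetical model / inputs
```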

I was also wondering: could KernelLinearOperator subclass DenseLinearOperator, since the KernelLinearOperator class is used to represent a full covariance matrix (just computed on the fly when a reduction operation is applied)? That way, all the checks for isinstance(self, DenseLinearOperator) (for example, this one) would also work for KernelLinearOperator. This would get around the problem with MulLinearOperator, but it might cause other problems, so I am not sure if it is reasonable.

@gpleiss (Member, Author) commented May 24, 2023

@m-julian there is unfortunately no way to do a symbolic element-wise multiplication of two kernels. KeOps (and LinearOperator) can keep things symbolic by using matrix-multiplication-based algorithms (CG/Lanczos) for solves and log determinants. Unfortunately, matrix multiplication does not distribute over the element-wise product. The current Lanczos solution comes out of the Product Kernel Interpolation paper (it was the best solution we could come up with).

Therefore, I don't know of a better way to handle the product of two matrices than what we currently do in code.
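To see why there is no symbolic shortcut, here is a quick numerical check (a minimal sketch): the matvecs A @ v and B @ v that symbolic operators provide cannot be recombined into the matvec of the element-wise product.

```python
import torch

torch.manual_seed(0)
A, B = torch.randn(4, 4), torch.randn(4, 4)
v = torch.randn(4)

# Symbolic operators give cheap A @ v and B @ v, but no identity assembles
# (A * B) @ v from those two matvecs alone:
print(torch.allclose((A * B) @ v, (A @ v) * (B @ v)))  # False
```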

I was also wondering: could KernelLinearOperator subclass DenseLinearOperator, since the KernelLinearOperator class is used to represent a full covariance matrix (just computed on the fly when a reduction operation is applied)?

This would probably be a can of worms. DenseLinearOperator assumes that the matrix is represented by a tensor.
And most of our LinearOperator classes represent full covariance matrices that are computed on the fly when reduction operations are called.
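For illustration, a minimal sketch of that assumption (as I understand the linear_operator internals; the tensor attribute is how DenseLinearOperator stores its matrix):

```python
import torch
from linear_operator.operators import DenseLinearOperator

# DenseLinearOperator wraps a matrix that already exists in memory:
dense_op = DenseLinearOperator(torch.randn(5, 5))
print(dense_op.tensor.shape)  # torch.Size([5, 5])

# A KernelLinearOperator has no such stored tensor; its entries only
# materialize when something forces evaluation, e.g. .to_dense().
```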

@Turakar (Contributor) commented May 25, 2023

Regarding the product discussion: for a general implementation that would be part of GPyTorch, I also do not know of a better approach than what @gpleiss pointed out. However, from a user's perspective, you, @m-julian, could of course write a custom KeOps kernel that fuses the two. Most likely, this would even be faster than two separate kernels, since you only need one trip from global memory to processor registers on the GPU.
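For concreteness, a sketch of such a fused kernel with pykeops (hypothetical fused_rbf_product; for two RBF kernels the algebra happens to collapse into a single exponential, while other kernel pairs would simply multiply the two symbolic expressions):

```python
import torch
from pykeops.torch import LazyTensor

def fused_rbf_product(x1, x2, ls1: float, ls2: float):
    # One symbolic KeOps formula for the product of two RBF kernels:
    # a single reduction pass over the data instead of two kernel trips.
    x1_i = LazyTensor(x1[:, None, :])  # (N, 1, D), symbolic
    x2_j = LazyTensor(x2[None, :, :])  # (1, M, D), symbolic
    sq_dist = ((x1_i - x2_j) ** 2).sum(-1)
    # exp(-d^2 / (2 ls1^2)) * exp(-d^2 / (2 ls2^2)) fuses into one exp:
    return (-0.5 * sq_dist * (1.0 / ls1**2 + 1.0 / ls2**2)).exp()

# The result stays symbolic; reductions (e.g. matvecs) run in one pass:
K = fused_rbf_product(torch.randn(1000, 3), torch.randn(500, 3), 1.0, 2.0)
out = K @ torch.randn(500, 1)  # (1000, 1), computed via a KeOps reduction
```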

@Balandat (Collaborator) left a comment

I can't say I checked all the indexing logic in exhaustive detail, but hopefully we have some test coverage for that?

Review comments on linear_operator/operators/_linear_operator.py and linear_operator/operators/kernel_linear_operator.py (all resolved).
gpleiss and others added 20 commits June 2, 2023 18:58
KeOpsLinearOperator does not correctly backpropagate gradients if the
covar_func closes over parameters.

KernelLinearOperator corrects for this, and is set up to replace
LazyEvaluatedKernelTensor in GPyTorch down the line.
Previously, only positional args were added to the LinearOperator
representation, and so only positional args would receive gradients from
_bilinear_derivative.

This commit also adds Tensor/LinearOperator kwargs to the
representation, and so kwarg Tensor/LinearOperators will also receive
gradients.
Co-authored-by: Max Balandat <Balandat@users.noreply.github.com>
gpleiss and others added 4 commits June 2, 2023 18:58
@gpleiss gpleiss merged commit 7affaf3 into main Jun 2, 2023
@gpleiss gpleiss deleted the linops_keops branch June 2, 2023 19:07
Balandat added a commit to Balandat/linear_operator that referenced this pull request Jun 3, 2023
cornellius-gp#62 introduced an inconsistency in the `linear_ops` property of `KroneckerProductLinearOperator` (by making it a `list` rather than a `tuple` in some cases). This broke some downstream usage that relied on it being a tuple.
"are incompatible for a Kronecker product."
)

if len(batch_broadcast_shape): # Otherwise all linear_ops are non-batch, and we don't need to expand
A Collaborator commented:

This introduced an inconsistency in the type of linear_ops (a list rather than a tuple), which resulted in some downstream breakages in botorch. Fixed in #66.
