
[Feature Request] Input Transformations #1652

Open
Balandat opened this issue Jun 11, 2021 · 5 comments

@Balandat
Collaborator

🚀 Feature Request

Essentially, upstream BoTorch's input transforms (https://github.com/pytorch/botorch/blob/master/botorch/models/transforms/input.py) to GPyTorch. This would allow adding either fixed or learnable transformations that are automatically applied when training models.

Motivation

This makes it possible to do things like normalize inputs, but also to combine the GP with a learnable transformation, which simplifies model setup. We currently have this in BoTorch, where we essentially apply the transform in the forward methods.

Additional context

We recently worked on input transformations that can change the shape of the input (pytorch/botorch#819), which caused some headaches around how to best set this up without a ton of boilerplate code. We were hoping to do this in __call__ rather than in the forward method, but this collides with some of GPyTorch's assumptions. Moving this functionality upstream into gpytorch would allow us to solve these challenges more organically.

Describe alternatives you've considered
You could keep doing this as we do it right now, but that requires adding boilerplate transformation code to every implementation of forward.

@Balandat
Collaborator Author

cc @sdaulton @saitcakmak

@saitcakmak
Collaborator

saitcakmak commented Jun 11, 2021

I'll just add concrete examples of what we tried and where each approach fails.

Apply one-to-many transforms in model.forward(): for a batch x q x d-dim input, these return a batch x new_q x m-dim output. This fails at the reshape operation here:

train_labels = train_labels.reshape(*train_labels.shape[: -len(self.train_shape)], self._train_shape.numel())
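
To make the shape problem concrete, here is a minimal, hypothetical one-to-many transform (the RepeatPointsTransform name and its k-copies behavior are made up for this sketch, not BoTorch API) showing how the input grows while the targets do not:

import torch

class RepeatPointsTransform(torch.nn.Module):
    """Hypothetical one-to-many transform: maps each input point to k
    perturbed copies, so a ... x q x d input becomes a ... x (k * q) x d output."""

    def __init__(self, k=3, scale=0.01):
        super().__init__()
        self.k = k
        self.scale = scale

    def forward(self, X):
        # repeat each point k times along the q dimension, then jitter
        X_rep = X.repeat_interleave(self.k, dim=-2)
        return X_rep + self.scale * torch.randn_like(X_rep)

train_X = torch.randn(4, 3)                    # n x d training inputs
print(RepeatPointsTransform()(train_X).shape)  # torch.Size([12, 3])
# if this runs inside model.forward, the train inputs grow from n to k * n
# points while train_labels keeps n entries, so the reshape above no longer
# lines up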

Define a wrapper around ExactGP.__call__ and apply the transforms before calling ExactGP.__call__: this leads to

raise RuntimeError("You must train on the training inputs!")

We could maybe get around this by wrapping the call in a with debug(False) block, but that also breaks tests, so it is probably not a good idea.
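
For reference, a rough sketch of that wrapper idea (the TransformedExactGP class and its input_transform attribute are assumptions for illustration, not an existing API); gpytorch.settings.debug(False) silences the training-inputs check but also disables other validation:

import gpytorch

class TransformedExactGP(gpytorch.models.ExactGP):
    """Hypothetical sketch: transform inputs before delegating to ExactGP.__call__."""

    def __init__(self, train_x, train_y, likelihood, input_transform):
        super().__init__(train_x, train_y, likelihood)
        self.mean_module = gpytorch.means.ConstantMean()
        self.covar_module = gpytorch.kernels.ScaleKernel(gpytorch.kernels.RBFKernel())
        self.input_transform = input_transform

    def forward(self, x):
        return gpytorch.distributions.MultivariateNormal(
            self.mean_module(x), self.covar_module(x)
        )

    def __call__(self, *args, **kwargs):
        args = [self.input_transform(a) for a in args]
        # debug(False) bypasses the "You must train on the training inputs!"
        # check, but it also turns off other sanity checks (and breaks tests)
        with gpytorch.settings.debug(False):
            return super().__call__(*args, **kwargs)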

The current proposal at pytorch/botorch#819 is to apply the transforms in model.forward at training time and in the posterior call in eval mode to get around these issues (and to do this in each model's forward and each posterior). This is admittedly not a good design, so upstreaming the input transforms would make it much cleaner.
Edit: Just realized that this actually breaks things. So, we don't have a proper way of applying one-to-many transforms currently.
Edit 2: With some changes to input transforms (storing train inputs as transformed), applying them in both model.forward and posterior now works.

@gpleiss
Member

gpleiss commented Jun 14, 2021

@Balandat and @saitcakmak, is this the interface you have in mind?

class MyGP(gpytorch.models.ExactGP):
    def __init__(self, train_x, train_y, likelihood):
        # ...
        self.transform = InputTransform(...)

    def forward(self, x):
        x = self.transform(x)
        mean = self.mean_module(x)
        covar = self.covar_module(x)
        return gpytorch.distributions.MultivariateNormal(mean, covar)

@saitcakmak
Collaborator

So, that is the current (pre pytorch/botorch#819) interface on the BoTorch end, and there are some issues with it. It does not play nicely with transforms that change the shape of x. We also want certain transforms to apply to all inputs, including train_inputs, and others to apply only to test inputs. This interface doesn't work well for the latter, since forward is called after the test inputs have been concatenated with train_inputs. Because of these issues, I think forward is not the right place to apply the transforms.

I think ExactGP.__call__ (similarly ApproximateGP.__call__) is a better place to apply the transforms. I am thinking something along the lines of

    def __call__(self, *args, **kwargs):
        train_inputs = list(self.transform_inputs(self.train_inputs, train=True)) if self.train_inputs is not None else []
        inputs = [self.transform_inputs(i.unsqueeze(-1) if i.ndimension() == 1 else i, train=False) for i in args]
        ...

where we can handle the train argument along with the transform_on_train attribute of the input transform. self.transform_inputs(...) here is a simple wrapper that calls the input transform if it exists, similar to
https://github.com/pytorch/botorch/blob/f930c01c7a15ea4db951b5346f3331422fcd04e4/botorch/models/model.py#L151-L168
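
A minimal sketch of what that wrapper could look like on the GPyTorch side (the input_transform attribute and the transform_on_train flag are assumptions mirroring the BoTorch transforms, not existing GPyTorch API):

def transform_inputs(self, inputs, train=False):
    # hypothetical helper: apply self.input_transform if the model has one,
    # skipping training inputs when the transform opts out via transform_on_train
    input_transform = getattr(self, "input_transform", None)
    if input_transform is None:
        return inputs
    if train and not getattr(input_transform, "transform_on_train", True):
        return inputs
    if isinstance(inputs, (list, tuple)):
        return type(inputs)(input_transform(x) for x in inputs)
    return input_transform(inputs)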

@wjmaddox
Collaborator

wjmaddox commented Aug 10, 2021

After some discussion with @saitcakmak, it looks like placing the input transforms in ApproximateGP.__call__ for variational GPs might be the only feasible option.

If it's done in the forward pass, then the inducing points would be estimated on the raw, untransformed scale because of the call here.

I have a version of the current BoTorch transforms + variational GPs in pytorch/botorch#895, but it only works for fixed, non-learnable transforms (e.g. not learnable input warping), because it forcibly sets the "training inputs", and thus the model inputs, to be on the transformed scale, at least when using model fitting utilities such as botorch...fit_gpytorch_torch.
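
For context, a rough sketch of applying a fixed transform in ApproximateGP.__call__ (the TransformedApproximateGP class and its input_transform attribute are assumptions for illustration); because the query inputs are transformed before they reach the variational strategy, the inducing points end up being learned on the transformed scale:

import gpytorch

class TransformedApproximateGP(gpytorch.models.ApproximateGP):
    """Hypothetical sketch of a variational GP that transforms inputs in __call__."""

    def __init__(self, inducing_points, input_transform):
        variational_distribution = gpytorch.variational.CholeskyVariationalDistribution(
            inducing_points.size(-2)
        )
        variational_strategy = gpytorch.variational.VariationalStrategy(
            self, inducing_points, variational_distribution, learn_inducing_locations=True
        )
        super().__init__(variational_strategy)
        self.mean_module = gpytorch.means.ConstantMean()
        self.covar_module = gpytorch.kernels.ScaleKernel(gpytorch.kernels.RBFKernel())
        self.input_transform = input_transform

    def forward(self, x):
        return gpytorch.distributions.MultivariateNormal(
            self.mean_module(x), self.covar_module(x)
        )

    def __call__(self, x, **kwargs):
        # transform only the query inputs; the inducing points are free
        # parameters of the variational strategy and live on the transformed scale
        return super().__call__(self.input_transform(x), **kwargs)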
