Gradient of inv and logdet UpperTriangular matrix is not UpperTriangular #431

aterenin · 2019-12-19T01:56:35Z

For sparse Gaussian processes, one often wants to parametrize the variational distribution using its Cholesky factors, which form an upper-triangular matrix and are trainable.

However, when taking gradients, Zygote instead computes the gradient with respect to all entries in the UpperTriangular matrix, including ones that are set to zero by virtue of it being UpperTriangular. This is related to #163.

For the moment, a workaround suitable for training models is to simply call UpperTriangular(x) on the upper-triangular matrix x before using it.

The text was updated successfully, but these errors were encountered:

sdewaele · 2019-12-21T01:10:52Z

Perhaps this is more related to #402. As an alternative you could insert the projection function ℙ in your code, see #402 (comment). Then, the adjoint will be an UpperTriangular. You would have to add the definition for the projection of the UpperTriangular:

ℙ(::Type{T},X) where {T<:UpperTriangular} = UpperTriangular(X)

mcabbott mentioned this issue May 8, 2021

Use clamptype mechanism to project onto cotangent space #965

Closed

mcabbott mentioned this issue Aug 19, 2021

Use ProjectTo in broadcasting & gradient #1044

Merged

mcabbott closed this as completed in #1044 Sep 22, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gradient of inv and logdet UpperTriangular matrix is not UpperTriangular #431

Gradient of inv and logdet UpperTriangular matrix is not UpperTriangular #431

aterenin commented Dec 19, 2019

sdewaele commented Dec 21, 2019 •

edited

Loading

Gradient of inv and logdet UpperTriangular matrix is not UpperTriangular #431

Gradient of inv and logdet UpperTriangular matrix is not UpperTriangular #431

Comments

aterenin commented Dec 19, 2019

sdewaele commented Dec 21, 2019 • edited Loading

sdewaele commented Dec 21, 2019 •

edited

Loading